3151-3200 of 10000 results (105ms)
2024-05-14 ยง
11:33 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:1030938|etcd: Ignore parsercache clusters in externalLoads (T362786)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
11:30 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:1030938|etcd: Ignore parsercache clusters in externalLoads (T362786)]] [production]
11:28 <marostegui@cumin1002> dbctl commit (dc=all): 'db2152 (re)pooling @ 25%: After schema change', diff saved to https://phabricator.wikimedia.org/P62381 and previous config saved to /var/cache/conftool/dbconfig/20240514-112807-root.json [production]
11:18 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:1031177|rdbms: Fix picking the database from the LB domain (T364827)]] (duration: 15m 47s) [production]
11:13 <marostegui@cumin1002> dbctl commit (dc=all): 'db2152 (re)pooling @ 10%: After schema change', diff saved to https://phabricator.wikimedia.org/P62379 and previous config saved to /var/cache/conftool/dbconfig/20240514-111302-root.json [production]
11:07 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2154 (T364299)', diff saved to https://phabricator.wikimedia.org/P62378 and previous config saved to /var/cache/conftool/dbconfig/20240514-110704-marostegui.json [production]
11:06 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2154.codfw.wmnet with reason: Maintenance [production]
11:06 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2154.codfw.wmnet with reason: Maintenance [production]
11:05 <ladsgroup@deploy1002> ladsgroup: Continuing with sync [production]
11:05 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:1031177|rdbms: Fix picking the database from the LB domain (T364827)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
11:02 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:1031177|rdbms: Fix picking the database from the LB domain (T364827)]] [production]
10:17 <jayme@cumin1002> conftool action : set/weight=10; selector: name=kubestagemaster2004.codfw.wmnet [production]
10:17 <jayme@cumin1002> conftool action : set/pooled=yes; selector: name=kubestagemaster2004.codfw.wmnet [production]
10:12 <hashar@deploy1002> rebuilt and synchronized wikiversions files: Revert "group0 wikis to 1.43.0-wmf.5" - T361399 [production]
09:53 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host serpens.wikimedia.org [production]
09:53 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on 6 hosts with reason: Checking RO status [production]
09:52 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 0:05:00 on 6 hosts with reason: Checking RO status [production]
09:52 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on 6 hosts with reason: Primary switchover es4 T364451 [production]
09:51 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 0:05:00 on 6 hosts with reason: Primary switchover es4 T364451 [production]
09:50 <marostegui@deploy1002> Finished scap: Backport for [[gerrit:1030918|db-production.php: Make es4 and es5 RO (T364447)]] (duration: 15m 28s) [production]
09:50 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host serpens.wikimedia.org [production]
09:50 <jayme@cumin1002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1005.eqiad.wmnet to plain [production]
09:49 <jayme@cumin1002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1005.eqiad.wmnet to plain [production]
09:49 <jayme@cumin1002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1004.eqiad.wmnet to plain [production]
09:48 <jayme@cumin1002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1004.eqiad.wmnet to plain [production]
09:48 <jayme@cumin1002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of kubestagemaster1003.eqiad.wmnet to plain [production]
09:47 <jayme@cumin1002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1003.eqiad.wmnet to plain [production]
09:47 <jayme@cumin1002> END (FAIL) - Cookbook sre.ganeti.changedisk (exit_code=99) for changing disk type of kubestagemaster1003.eqiad.wmnet to plain [production]
09:47 <jayme@cumin1002> START - Cookbook sre.ganeti.changedisk for changing disk type of kubestagemaster1003.eqiad.wmnet to plain [production]
09:45 <jayme@cumin1002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host kubestagemaster1005.eqiad.wmnet [production]
09:45 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1005.eqiad.wmnet with OS bullseye [production]
09:37 <marostegui@deploy1002> marostegui: Continuing with sync [production]
09:37 <marostegui@deploy1002> marostegui: Backport for [[gerrit:1030918|db-production.php: Make es4 and es5 RO (T364447)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
09:35 <marostegui@deploy1002> Started scap: Backport for [[gerrit:1030918|db-production.php: Make es4 and es5 RO (T364447)]] [production]
09:31 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1005.eqiad.wmnet with reason: host reimage [production]
09:27 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1005.eqiad.wmnet with reason: host reimage [production]
09:24 <hashar@deploy1002> rebuilt and synchronized wikiversions files: group0 wikis to 1.43.0-wmf.5 refs T361399 [production]
09:20 <jayme@cumin1002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host kubestagemaster1004.eqiad.wmnet [production]
09:20 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1004.eqiad.wmnet with OS bullseye [production]
09:18 <jayme@cumin1002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host kubestagemaster1003.eqiad.wmnet [production]
09:18 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1003.eqiad.wmnet with OS bullseye [production]
09:14 <jayme@cumin1002> START - Cookbook sre.hosts.reimage for host kubestagemaster1005.eqiad.wmnet with OS bullseye [production]
09:06 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1004.eqiad.wmnet with reason: host reimage [production]
09:04 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1003.eqiad.wmnet with reason: host reimage [production]
09:02 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1004.eqiad.wmnet with reason: host reimage [production]
09:02 <jayme@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1003.eqiad.wmnet with reason: host reimage [production]
08:58 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on serpens.wikimedia.org with reason: OS update [production]
08:58 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on serpens.wikimedia.org with reason: OS update [production]
08:57 <jayme@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM kubestagemaster1005.eqiad.wmnet - jayme@cumin1002" [production]
08:54 <jayme@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM kubestagemaster1005.eqiad.wmnet - jayme@cumin1002" [production]