2024-05-21
09:33 <hnowlan> decommissioning 6 appservers in advance of reimaging to k8s control nodes [production]
09:33 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db1160.eqiad.wmnet [production]
09:32 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P62768 and previous config saved to /var/cache/conftool/dbconfig/20240521-093238-root.json [production]
09:31 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-launcher1002.eqiad.wmnet with reason: host reimage [production]
09:29 <taavi@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudnet1005.eqiad.wmnet with OS bookworm [production]
09:28 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-launcher1002.eqiad.wmnet with reason: host reimage [production]
09:17 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P62767 and previous config saved to /var/cache/conftool/dbconfig/20240521-091732-root.json [production]
09:16 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-launcher1002.eqiad.wmnet with OS bullseye [production]
09:13 <taavi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet1005.eqiad.wmnet with reason: host reimage [production]
09:10 <taavi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet1005.eqiad.wmnet with reason: host reimage [production]
09:09 <tgr|away> UTC morning deploys done [production]
09:06 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host db1160.eqiad.wmnet [production]
09:05 <tgr@deploy1002> Finished scap: Backport for [[gerrit:1034173|Temporarily restore $wgCentralAuthDatabase (T348486)]] (duration: 17m 45s) [production]
09:02 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P62766 and previous config saved to /var/cache/conftool/dbconfig/20240521-090224-root.json [production]
09:02 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db2219.codfw.wmnet [production]
08:55 <taavi@cumin1002> START - Cookbook sre.hosts.reimage for host cloudnet1005.eqiad.wmnet with OS bookworm [production]
08:51 <tgr@deploy1002> tgr: Continuing with sync [production]
08:51 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host db2219.codfw.wmnet [production]
08:50 <tgr@deploy1002> tgr: Backport for [[gerrit:1034173|Temporarily restore $wgCentralAuthDatabase (T348486)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:49 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db2210.codfw.wmnet [production]
08:48 <moritzm> installing edk2 security updates [production]
08:47 <tgr@deploy1002> Started scap: Backport for [[gerrit:1034173|Temporarily restore $wgCentralAuthDatabase (T348486)]] [production]
08:47 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P62765 and previous config saved to /var/cache/conftool/dbconfig/20240521-084718-root.json [production]
08:43 <moritzm> installing ghostscript security updates [production]
08:41 <matthiasmullie> UTC morning backports done [production]
08:41 <mlitn@deploy1002> Finished scap: Backport for [[gerrit:1032888|Allow async (job queue based) chunked upload on all wikis (T364644)]] (duration: 17m 32s) [production]
08:40 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) sretest2002.wikimedia.org on all recursors [production]
08:40 <cmooney@cumin1002> START - Cookbook sre.dns.wipe-cache sretest2002.wikimedia.org on all recursors [production]
08:38 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:38 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add dns for sretest2002 - cmooney@cumin1002" [production]
08:38 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host db2210.codfw.wmnet [production]
08:37 <effie> enable puppet on all mw* baremetal hosts [production]
08:37 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add dns for sretest2002 - cmooney@cumin1002" [production]
08:35 <marostegui> Deploy schema change on s8 eqiad, this will cause a few hours of replication lag in s8 clouddb replicas T364299 [production]
08:34 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
08:34 <cmooney@cumin1002> END (ERROR) - Cookbook sre.dns.netbox (exit_code=97) [production]
08:34 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
08:33 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Long schema change [production]
08:32 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1167.eqiad.wmnet with reason: Long schema change [production]
08:32 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1016,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Long schema change [production]
08:32 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on clouddb[1016,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Long schema change [production]
08:32 <marostegui@cumin1002> dbctl commit (dc=all): 'db1221 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P62764 and previous config saved to /var/cache/conftool/dbconfig/20240521-083212-root.json [production]
08:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1167 for a schema change', diff saved to https://phabricator.wikimedia.org/P62763 and previous config saved to /var/cache/conftool/dbconfig/20240521-083053-root.json [production]
08:28 <marostegui@cumin1002> dbctl commit (dc=all): 'db1237 (re)pooling @ 100%: After reimage', diff saved to https://phabricator.wikimedia.org/P62762 and previous config saved to /var/cache/conftool/dbconfig/20240521-082842-root.json [production]
08:27 <mlitn@deploy1002> mlitn and bawolff: Continuing with sync [production]
08:26 <mlitn@deploy1002> mlitn and bawolff: Backport for [[gerrit:1032888|Allow async (job queue based) chunked upload on all wikis (T364644)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:23 <mlitn@deploy1002> Started scap: Backport for [[gerrit:1032888|Allow async (job queue based) chunked upload on all wikis (T364644)]] [production]
08:22 <mlitn@deploy1002> Finished scap: Backport for [[gerrit:1032824|Remove complicated synchronization of caption/description inputs (T365119)]] (duration: 17m 40s) [production]
08:19 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
08:19 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]