3651-3700 of 10000 results (82ms)
2023-11-20 ยง
20:18 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P53644 and previous config saved to /var/cache/conftool/dbconfig/20231120-201831-arnaudb.json [production]
20:10 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1014.eqiad.wmnet with OS bullseye [production]
20:08 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1013.eqiad.wmnet with OS bullseye [production]
20:03 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P53643 and previous config saved to /var/cache/conftool/dbconfig/20231120-200324-arnaudb.json [production]
19:59 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for acmechief2001.codfw.wmnet [production]
19:59 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for acmechief2001.codfw.wmnet [production]
19:50 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1013.eqiad.wmnet with reason: host reimage [production]
19:48 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53642 and previous config saved to /var/cache/conftool/dbconfig/20231120-194818-arnaudb.json [production]
19:48 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1013.eqiad.wmnet with reason: host reimage [production]
19:36 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1013.eqiad.wmnet with OS bullseye [production]
19:21 <sukhe> pool cp4045.ulsfo.wmnet post reboot and puppet 7 upgrade [production]
19:16 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4045.ulsfo.wmnet [production]
19:05 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp4045.ulsfo.wmnet [production]
19:04 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
19:03 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host acmechief2001.codfw.wmnet with OS bookworm [production]
19:03 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
19:02 <sukhe> depool cp4045 for reboot [production]
18:59 <cmooney@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
18:59 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
18:59 <cmooney@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary [production]
18:59 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary [production]
18:57 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cp4045.ulsfo.wmnet [production]
18:48 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host cp4045.ulsfo.wmnet [production]
18:44 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage [production]
18:41 <brett@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage [production]
18:39 <bking@cumin1001> START - Cookbook sre.wdqs.data-reload [production]
18:38 <bking@cumin1001> END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97) [production]
18:37 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
18:37 <ebernhardson@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
18:27 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host acmechief2001.codfw.wmnet with OS bookworm [production]
18:25 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wikidough [production]
18:18 <volans> installed spicerack v8.1.0 on the cumin hosts [production]
18:13 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: wikidough [production]
18:08 <ebernhardson> start test backfill of 4 days of itwiki and frwiki edits to relforge from cirrus updater [production]
18:06 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
18:06 <ebernhardson@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:49 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudelastic1010.wikimedia.org with OS bullseye [production]
17:47 <jforrester@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
17:39 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ganeti1035.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:39 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host ganeti1035.mgmt.eqiad.wmnet with reboot policy FORCED [production]
17:37 <jforrester@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
17:32 <volans> uploaded spicerack_8.1.0 to apt.wikimedia.org bullseye-wikimedia [production]
17:30 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: durum [production]
17:28 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudelastic1010.wikimedia.org with reason: host reimage [production]
17:25 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudelastic1010.wikimedia.org with reason: host reimage [production]
17:18 <hashar> Restarting Gerrit # T351658 [production]
17:15 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-role for role: durum [production]
17:10 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cloudelastic1010.wikimedia.org with OS bullseye [production]
16:56 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:975806|Set pagelinks migration to read new in testwiki, fawikiquote, cebwiki (T351237)]] (duration: 10m 06s) [production]
16:51 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]