2201-2250 of 10000 results (84ms)
2023-11-17 §
08:45 <jmm@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on crm2001.codfw.wmnet with reason: host reimage [production]
08:42 <jmm@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on crm2001.codfw.wmnet with reason: host reimage [production]
08:30 <jelto@cumin1001> START - Cookbook sre.gitlab.reboot-runner rolling reboot on A:gitlab-runner [production]
08:25 <jmm@cumin1001> START - Cookbook sre.hosts.reimage for host crm2001.codfw.wmnet with OS bookworm [production]
08:15 <jmm@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM crm2001.codfw.wmnet - jmm@cumin1001" [production]
08:14 <jmm@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM crm2001.codfw.wmnet - jmm@cumin1001" [production]
08:14 <jmm@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) crm2001.codfw.wmnet on all recursors [production]
08:14 <jmm@cumin1001> START - Cookbook sre.dns.wipe-cache crm2001.codfw.wmnet on all recursors [production]
08:14 <jmm@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:14 <jmm@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM crm2001.codfw.wmnet - jmm@cumin1001" [production]
08:13 <jmm@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM crm2001.codfw.wmnet - jmm@cumin1001" [production]
08:10 <jmm@cumin1001> START - Cookbook sre.dns.netbox [production]
08:09 <jmm@cumin1001> START - Cookbook sre.ganeti.makevm for new host crm2001.codfw.wmnet [production]
08:06 <jmm@cumin1001> END (PASS) - Cookbook sre.ganeti.resource-report (exit_code=0) [production]
08:05 <jmm@cumin1001> START - Cookbook sre.ganeti.resource-report [production]
07:57 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host debmonitor2003.codfw.wmnet [production]
07:49 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host debmonitor2003.codfw.wmnet [production]
07:34 <jmm@cumin1001> END (PASS) - Cookbook sre.ganeti.resource-report (exit_code=0) [production]
07:34 <jmm@cumin1001> START - Cookbook sre.ganeti.resource-report [production]
07:34 <jmm@cumin1001> END (PASS) - Cookbook sre.ganeti.resource-report (exit_code=0) [production]
07:34 <jmm@cumin1001> START - Cookbook sre.ganeti.resource-report [production]
07:30 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2133.codfw.wmnet with OS bookworm [production]
07:16 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2133.codfw.wmnet with reason: host reimage [production]
07:13 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db2133.codfw.wmnet with reason: host reimage [production]
06:55 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db2133.codfw.wmnet with OS bookworm [production]
06:48 <mabualruz@deploy2002> Backport cancelled. [production]
04:45 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1147 (T348183)', diff saved to https://phabricator.wikimedia.org/P53535 and previous config saved to /var/cache/conftool/dbconfig/20231117-044504-arnaudb.json [production]
04:44 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1147.eqiad.wmnet with reason: Maintenance [production]
04:44 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1147.eqiad.wmnet with reason: Maintenance [production]
04:44 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53534 and previous config saved to /var/cache/conftool/dbconfig/20231117-044443-arnaudb.json [production]
04:29 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314', diff saved to https://phabricator.wikimedia.org/P53533 and previous config saved to /var/cache/conftool/dbconfig/20231117-042937-arnaudb.json [production]
04:14 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314', diff saved to https://phabricator.wikimedia.org/P53532 and previous config saved to /var/cache/conftool/dbconfig/20231117-041430-arnaudb.json [production]
03:59 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53531 and previous config saved to /var/cache/conftool/dbconfig/20231117-035924-arnaudb.json [production]
01:19 <cstone> payments-wiki upgraded from eae2f35e to 56790715 [production]
01:12 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1158.eqiad.wmnet with OS bullseye [production]
01:00 <jclark@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['an-worker1158'] [production]
00:55 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1158'] [production]
00:50 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1157.eqiad.wmnet with OS bullseye [production]
00:48 <ejegg> fundraising civiproxy upgraded from c000fc1e to 6625c844 [production]
00:39 <jclark@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['an-worker1157'] [production]
00:32 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1157'] [production]
2023-11-16 §
23:52 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-worker1158'] [production]
23:51 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1158.eqiad.wmnet with OS bullseye [production]
23:46 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1158'] [production]
23:43 <samtar@deploy2002> Finished scap: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] (duration: 07m 31s) [production]
23:37 <samtar@deploy2002> samtar: Continuing with sync [production]
23:37 <samtar@deploy2002> samtar: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
23:35 <samtar@deploy2002> Started scap: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] [production]
23:34 <samtar@deploy2002> Sync cancelled. [production]
23:33 <topranks> Change VRRP IP for public1-a-codfw vlan on codfw CRs T347191 [production]