201-250 of 10000 results (87ms)
2023-11-17 §
07:16 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2133.codfw.wmnet with reason: host reimage [production]
07:13 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db2133.codfw.wmnet with reason: host reimage [production]
06:55 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db2133.codfw.wmnet with OS bookworm [production]
06:48 <mabualruz@deploy2002> Backport cancelled. [production]
04:45 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1147 (T348183)', diff saved to https://phabricator.wikimedia.org/P53535 and previous config saved to /var/cache/conftool/dbconfig/20231117-044504-arnaudb.json [production]
04:44 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1147.eqiad.wmnet with reason: Maintenance [production]
04:44 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1147.eqiad.wmnet with reason: Maintenance [production]
04:44 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53534 and previous config saved to /var/cache/conftool/dbconfig/20231117-044443-arnaudb.json [production]
04:29 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314', diff saved to https://phabricator.wikimedia.org/P53533 and previous config saved to /var/cache/conftool/dbconfig/20231117-042937-arnaudb.json [production]
04:14 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314', diff saved to https://phabricator.wikimedia.org/P53532 and previous config saved to /var/cache/conftool/dbconfig/20231117-041430-arnaudb.json [production]
03:59 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53531 and previous config saved to /var/cache/conftool/dbconfig/20231117-035924-arnaudb.json [production]
01:19 <cstone> payments-wiki upgraded from eae2f35e to 56790715 [production]
01:12 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1158.eqiad.wmnet with OS bullseye [production]
01:00 <jclark@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['an-worker1158'] [production]
00:55 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1158'] [production]
00:50 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1157.eqiad.wmnet with OS bullseye [production]
00:48 <ejegg> fundraising civiproxy upgraded from c000fc1e to 6625c844 [production]
00:39 <jclark@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['an-worker1157'] [production]
00:32 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1157'] [production]
2023-11-16 §
23:52 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-worker1158'] [production]
23:51 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1158.eqiad.wmnet with OS bullseye [production]
23:46 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1158'] [production]
23:43 <samtar@deploy2002> Finished scap: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] (duration: 07m 31s) [production]
23:37 <samtar@deploy2002> samtar: Continuing with sync [production]
23:37 <samtar@deploy2002> samtar: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
23:35 <samtar@deploy2002> Started scap: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] [production]
23:34 <samtar@deploy2002> Sync cancelled. [production]
23:33 <topranks> Change VRRP IP for public1-a-codfw vlan on codfw CRs T347191 [production]
23:30 <topranks> Add gateway IP for public1-a-codfw Vlan to ssw in codfw T347191 [production]
23:30 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1157.eqiad.wmnet with OS bullseye [production]
23:29 <samtar@deploy2002> jdlrobson and samtar: Backport for [[gerrit:975097|Disable drawer temporarily while erroring (T351362)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
23:29 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1157.eqiad.wmnet with OS bullseye [production]
23:28 <samtar@deploy2002> Started scap: Backport for [[gerrit:975097|Disable drawer temporarily while erroring (T351362)]] [production]
23:28 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
23:28 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 2001 entries - cmooney@cumin1001" [production]
23:27 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 2001 entries - cmooney@cumin1001" [production]
23:25 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
23:10 <cmooney@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on cr[1-2]-codfw,cr[1-2]-codfw IPv6 with reason: Move public1-a-codfw vlan GW from codfw CR routers to ssw [production]
23:10 <cmooney@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on cr[1-2]-codfw,cr[1-2]-codfw IPv6 with reason: Move public1-a-codfw vlan GW from codfw CR routers to ssw [production]
22:39 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53529 and previous config saved to /var/cache/conftool/dbconfig/20231116-223915-arnaudb.json [production]
22:39 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance [production]
22:38 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance [production]
22:36 <mutante> disabled puppet on miscweb*, netmon* and phab* hosts, deploying gerrit:974285, confirming noop [production]
22:31 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
22:31 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 1117 entries - cmooney@cumin1001" [production]
22:30 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 1117 entries - cmooney@cumin1001" [production]
22:29 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
22:09 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1157.eqiad.wmnet with OS bullseye [production]
22:00 <dr0ptp4kt@deploy2002> Finished scap: Backport for [[gerrit:975028|Make the feed gracefully handle long snippets and urls (T347732 T351463)]] (duration: 09m 50s) [production]
21:59 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-worker1157'] [production]