3801-3850 of 10000 results (89ms)
2024-03-04 ยง
18:26 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host parse1024.eqiad.wmnet with OS bullseye [production]
18:26 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1023.eqiad.wmnet with OS bullseye [production]
18:26 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['es2035'] [production]
18:24 <jhancock@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['es2035'] [production]
18:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P58390 and previous config saved to /var/cache/conftool/dbconfig/20240304-181705-marostegui.json [production]
18:16 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host es2037.mgmt.codfw.wmnet with reboot policy FORCED [production]
18:12 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1234 (T357189)', diff saved to https://phabricator.wikimedia.org/P58389 and previous config saved to /var/cache/conftool/dbconfig/20240304-181219-arnaudb.json [production]
18:09 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1012.eqiad.wmnet with OS bullseye [production]
18:08 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1023.eqiad.wmnet with reason: host reimage [production]
18:07 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db1234 (T357189)', diff saved to https://phabricator.wikimedia.org/P58388 and previous config saved to /var/cache/conftool/dbconfig/20240304-180717-arnaudb.json [production]
18:07 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1234.eqiad.wmnet with reason: Maintenance [production]
18:07 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1234.eqiad.wmnet with reason: Maintenance [production]
18:06 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1232 (T357189)', diff saved to https://phabricator.wikimedia.org/P58387 and previous config saved to /var/cache/conftool/dbconfig/20240304-180655-arnaudb.json [production]
18:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P58386 and previous config saved to /var/cache/conftool/dbconfig/20240304-180159-marostegui.json [production]
17:59 <jforrester@deploy2002> Finished scap: Backport for [[gerrit:1007885|ZObjectStore::updateZObjectAsSystemUser: Also give wf-staff rights]] (duration: 38m 44s) [production]
17:52 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host parse1023.eqiad.wmnet with OS bullseye [production]
17:51 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1232', diff saved to https://phabricator.wikimedia.org/P58385 and previous config saved to /var/cache/conftool/dbconfig/20240304-175148-arnaudb.json [production]
17:51 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1012.eqiad.wmnet with reason: host reimage [production]
17:49 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1022.eqiad.wmnet with OS bullseye [production]
17:49 <akosiaris@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1012.eqiad.wmnet with reason: host reimage [production]
17:46 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1235 (T354015)', diff saved to https://phabricator.wikimedia.org/P58384 and previous config saved to /var/cache/conftool/dbconfig/20240304-174653-marostegui.json [production]
17:36 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1232', diff saved to https://phabricator.wikimedia.org/P58383 and previous config saved to /var/cache/conftool/dbconfig/20240304-173642-arnaudb.json [production]
17:36 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host parse1012.eqiad.wmnet with OS bullseye [production]
17:34 <jforrester@deploy2002> jforrester: Continuing with sync [production]
17:34 <jforrester@deploy2002> jforrester: Backport for [[gerrit:1007885|ZObjectStore::updateZObjectAsSystemUser: Also give wf-staff rights]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
17:31 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1022.eqiad.wmnet with reason: host reimage [production]
17:29 <akosiaris@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1022.eqiad.wmnet with reason: host reimage [production]
17:21 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1232 (T357189)', diff saved to https://phabricator.wikimedia.org/P58382 and previous config saved to /var/cache/conftool/dbconfig/20240304-172136-arnaudb.json [production]
17:21 <jforrester@deploy2002> Started scap: Backport for [[gerrit:1007885|ZObjectStore::updateZObjectAsSystemUser: Also give wf-staff rights]] [production]
17:20 <jdrewniak@deploy2002> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1008501| Bumping portals to master (T128546)]] (duration: 45m 54s) [production]
17:19 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2171 (re)pooling @ 100%: Maintenance done', diff saved to https://phabricator.wikimedia.org/P58381 and previous config saved to /var/cache/conftool/dbconfig/20240304-171913-arnaudb.json [production]
17:16 <akosiaris@cumin1002> START - Cookbook sre.hosts.reimage for host parse1022.eqiad.wmnet with OS bullseye [production]
17:15 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db1232 (T357189)', diff saved to https://phabricator.wikimedia.org/P58380 and previous config saved to /var/cache/conftool/dbconfig/20240304-171543-arnaudb.json [production]
17:15 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1232.eqiad.wmnet with reason: Maintenance [production]
17:15 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1232.eqiad.wmnet with reason: Maintenance [production]
17:15 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1228 (T357189)', diff saved to https://phabricator.wikimedia.org/P58379 and previous config saved to /var/cache/conftool/dbconfig/20240304-171521-arnaudb.json [production]
17:14 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host parse1021.eqiad.wmnet with OS bullseye [production]
17:11 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host dbprov2006.mgmt.codfw.wmnet with reboot policy FORCED [production]
17:11 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host dbprov2005.mgmt.codfw.wmnet with reboot policy FORCED [production]
17:09 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host es2037.mgmt.codfw.wmnet with reboot policy FORCED [production]
17:04 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2171 (re)pooling @ 75%: Maintenance done', diff saved to https://phabricator.wikimedia.org/P58378 and previous config saved to /var/cache/conftool/dbconfig/20240304-170408-arnaudb.json [production]
17:03 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2124 (re)pooling @ 100%: Maintenance done', diff saved to https://phabricator.wikimedia.org/P58377 and previous config saved to /var/cache/conftool/dbconfig/20240304-170320-arnaudb.json [production]
17:00 <arnaudb@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1228', diff saved to https://phabricator.wikimedia.org/P58376 and previous config saved to /var/cache/conftool/dbconfig/20240304-170015-arnaudb.json [production]
16:59 <sukhe> sudo cumin -b1 -s120 "A:dns-rec" "run-puppet-agent --enable 'merging CR 1007918'": finish rolling out confd state management: T347054 [production]
16:57 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=dns2004.wikimedia.org,service=authdns-ns1 [production]
16:56 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=dns2004.wikimedia.org,service=authdns-ns1 [production]
16:56 <akosiaris@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on parse1021.eqiad.wmnet with reason: host reimage [production]
16:53 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=dns6001.wikimedia.org,service=authdns-ns2 [production]
16:53 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=dns4003.wikimedia.org,service=authdns-ns2 [production]
16:53 <akosiaris@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on parse1021.eqiad.wmnet with reason: host reimage [production]