151-200 of 10000 results (90ms)
2024-10-22 ยง
14:57 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2084.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
14:56 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2227 (re)pooling @ 100%: T377718', diff saved to https://phabricator.wikimedia.org/P70512 and previous config saved to /var/cache/conftool/dbconfig/20241022-145653-arnaudb.json [production]
14:53 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host ms-be2084.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
14:52 <hashar@deploy2002> Finished deploy [gerrit/gerrit@30691f2]: Update patch demo to recognize both legacy and new URLs - T374954 (duration: 00m 10s) [production]
14:52 <hashar@deploy2002> Started deploy [gerrit/gerrit@30691f2]: Update patch demo to recognize both legacy and new URLs - T374954 [production]
14:50 <jmm@cumin2002> END (PASS) - Cookbook sre.netbox.restart-reboot (exit_code=0) rolling reboot on A:netbox [production]
14:49 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P70511 and previous config saved to /var/cache/conftool/dbconfig/20241022-144902-ladsgroup.json [production]
14:41 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2227 (re)pooling @ 75%: T377718', diff saved to https://phabricator.wikimedia.org/P70510 and previous config saved to /var/cache/conftool/dbconfig/20241022-144148-arnaudb.json [production]
14:40 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox.discovery.wmnet. on all recursors [production]
14:40 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache netbox.discovery.wmnet. on all recursors [production]
14:37 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:37 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ms-be2084 to codfw - jhancock@cumin2002" [production]
14:37 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ms-be2084 to codfw - jhancock@cumin2002" [production]
14:36 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2149 (re)pooling @ 100%: post clone', diff saved to https://phabricator.wikimedia.org/P70509 and previous config saved to /var/cache/conftool/dbconfig/20241022-143628-arnaudb.json [production]
14:34 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox.discovery.wmnet. on all recursors [production]
14:34 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache netbox.discovery.wmnet. on all recursors [production]
14:34 <jmm@cumin2002> START - Cookbook sre.netbox.restart-reboot rolling reboot on A:netbox [production]
14:33 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1206', diff saved to https://phabricator.wikimedia.org/P70507 and previous config saved to /var/cache/conftool/dbconfig/20241022-143355-ladsgroup.json [production]
14:32 <dreamyjazz@deploy2002> Finished scap sync-world: Backport for [[gerrit:1082219|Fix performer link on Special:GlobalBlockList (T377398)]] (duration: 07m 43s) [production]
14:31 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
14:30 <arnaudb@cumin1002> dbctl commit (dc=all): 'Depooling db2155 (T367781)', diff saved to https://phabricator.wikimedia.org/P70506 and previous config saved to /var/cache/conftool/dbconfig/20241022-143005-arnaudb.json [production]
14:30 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
14:29 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db2187.codfw.wmnet with reason: Maintenance [production]
14:29 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
14:29 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on db2155.codfw.wmnet with reason: Maintenance [production]
14:27 <dreamyjazz@deploy2002> dreamyjazz: Continuing with sync [production]
14:27 <dreamyjazz@deploy2002> dreamyjazz: Backport for [[gerrit:1082219|Fix performer link on Special:GlobalBlockList (T377398)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:26 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2227 (re)pooling @ 50%: T377718', diff saved to https://phabricator.wikimedia.org/P70505 and previous config saved to /var/cache/conftool/dbconfig/20241022-142642-arnaudb.json [production]
14:24 <dreamyjazz@deploy2002> Started scap sync-world: Backport for [[gerrit:1082219|Fix performer link on Special:GlobalBlockList (T377398)]] [production]
14:21 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2149 (re)pooling @ 75%: post clone', diff saved to https://phabricator.wikimedia.org/P70504 and previous config saved to /var/cache/conftool/dbconfig/20241022-142123-arnaudb.json [production]
14:18 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1206 (T376905)', diff saved to https://phabricator.wikimedia.org/P70503 and previous config saved to /var/cache/conftool/dbconfig/20241022-141848-ladsgroup.json [production]
14:11 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2227 (re)pooling @ 25%: T377718', diff saved to https://phabricator.wikimedia.org/P70502 and previous config saved to /var/cache/conftool/dbconfig/20241022-141137-arnaudb.json [production]
14:10 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2011.codfw.wmnet [production]
14:10 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti2011.codfw.wmnet [production]
14:10 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2011.codfw.wmnet [production]
14:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1206 (T376905)', diff saved to https://phabricator.wikimedia.org/P70501 and previous config saved to /var/cache/conftool/dbconfig/20241022-140956-ladsgroup.json [production]
14:09 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1206.eqiad.wmnet with reason: Maintenance [production]
14:09 <ejegg> payments-wiki upgraded from 7ae3479f to a039cd50 [production]
14:09 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1206.eqiad.wmnet with reason: Maintenance [production]
14:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1196 (T376905)', diff saved to https://phabricator.wikimedia.org/P70500 and previous config saved to /var/cache/conftool/dbconfig/20241022-140931-ladsgroup.json [production]
14:08 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2011.codfw.wmnet [production]
14:06 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2149 (re)pooling @ 50%: post clone', diff saved to https://phabricator.wikimedia.org/P70499 and previous config saved to /var/cache/conftool/dbconfig/20241022-140617-arnaudb.json [production]
14:03 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2011.codfw.wmnet [production]
13:59 <moritzm> rebalance ganeti clusters in magru following reboots [production]
13:58 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti7001.magru.wmnet [production]
13:57 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti7001.magru.wmnet [production]
13:56 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2227 (re)pooling @ 10%: T377718', diff saved to https://phabricator.wikimedia.org/P70498 and previous config saved to /var/cache/conftool/dbconfig/20241022-135631-arnaudb.json [production]
13:54 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1196', diff saved to https://phabricator.wikimedia.org/P70497 and previous config saved to /var/cache/conftool/dbconfig/20241022-135424-ladsgroup.json [production]
13:52 <Lucas_WMDE> UTC afternoon backport+window done (a further GlobalBlocking fix will be backported out-of-window soon) [production]
13:51 <aqu@deploy2002> Finished deploy [analytics/refinery@ffc985a] (hadoop-test): Adding refinery/source 0.2.49.2 & 0.2.53 [analytics/refinery@ffc985a7] (duration: 03m 17s) [production]