|
2025-02-13
|
| 20:09 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 20:07 |
<urbanecm> |
mwscript-k8s --attach extensions/Translate/scripts/moveTranslatableBundle.php -- --wiki=metawiki 'Wiki_Movement_Brazil_User_Group' 'Wikimedia Brasil' 'Martin Urbanec' --reason='per [[special:Permalink/28261149#Wikimedia_Brasil|request]] ([[:phab:T386402]])' # T386402 |
[production] |
| 20:03 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 20:03 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 20:03 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:59 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:59 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:58 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:58 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:56 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:56 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 19:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2146 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73475 and previous config saved to /var/cache/conftool/dbconfig/20250213-195454-root.json |
[production] |
| 19:44 |
<rzl@deploy2002> |
Finished scap sync-world: T383952, T384137 (duration: 11m 16s) |
[production] |
| 19:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2146 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73474 and previous config saved to /var/cache/conftool/dbconfig/20250213-193949-root.json |
[production] |
| 19:38 |
<rzl@deploy2002> |
rzl: Continuing with sync |
[production] |
| 19:36 |
<rzl@deploy2002> |
rzl: T383952, T384137 synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
| 19:35 |
<rzl@deploy2002> |
Started scap sync-world: T383952, T384137 |
[production] |
| 19:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2146 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73473 and previous config saved to /var/cache/conftool/dbconfig/20250213-192444-root.json |
[production] |
| 19:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1219 (re)pooling @ 100%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73472 and previous config saved to /var/cache/conftool/dbconfig/20250213-192047-root.json |
[production] |
| 19:18 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host relforge1006.eqiad.wmnet with OS bullseye |
[production] |
| 19:15 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host relforge1007.eqiad.wmnet with OS bullseye |
[production] |
| 19:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2146 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73471 and previous config saved to /var/cache/conftool/dbconfig/20250213-190938-root.json |
[production] |
| 19:05 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1219 (re)pooling @ 75%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73470 and previous config saved to /var/cache/conftool/dbconfig/20250213-190542-root.json |
[production] |
| 19:01 |
<tchin@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/eventstreams: apply |
[production] |
| 19:00 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on relforge1006.eqiad.wmnet with reason: host reimage |
[production] |
| 19:00 |
<tchin@deploy2002> |
helmfile [eqiad] START helmfile.d/services/eventstreams: apply |
[production] |
| 18:58 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on relforge1007.eqiad.wmnet with reason: host reimage |
[production] |
| 18:55 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on relforge1006.eqiad.wmnet with reason: host reimage |
[production] |
| 18:54 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on relforge1007.eqiad.wmnet with reason: host reimage |
[production] |
| 18:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2146 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73469 and previous config saved to /var/cache/conftool/dbconfig/20250213-185433-root.json |
[production] |
| 18:50 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1219 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73467 and previous config saved to /var/cache/conftool/dbconfig/20250213-185036-root.json |
[production] |
| 18:40 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker1009.eqiad.wmnet with OS bookworm |
[production] |
| 18:39 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host relforge1006.eqiad.wmnet with OS bullseye |
[production] |
| 18:39 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host relforge1007.eqiad.wmnet with OS bullseye |
[production] |
| 18:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1219 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73466 and previous config saved to /var/cache/conftool/dbconfig/20250213-183531-root.json |
[production] |
| 18:28 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host relforge1007.eqiad.wmnet with OS bullseye |
[production] |
| 18:28 |
<bking@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host relforge1006.eqiad.wmnet with OS bullseye |
[production] |
| 18:22 |
<tchin@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/eventstreams: apply |
[production] |
| 18:22 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1009.eqiad.wmnet with reason: host reimage |
[production] |
| 18:21 |
<tchin@deploy2002> |
helmfile [codfw] START helmfile.d/services/eventstreams: apply |
[production] |
| 18:20 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1219 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P73465 and previous config saved to /var/cache/conftool/dbconfig/20250213-182026-root.json |
[production] |
| 18:20 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host relforge1005.eqiad.wmnet with OS bullseye |
[production] |
| 18:18 |
<stevemunene@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1009.eqiad.wmnet with reason: host reimage |
[production] |
| 18:05 |
<stevemunene@cumin1002> |
START - Cookbook sre.hosts.reimage for host dse-k8s-worker1009.eqiad.wmnet with OS bookworm |
[production] |
| 18:05 |
<tchin@deploy2002> |
helmfile [staging] DONE helmfile.d/services/eventstreams: apply |
[production] |
| 18:04 |
<tchin@deploy2002> |
helmfile [staging] START helmfile.d/services/eventstreams: apply |
[production] |
| 18:04 |
<stevemunene> |
reimaging dse-k8s-worker1009 |
[analytics] |
| 18:03 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on relforge1005.eqiad.wmnet with reason: host reimage |
[production] |
| 18:01 |
<stevemunene> |
draining dse-k8s-worker1009 ready for reimage to bookworm and containerd for T377875 |
[analytics] |
| 17:58 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on relforge1005.eqiad.wmnet with reason: host reimage |
[production] |