|
2026-01-13
ยง
|
| 21:25 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1352.eqiad.wmnet with OS bookworm |
[production] |
| 21:24 |
<dani@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1225664|Undeploy Safety survey (T413022)]] (duration: 06m 59s) |
[production] |
| 21:24 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2247 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87484 and previous config saved to /var/cache/conftool/dbconfig/20260113-212400-marostegui.json |
[production] |
| 21:23 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2247.codfw.wmnet with reason: Maintenance |
[production] |
| 21:23 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1353.eqiad.wmnet with OS bookworm |
[production] |
| 21:23 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2246 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87483 and previous config saved to /var/cache/conftool/dbconfig/20260113-212333-marostegui.json |
[production] |
| 21:21 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1355.eqiad.wmnet with OS bookworm |
[production] |
| 21:21 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" |
[production] |
| 21:21 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1353.eqiad.wmnet with OS bookworm |
[production] |
| 21:21 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1352.eqiad.wmnet with OS bookworm |
[production] |
| 21:20 |
<dani@deploy2002> |
dani: Continuing with sync |
[production] |
| 21:20 |
<jclark@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1003" |
[production] |
| 21:20 |
<dani@deploy2002> |
dani: Backport for [[gerrit:1225664|Undeploy Safety survey (T413022)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:17 |
<dani@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1225664|Undeploy Safety survey (T413022)]] |
[production] |
| 21:16 |
<cjming@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1223261|Update description of the Math API (T411517)]], [[gerrit:1226257|Replaced mpic-next.wikimedia.org with test-kitchen-next.wikimedia.org (T407805)]] (duration: 09m 18s) |
[production] |
| 21:13 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2246', diff saved to https://phabricator.wikimedia.org/P87482 and previous config saved to /var/cache/conftool/dbconfig/20260113-211325-marostegui.json |
[production] |
| 21:12 |
<cjming@deploy2002> |
aaron, cjming, sfaci: Continuing with sync |
[production] |
| 21:11 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1352.eqiad.wmnet with OS bookworm |
[production] |
| 21:11 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1353.eqiad.wmnet with OS bookworm |
[production] |
| 21:09 |
<cjming@deploy2002> |
aaron, cjming, sfaci: Backport for [[gerrit:1223261|Update description of the Math API (T411517)]], [[gerrit:1226257|Replaced mpic-next.wikimedia.org with test-kitchen-next.wikimedia.org (T407805)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:06 |
<cjming@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1223261|Update description of the Math API (T411517)]], [[gerrit:1226257|Replaced mpic-next.wikimedia.org with test-kitchen-next.wikimedia.org (T407805)]] |
[production] |
| 21:04 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1355.eqiad.wmnet with reason: host reimage |
[production] |
| 21:03 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2246', diff saved to https://phabricator.wikimedia.org/P87481 and previous config saved to /var/cache/conftool/dbconfig/20260113-210317-marostegui.json |
[production] |
| 21:02 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1352.eqiad.wmnet with OS bookworm |
[production] |
| 21:01 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1353.eqiad.wmnet with OS bookworm |
[production] |
| 20:58 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1355.eqiad.wmnet with reason: host reimage |
[production] |
| 20:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2246 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87480 and previous config saved to /var/cache/conftool/dbconfig/20260113-205308-marostegui.json |
[production] |
| 20:49 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1355 |
[production] |
| 20:48 |
<jclark@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1355 |
[production] |
| 20:48 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1353 |
[production] |
| 20:48 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1352.eqiad.wmnet with OS bookworm |
[production] |
| 20:48 |
<jclark@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1353 |
[production] |
| 20:48 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1352 |
[production] |
| 20:48 |
<jclark@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1352 |
[production] |
| 20:47 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 20:47 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt WIKIKUBE - jclark@cumin1003" |
[production] |
| 20:47 |
<jclark@cumin1003> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt WIKIKUBE - jclark@cumin1003" |
[production] |
| 20:47 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1355.eqiad.wmnet with OS bookworm |
[production] |
| 20:47 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1353.eqiad.wmnet with OS bookworm |
[production] |
| 20:44 |
<jclark@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
| 19:43 |
<jhuneidi@deploy2002> |
Finished scap sync-world: test sync for T411516 (duration: 02m 38s) |
[production] |
| 19:40 |
<jhuneidi@deploy2002> |
Started scap sync-world: test sync for T411516 |
[production] |
| 19:22 |
<jhuneidi@deploy2002> |
rebuilt and synchronized wikiversions files: group0 to 1.46.0-wmf.11 refs T413802 |
[production] |
| 16:41 |
<elukey> |
roll restart docker-registry-swift daemons on registry* to pick up the new settings (apparently the service refresh issued by puppet didn't work as intended) - T390251 |
[production] |
| 16:38 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1355.eqiad.wmnet with OS bookworm |
[production] |
| 16:28 |
<cgoubert@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
| 16:27 |
<cgoubert@deploy2002> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
| 16:27 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
| 16:27 |
<cgoubert@deploy2002> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
| 16:26 |
<cgoubert@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |