2024-07-02
ยง
|
09:20 |
<brouberol@cumin1002> |
START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:dse-k8s-worker |
[production] |
09:15 |
<jynus@cumin1002> |
dbctl commit (dc=all): 'Repool es1025 at 50% weight T363812', diff saved to https://phabricator.wikimedia.org/P65644 and previous config saved to /var/cache/conftool/dbconfig/20240702-091508-jynus.json |
[production] |
08:57 |
<jynus@cumin1002> |
dbctl commit (dc=all): 'Repool es1025 at 10% weight T363812', diff saved to https://phabricator.wikimedia.org/P65643 and previous config saved to /var/cache/conftool/dbconfig/20240702-085733-jynus.json |
[production] |
08:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1197 (T367856)', diff saved to https://phabricator.wikimedia.org/P65642 and previous config saved to /var/cache/conftool/dbconfig/20240702-084447-marostegui.json |
[production] |
08:44 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1197.eqiad.wmnet with reason: Maintenance |
[production] |
08:44 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1197.eqiad.wmnet with reason: Maintenance |
[production] |
08:44 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1188 (T367856)', diff saved to https://phabricator.wikimedia.org/P65641 and previous config saved to /var/cache/conftool/dbconfig/20240702-084425-marostegui.json |
[production] |
08:40 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp6009.*} and A:cp |
[production] |
08:38 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp6009.*} and A:cp |
[production] |
08:36 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_magru |
[production] |
08:34 |
<hashar@deploy1002> |
rebuilt and synchronized wikiversions files: group0 wikis to 1.43.0-wmf.12 refs T366957 |
[production] |
08:34 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_magru |
[production] |
08:30 |
<jayme@cumin1002> |
conftool action : set/pooled=inactive; selector: name=kubernetes1051.eqiad.wmnet |
[production] |
08:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1188', diff saved to https://phabricator.wikimedia.org/P65640 and previous config saved to /var/cache/conftool/dbconfig/20240702-082918-marostegui.json |
[production] |
08:22 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2031.*} and A:cp |
[production] |
08:20 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2031.*} and A:cp |
[production] |
08:17 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2030.*} and A:cp |
[production] |
08:16 |
<jayme@deploy1002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
08:15 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2030.*} and A:cp |
[production] |
08:15 |
<jayme@deploy1002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
08:14 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2028.*} and A:cp |
[production] |
08:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1188', diff saved to https://phabricator.wikimedia.org/P65639 and previous config saved to /var/cache/conftool/dbconfig/20240702-081411-marostegui.json |
[production] |
08:13 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2028.*} and A:cp |
[production] |
08:12 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2027.*} and A:cp |
[production] |
08:11 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2027.*} and A:cp |
[production] |
08:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2173 (T364069)', diff saved to https://phabricator.wikimedia.org/P65638 and previous config saved to /var/cache/conftool/dbconfig/20240702-081025-marostegui.json |
[production] |
08:10 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
08:10 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
08:10 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2173.codfw.wmnet with reason: Maintenance |
[production] |
08:09 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2173.codfw.wmnet with reason: Maintenance |
[production] |
08:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170 (T364069)', diff saved to https://phabricator.wikimedia.org/P65637 and previous config saved to /var/cache/conftool/dbconfig/20240702-080948-marostegui.json |
[production] |
08:07 |
<jayme> |
draining kubernetes1051.eqiad.wmnet |
[production] |
08:07 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_magru |
[production] |
08:06 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_magru |
[production] |
08:01 |
<jayme> |
cordon kubernetes1051.eqiad.wmnet because of several failed image pulls |
[production] |
07:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1188 (T367856)', diff saved to https://phabricator.wikimedia.org/P65635 and previous config saved to /var/cache/conftool/dbconfig/20240702-075904-marostegui.json |
[production] |
07:58 |
<kharlan@deploy1002> |
Finished scap: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] (duration: 41m 45s) |
[production] |
07:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P65634 and previous config saved to /var/cache/conftool/dbconfig/20240702-075440-marostegui.json |
[production] |
07:52 |
<kharlan@deploy1002> |
kharlan: Continuing with sync |
[production] |
07:51 |
<kharlan@deploy1002> |
kharlan: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P65633 and previous config saved to /var/cache/conftool/dbconfig/20240702-073933-marostegui.json |
[production] |
07:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170 (T364069)', diff saved to https://phabricator.wikimedia.org/P65632 and previous config saved to /var/cache/conftool/dbconfig/20240702-072426-marostegui.json |
[production] |
07:16 |
<kharlan@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] |
[production] |
07:06 |
<kharlan@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] |
[production] |
07:01 |
<oblivian@deploy1002> |
Finished scap: Rebuilding images for change to the base image for httpd (duration: 26m 52s) |
[production] |
06:59 |
<XioNoX> |
update netboot bookworm image to pickup new point release |
[production] |
06:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P65631 and previous config saved to /var/cache/conftool/dbconfig/20240702-065831-root.json |
[production] |
06:35 |
<oblivian@deploy1002> |
Started scap sync-world: Rebuilding images for change to the base image for httpd |
[production] |
06:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P65629 and previous config saved to /var/cache/conftool/dbconfig/20240702-062820-root.json |
[production] |
06:21 |
<_joe_> |
rebuilding httpd-fcgi, mediawiki-httpd images T363342 T368640 |
[production] |