2024-07-02
|
08:17 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2030.*} and A:cp |
[production] |
08:16 |
<jayme@deploy1002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
08:15 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2030.*} and A:cp |
[production] |
08:15 |
<jayme@deploy1002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
08:14 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2028.*} and A:cp |
[production] |
08:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1188', diff saved to https://phabricator.wikimedia.org/P65639 and previous config saved to /var/cache/conftool/dbconfig/20240702-081411-marostegui.json |
[production] |
08:13 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2028.*} and A:cp |
[production] |
08:12 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp2027.*} and A:cp |
[production] |
08:11 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp2027.*} and A:cp |
[production] |
08:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2173 (T364069)', diff saved to https://phabricator.wikimedia.org/P65638 and previous config saved to /var/cache/conftool/dbconfig/20240702-081025-marostegui.json |
[production] |
08:10 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
08:10 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
08:10 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2173.codfw.wmnet with reason: Maintenance |
[production] |
08:09 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2173.codfw.wmnet with reason: Maintenance |
[production] |
08:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170 (T364069)', diff saved to https://phabricator.wikimedia.org/P65637 and previous config saved to /var/cache/conftool/dbconfig/20240702-080948-marostegui.json |
[production] |
08:07 |
<jayme> |
draining kubernetes1051.eqiad.wmnet |
[production] |
08:07 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_magru |
[production] |
08:06 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_magru |
[production] |
08:01 |
<jayme> |
cordon kubernetes1051.eqiad.wmnet because of several failed image pulls |
[production] |
07:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1188 (T367856)', diff saved to https://phabricator.wikimedia.org/P65635 and previous config saved to /var/cache/conftool/dbconfig/20240702-075904-marostegui.json |
[production] |
07:58 |
<kharlan@deploy1002> |
Finished scap: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] (duration: 41m 45s) |
[production] |
07:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P65634 and previous config saved to /var/cache/conftool/dbconfig/20240702-075440-marostegui.json |
[production] |
07:52 |
<kharlan@deploy1002> |
kharlan: Continuing with sync |
[production] |
07:51 |
<kharlan@deploy1002> |
kharlan: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
07:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170', diff saved to https://phabricator.wikimedia.org/P65633 and previous config saved to /var/cache/conftool/dbconfig/20240702-073933-marostegui.json |
[production] |
07:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2170 (T364069)', diff saved to https://phabricator.wikimedia.org/P65632 and previous config saved to /var/cache/conftool/dbconfig/20240702-072426-marostegui.json |
[production] |
07:16 |
<kharlan@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] |
[production] |
07:06 |
<kharlan@deploy1002> |
Started scap sync-world: Backport for [[gerrit:1051246|Revert "QuickSurveys: Add testing survey configuration" (T368459)]] |
[production] |
07:01 |
<oblivian@deploy1002> |
Finished scap: Rebuilding images for change to the base image for httpd (duration: 26m 52s) |
[production] |
06:59 |
<XioNoX> |
update netboot bookworm image to pickup new point release |
[production] |
06:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P65631 and previous config saved to /var/cache/conftool/dbconfig/20240702-065831-root.json |
[production] |
06:35 |
<oblivian@deploy1002> |
Started scap sync-world: Rebuilding images for change to the base image for httpd |
[production] |
06:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P65629 and previous config saved to /var/cache/conftool/dbconfig/20240702-062820-root.json |
[production] |
06:21 |
<_joe_> |
rebuilding httpd-fcgi, mediawiki-httpd images T363342 T368640 |
[production] |
06:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P65628 and previous config saved to /var/cache/conftool/dbconfig/20240702-061315-root.json |
[production] |
05:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P65627 and previous config saved to /var/cache/conftool/dbconfig/20240702-055809-root.json |
[production] |
05:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P65626 and previous config saved to /var/cache/conftool/dbconfig/20240702-054304-root.json |
[production] |
05:27 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1192 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P65625 and previous config saved to /var/cache/conftool/dbconfig/20240702-052759-root.json |
[production] |
05:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1192 T368371', diff saved to https://phabricator.wikimedia.org/P65624 and previous config saved to /var/cache/conftool/dbconfig/20240702-052543-root.json |
[production] |
05:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db1209 to s8 primary and set section read-write T368371', diff saved to https://phabricator.wikimedia.org/P65623 and previous config saved to /var/cache/conftool/dbconfig/20240702-052447-marostegui.json |
[production] |
05:24 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set s8 eqiad as read-only for maintenance - T368371', diff saved to https://phabricator.wikimedia.org/P65622 and previous config saved to /var/cache/conftool/dbconfig/20240702-052408-marostegui.json |
[production] |
05:23 |
<marostegui> |
Starting s8 eqiad failover from db1192 to db1209 - T368371 |
[production] |
04:59 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db1209 remove from API T368371', diff saved to https://phabricator.wikimedia.org/P65621 and previous config saved to /var/cache/conftool/dbconfig/20240702-045929-marostegui.json |
[production] |
04:59 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 33 hosts with reason: Primary switchover s8 T368371 |
[production] |
04:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db1209 with weight 0 T368371', diff saved to https://phabricator.wikimedia.org/P65620 and previous config saved to /var/cache/conftool/dbconfig/20240702-045856-marostegui.json |
[production] |
04:58 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 33 hosts with reason: Primary switchover s8 T368371 |
[production] |
04:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2170 (T364069)', diff saved to https://phabricator.wikimedia.org/P65619 and previous config saved to /var/cache/conftool/dbconfig/20240702-043349-marostegui.json |
[production] |
04:33 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2170.codfw.wmnet with reason: Maintenance |
[production] |
04:33 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2170.codfw.wmnet with reason: Maintenance |
[production] |
04:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2153 (T364069)', diff saved to https://phabricator.wikimedia.org/P65618 and previous config saved to /var/cache/conftool/dbconfig/20240702-043326-marostegui.json |
[production] |