2025-06-04
ยง
|
13:04 |
<jforrester@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1146628|release CampaignEvents to cbk-zam wiki (T393604)]], [[gerrit:1153385|Bump portals to the 2025-06-02 09:23:11+00:00 build (T128546)]], [[gerrit:1151781|build: Rename the rarely-used 'typos' script to 'checkTypos']], [[gerrit:1151751|Drop Chart roll-out dblists, no longer needed (T383079)]] |
[production] |
13:03 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1203', diff saved to https://phabricator.wikimedia.org/P77054 and previous config saved to /var/cache/conftool/dbconfig/20250604-130319-fceratto.json |
[production] |
13:03 |
<jmm@cumin1003> |
START - Cookbook sre.ganeti.addnode for new host ganeti7001.magru.wmnet to cluster magru03 and group B |
[production] |
13:02 |
<sbassett@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
13:02 |
<sbassett@deploy1003> |
helmfile [eqiad] START helmfile.d/services/miscweb: apply |
[production] |
13:02 |
<sbassett@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
13:01 |
<sbassett@deploy1003> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
13:01 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti7001.magru.wmnet |
[production] |
12:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2217 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P77053 and previous config saved to /var/cache/conftool/dbconfig/20250604-125817-root.json |
[production] |
12:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2217 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P77051 and previous config saved to /var/cache/conftool/dbconfig/20250604-124311-root.json |
[production] |
12:43 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host cloudgw1004.eqiad.wmnet |
[production] |
12:42 |
<andrew@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudgw1003.eqiad.wmnet |
[production] |
12:39 |
<jiji@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/mw-mcrouter: apply |
[production] |
12:39 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti7001.magru.wmnet with OS bookworm |
[production] |
12:39 |
<jiji@deploy1003> |
helmfile [codfw] START helmfile.d/services/mw-mcrouter: apply |
[production] |
12:37 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host ms-be1095.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:36 |
<moritzm> |
installing modsecurity-apache security updates |
[production] |
12:36 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.provision for host ms-be1094.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
12:35 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host cloudgw1003.eqiad.wmnet |
[production] |
12:35 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
12:35 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for ms-be1094/95 - jclark@cumin1002" |
[production] |
12:35 |
<jclark@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added mgmt for ms-be1094/95 - jclark@cumin1002" |
[production] |
12:34 |
<andrew@cumin1002> |
START - Cookbook sre.hosts.reimage for host cloudcontrol2010-dev.codfw.wmnet with OS bullseye |
[production] |
12:33 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1203 (T395241)', diff saved to https://phabricator.wikimedia.org/P77050 and previous config saved to /var/cache/conftool/dbconfig/20250604-123304-fceratto.json |
[production] |
12:32 |
<jclark@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
12:29 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2193 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P77049 and previous config saved to /var/cache/conftool/dbconfig/20250604-122948-root.json |
[production] |
12:28 |
<reedy@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1153591|GenerateFancyCaptchas: Handle captcha.py not generating any captchas, but not erroring (T388531)]], [[gerrit:1153592|captcha.py: Expand variables and user in filenames (T395810)]], [[gerrit:1153593|captcha.py: Check if output dir exists, and attempt to create it (else error) (T395804)]], [[gerrit:1153595|captcha.py: Bail out if no words were rea |
[production] |
12:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2217 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P77048 and previous config saved to /var/cache/conftool/dbconfig/20250604-122806-root.json |
[production] |
12:27 |
<jclark@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
12:27 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cloudcontrol2010-dev.codfw.wmnet on all recursors |
[production] |
12:27 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.wipe-cache cloudcontrol2010-dev.codfw.wmnet on all recursors |
[production] |
12:26 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cloudcephosd2010-dev.codfw.wmnet on all recursors |
[production] |
12:26 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.wipe-cache cloudcephosd2010-dev.codfw.wmnet on all recursors |
[production] |
12:25 |
<jclark@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
12:25 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
12:25 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix entries for cloudcontrol2010-dev which had been added on wrong vlan - cmooney@cumin1002" |
[production] |
12:25 |
<cmooney@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: fix entries for cloudcontrol2010-dev which had been added on wrong vlan - cmooney@cumin1002" |
[production] |
12:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db1203 (T395241)', diff saved to https://phabricator.wikimedia.org/P77047 and previous config saved to /var/cache/conftool/dbconfig/20250604-122436-fceratto.json |
[production] |
12:24 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1203.eqiad.wmnet with reason: Maintenance |
[production] |
12:24 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1192 (T395241)', diff saved to https://phabricator.wikimedia.org/P77046 and previous config saved to /var/cache/conftool/dbconfig/20250604-122411-fceratto.json |
[production] |
12:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2217 T395989', diff saved to https://phabricator.wikimedia.org/P77045 and previous config saved to /var/cache/conftool/dbconfig/20250604-122303-marostegui.json |
[production] |
12:22 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2217.codfw.wmnet with reason: Maintenance |
[production] |
12:21 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
12:21 |
<reedy@deploy1003> |
reedy: Continuing with sync |
[production] |
12:21 |
<cmooney@cumin1002> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
12:20 |
<reedy@deploy1003> |
reedy: Backport for [[gerrit:1153591|GenerateFancyCaptchas: Handle captcha.py not generating any captchas, but not erroring (T388531)]], [[gerrit:1153592|captcha.py: Expand variables and user in filenames (T395810)]], [[gerrit:1153593|captcha.py: Check if output dir exists, and attempt to create it (else error) (T395804)]], [[gerrit:1153595|captcha.py: Bail out if no words were read from wordlist (T3 |
[production] |
12:18 |
<cmooney@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
12:18 |
<reedy@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1153591|GenerateFancyCaptchas: Handle captcha.py not generating any captchas, but not erroring (T388531)]], [[gerrit:1153592|captcha.py: Expand variables and user in filenames (T395810)]], [[gerrit:1153593|captcha.py: Check if output dir exists, and attempt to create it (else error) (T395804)]], [[gerrit:1153595|captcha.py: Bail out if no words were read |
[production] |
12:16 |
<jmm@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti7001.magru.wmnet with reason: host reimage |
[production] |
12:14 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2193 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P77044 and previous config saved to /var/cache/conftool/dbconfig/20250604-121442-root.json |
[production] |