2023-01-04
07:38 | <oblivian@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mw-web: apply | [production]
07:38 | <oblivian@deploy1002> | helmfile [codfw] START helmfile.d/services/mw-web: apply | [production]
07:38 | <marostegui> | Switch x1 back to RBR T255174 | [production]
07:35 | <marostegui> | dbmaint codfw deploy schema change on x1 T255174 | [production]
07:35 | <marostegui> | dbmaint eqiad deploy schema change on x1 T255174 | [production]
07:29 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2131 (re)pooling @ 10%: After testing', diff saved to https://phabricator.wikimedia.org/P42751 and previous config saved to /var/cache/conftool/dbconfig/20230104-072922-root.json | [production]
07:20 | <oblivian@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply | [production]
07:20 | <oblivian@deploy1002> | helmfile [eqiad] START helmfile.d/services/mw-debug: apply | [production]
07:19 | <oblivian@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mw-debug: apply | [production]
07:19 | <oblivian@deploy1002> | helmfile [codfw] START helmfile.d/services/mw-debug: apply | [production]
07:14 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2131 (re)pooling @ 5%: After testing', diff saved to https://phabricator.wikimedia.org/P42750 and previous config saved to /var/cache/conftool/dbconfig/20230104-071417-root.json | [production]
06:59 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2131 (re)pooling @ 1%: After testing', diff saved to https://phabricator.wikimedia.org/P42749 and previous config saved to /var/cache/conftool/dbconfig/20230104-065912-root.json | [production]
2023-01-03
22:47 | <eileen> | config 34754c69 -> 03c4d7a6 | [production]
22:33 | <eileen> | config revision changed from 5c73975a to 34754c69 | [production]
21:55 | <mutante> | gitlab-runner* - correction: allowing connections TO kubestagemaster.svc.eqiad.wmnet port 6443 FROM trusted runners, of course - T325385 | [production]
21:53 | <mutante> | gitlab-runner* - allowing kubestagemaster.svc.eqiad.wmnet to connect to port 6443, run puppet via cumin, deploy gerrit:868737 - T325385 | [production]
21:47 | <taavi> | UTC late backports done | [production]
21:46 | <taavi@deploy1002> | Finished scap: Backport for [[gerrit:869226|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855|Use new DiscussionTools heading markup on group1 wikis (T314714)]] (duration: 12m 12s) | [production]
21:35 | <taavi@deploy1002> | taavi and matmarex: Backport for [[gerrit:869226|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855|Use new DiscussionTools heading markup on group1 wikis (T314714)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet | [production]
21:34 | <taavi@deploy1002> | Started scap: Backport for [[gerrit:869226|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855|Use new DiscussionTools heading markup on group1 wikis (T314714)]] | [production]
21:30 | <taavi@deploy1002> | Finished scap: Backport for [[gerrit:874443|Start writing to cuc_comment_id on test wikis (T233004)]] (duration: 12m 54s) | [production]
21:19 | <taavi@deploy1002> | taavi and zabe: Backport for [[gerrit:874443|Start writing to cuc_comment_id on test wikis (T233004)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet | [production]
21:17 | <taavi@deploy1002> | Started scap: Backport for [[gerrit:874443|Start writing to cuc_comment_id on test wikis (T233004)]] | [production]
21:15 | <taavi@deploy1002> | Finished scap: Backport for [[gerrit:873880|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418|Pin cu_changes comment migration to old schema (T233004)]] (duration: 08m 49s) | [production]
21:08 | <taavi@deploy1002> | taavi and zabe: Backport for [[gerrit:873880|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418|Pin cu_changes comment migration to old schema (T233004)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet | [production]
21:06 | <taavi@deploy1002> | Started scap: Backport for [[gerrit:873880|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418|Pin cu_changes comment migration to old schema (T233004)]] | [production]
19:27 | <dduvall@deploy1002> | rebuilt and synchronized wikiversions files: group0 wikis to 1.40.0-wmf.17 refs T325580 | [production]
19:18 | <dduvall@deploy1002> | deploy-promote aborted: (duration: 08m 55s) | [production]
19:13 | <btullis@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cephosd1001.eqiad.wmnet with OS bullseye | [production]
17:37 | <claime> | Finished parse reboots in eqiad | [production]
17:36 | <cgoubert@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) | [production]
17:30 | <sukhe> | sudo cumin -b 1 -s 5 'A:codfw and P{O:swift::proxy}' 'depool && sleep 3 && systemctl restart swift-proxy && sleep 3 && pool' | [production]
16:40 | <ejegg> | fundraising EOY receipt calculation finished, restarted scheduled jobs | [production]
16:21 | <ejegg> | fundraising scheduled jobs disabled for EOY receipt calculation | [production]
15:37 | <btullis@cumin1001> | START - Cookbook sre.hosts.reimage for host cephosd1001.eqiad.wmnet with OS bullseye | [production]
15:30 | <btullis@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cephosd1001.eqiad.wmnet with OS bullseye | [production]
15:14 | <cgoubert@cumin1001> | START - Cookbook sre.hosts.reboot-cluster | [production]
15:13 | <cgoubert@cumin1001> | END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) | [production]
15:13 | <cgoubert@cumin1001> | START - Cookbook sre.hosts.reboot-cluster | [production]
15:13 | <cgoubert@cumin1001> | END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) | [production]
15:13 | <cgoubert@cumin1001> | START - Cookbook sre.hosts.reboot-cluster | [production]
15:11 | <cgoubert@cumin1001> | END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=1) | [production]
15:10 | <andrewbogott> | upgrading and rebooting wikitech-static | [production]
15:07 | <cgoubert@cumin1001> | START - Cookbook sre.hosts.reboot-cluster | [production]
15:06 | <claime> | Starting rolling reboot of parse* hosts in eqiad | [production]
15:05 | <taavi> | UTC afternoon backports done | [production]
15:04 | <taavi@deploy1002> | Finished scap: Backport for [[gerrit:874871|SecurePoll: Add files for UCoC 2023 vote (T324793)]], [[gerrit:874872|ucoc2023: Update populateEditCount to count Flow edits (T324793)]], [[gerrit:874873|ucoc2023: Update populateEditCount to count Flow edits (T324793)]] (duration: 08m 10s) | [production]
15:00 | <filippo@cumin1001> | END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts graphite1004.eqiad.wmnet | [production]
14:59 | <filippo@cumin1001> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production]
14:59 | <filippo@cumin1001> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: graphite1004.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - filippo@cumin1001" | [production]