2023-01-04
07:38 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
07:38 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
07:38 <oblivian@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-web: apply [production]
07:38 <oblivian@deploy1002> helmfile [codfw] START helmfile.d/services/mw-web: apply [production]
07:38 <marostegui> Switch x1 back to RBR T255174 [production]
07:35 <marostegui> dbmaint codfw deploy schema change on x1 T255174 [production]
07:35 <marostegui> dbmaint eqiad deploy schema change on x1 T255174 [production]
07:29 <marostegui@cumin1001> dbctl commit (dc=all): 'db2131 (re)pooling @ 10%: After testing', diff saved to https://phabricator.wikimedia.org/P42751 and previous config saved to /var/cache/conftool/dbconfig/20230104-072922-root.json [production]
07:20 <oblivian@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
07:20 <oblivian@deploy1002> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
07:19 <oblivian@deploy1002> helmfile [codfw] DONE helmfile.d/services/mw-debug: apply [production]
07:19 <oblivian@deploy1002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
07:14 <marostegui@cumin1001> dbctl commit (dc=all): 'db2131 (re)pooling @ 5%: After testing', diff saved to https://phabricator.wikimedia.org/P42750 and previous config saved to /var/cache/conftool/dbconfig/20230104-071417-root.json [production]
06:59 <marostegui@cumin1001> dbctl commit (dc=all): 'db2131 (re)pooling @ 1%: After testing', diff saved to https://phabricator.wikimedia.org/P42749 and previous config saved to /var/cache/conftool/dbconfig/20230104-065912-root.json [production]
2023-01-03
22:47 <eileen> config 34754c69 -> 03c4d7a6 [production]
22:33 <eileen> config revision changed from 5c73975a to 34754c69 [production]
21:55 <mutante> gitlab-runner* - correction: allowing connections TO kubestagemaster.svc.eqiad.wmnet port 6443 FROM trusted runners, of course - T325385 [production]
21:53 <mutante> gitlab-runner* - allowing kubestagemaster.svc.eqiad.wmnet to connect to port 6443, run puppet via cumin, deploy gerrit:868737 - T325385 [production]
21:47 <taavi> UTC late backports done [production]
21:46 <taavi@deploy1002> Finished scap: Backport for [[gerrit:869226|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855|Use new DiscussionTools heading markup on group1 wikis (T314714)]] (duration: 12m 12s) [production]
21:35 <taavi@deploy1002> taavi and matmarex: Backport for [[gerrit:869226|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855|Use new DiscussionTools heading markup on group1 wikis (T314714)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
21:34 <taavi@deploy1002> Started scap: Backport for [[gerrit:869226|Specify Citoid RESTBase URL separately (T325425)]], [[gerrit:874855|Use new DiscussionTools heading markup on group1 wikis (T314714)]] [production]
21:30 <taavi@deploy1002> Finished scap: Backport for [[gerrit:874443|Start writing to cuc_comment_id on test wikis (T233004)]] (duration: 12m 54s) [production]
21:19 <taavi@deploy1002> taavi and zabe: Backport for [[gerrit:874443|Start writing to cuc_comment_id on test wikis (T233004)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
21:17 <taavi@deploy1002> Started scap: Backport for [[gerrit:874443|Start writing to cuc_comment_id on test wikis (T233004)]] [production]
21:15 <taavi@deploy1002> Finished scap: Backport for [[gerrit:873880|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418|Pin cu_changes comment migration to old schema (T233004)]] (duration: 08m 49s) [production]
21:08 <taavi@deploy1002> taavi and zabe: Backport for [[gerrit:873880|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418|Pin cu_changes comment migration to old schema (T233004)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
21:06 <taavi@deploy1002> Started scap: Backport for [[gerrit:873880|Stop setting $wgActorTableSchemaMigrationStage (T215466)]], [[gerrit:873887|Pin $wgCommentTempTableSchemaMigrationStage to default value (T299954)]], [[gerrit:874418|Pin cu_changes comment migration to old schema (T233004)]] [production]
19:27 <dduvall@deploy1002> rebuilt and synchronized wikiversions files: group0 wikis to 1.40.0-wmf.17 refs T325580 [production]
19:18 <dduvall@deploy1002> deploy-promote aborted: (duration: 08m 55s) [production]
19:13 <btullis@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cephosd1001.eqiad.wmnet with OS bullseye [production]
17:37 <claime> Finished parse reboots in eqiad [production]
17:36 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) [production]
17:30 <sukhe> sudo cumin -b 1 -s 5 'A:codfw and P{O:swift::proxy}' 'depool && sleep 3 && systemctl restart swift-proxy && sleep 3 && pool' [production]
16:40 <ejegg> fundraising EOY receipt calculation finished, restarted scheduled jobs [production]
16:21 <ejegg> fundraising scheduled jobs disabled for EOY receipt calculation [production]
15:37 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host cephosd1001.eqiad.wmnet with OS bullseye [production]
15:30 <btullis@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cephosd1001.eqiad.wmnet with OS bullseye [production]
15:14 <cgoubert@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
15:13 <cgoubert@cumin1001> END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) [production]
15:13 <cgoubert@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
15:13 <cgoubert@cumin1001> END (ERROR) - Cookbook sre.hosts.reboot-cluster (exit_code=97) [production]
15:13 <cgoubert@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
15:11 <cgoubert@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=1) [production]
15:10 <andrewbogott> upgrading and rebooting wikitech-static [production]
15:07 <cgoubert@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
15:06 <claime> Starting rolling reboot of parse* hosts in eqiad [production]
15:05 <taavi> UTC afternoon backports done [production]
15:04 <taavi@deploy1002> Finished scap: Backport for [[gerrit:874871|SecurePoll: Add files for UCoC 2023 vote (T324793)]], [[gerrit:874872|ucoc2023: Update populateEditCount to count Flow edits (T324793)]], [[gerrit:874873|ucoc2023: Update populateEditCount to count Flow edits (T324793)]] (duration: 08m 10s) [production]
15:00 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts graphite1004.eqiad.wmnet [production]