6051-6100 of 10000 results (111ms)
2023-04-04 ยง
20:44 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on miscweb2002.codfw.wmnet with reason: decom [production]
20:44 <TheresNoTime> closing UTC late backport window [production]
20:38 <samtar@deploy2002> Finished scap: Backport for [[gerrit:903781|Clean up history page visual diffs beta feature config (T333448)]] (duration: 06m 42s) [production]
20:33 <samtar@deploy2002> matmarex and samtar: Backport for [[gerrit:903781|Clean up history page visual diffs beta feature config (T333448)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
20:31 <samtar@deploy2002> Started scap: Backport for [[gerrit:903781|Clean up history page visual diffs beta feature config (T333448)]] [production]
20:27 <samtar@deploy2002> Finished scap: Backport for [[gerrit:905685|EditCheck: catch errors from TransactionSquasher (T324733)]] (duration: 08m 23s) [production]
20:23 <inflatador> bking@cumin1001 unban elastic nodes post switch maintenance T331882 [production]
20:20 <samtar@deploy2002> matmarex and samtar: Backport for [[gerrit:905685|EditCheck: catch errors from TransactionSquasher (T324733)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
20:18 <samtar@deploy2002> Started scap: Backport for [[gerrit:905685|EditCheck: catch errors from TransactionSquasher (T324733)]] [production]
20:11 <samtar@deploy2002> Finished scap: Backport for [[gerrit:905727|Revert "Revert "Enable hidden tag for "Edit Check" project on Wikipedias"" (T324733)]] (duration: 07m 30s) [production]
20:10 <mutante> deploying ATS config change on cp2* for query.wikidata.org [production]
20:06 <ryankemper> T331896 Running puppet on wcqs fleet to pickup new miscweb gui_url: `ryankemper@cumin1001:~$ sudo -E cumin -b 2 'wcqs*' 'run-puppet-agent'` [production]
20:05 <samtar@deploy2002> matmarex and samtar: Backport for [[gerrit:905727|Revert "Revert "Enable hidden tag for "Edit Check" project on Wikipedias"" (T324733)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
20:03 <samtar@deploy2002> Started scap: Backport for [[gerrit:905727|Revert "Revert "Enable hidden tag for "Edit Check" project on Wikipedias"" (T324733)]] [production]
20:03 <mutante> running puppet on cp5*, cp4*... [production]
20:00 <ryankemper> T331896 Running puppet on wdqs fleet to pickup new miscweb gui_url: `ryankemper@cumin1001:~$ sudo -E cumin -b 6 'wdqs*' 'run-puppet-agent'` [production]
19:58 <hashar@deploy2002> Finished deploy [gerrit/gerrit@dbaaa7a]: wm-zuul-status: change pending jobs SUCCESS > INFO | T214068 (duration: 00m 07s) [production]
19:58 <hashar@deploy2002> Started deploy [gerrit/gerrit@dbaaa7a]: wm-zuul-status: change pending jobs SUCCESS > INFO | T214068 [production]
19:55 <mutante> https://query.wikidata.org and WCQS GUIs are switching to new backend VMs on bullseye in codfw T330090 T331896 [production]
19:46 <hashar@deploy2002> Finished scap: Backport for [[gerrit:905726|Replace usages of Hooks::register() (T334005)]] (duration: 06m 55s) [production]
19:40 <hashar@deploy2002> hashar: Backport for [[gerrit:905726|Replace usages of Hooks::register() (T334005)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
19:39 <hashar@deploy2002> Started scap: Backport for [[gerrit:905726|Replace usages of Hooks::register() (T334005)]] [production]
19:10 <hashar@deploy2002> rebuilt and synchronized wikiversions files: group0 wikis to 1.41.0-wmf.3 refs T330209 [production]
18:05 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
18:05 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
17:22 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:905617|Revert "mergeMessageFileList.php: move code out of file scope." (T333966)]] (duration: 38m 18s) [production]
17:04 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:905617|Revert "mergeMessageFileList.php: move code out of file scope." (T333966)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet [production]
16:56 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
16:55 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
16:55 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
16:55 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
16:44 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:905617|Revert "mergeMessageFileList.php: move code out of file scope." (T333966)]] [production]
16:37 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
16:37 <dcausse@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply [production]
16:17 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:905623|Revert "external store: Depool es4 (cluster26) from writes for maintenance" (T333961)]] (duration: 07m 31s) [production]
16:11 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:905623|Revert "external store: Depool es4 (cluster26) from writes for maintenance" (T333961)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
16:10 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:905623|Revert "external store: Depool es4 (cluster26) from writes for maintenance" (T333961)]] [production]
16:07 <jynus@cumin1001> dbctl commit (dc=all): 'Repool es1021 for reads', diff saved to https://phabricator.wikimedia.org/P46031 and previous config saved to /var/cache/conftool/dbconfig/20230404-160702-jynus.json [production]
16:01 <jynus@cumin1001> dbctl commit (dc=all): 'Repool es1021 for reads (only 10%)', diff saved to https://phabricator.wikimedia.org/P46030 and previous config saved to /var/cache/conftool/dbconfig/20230404-160146-jynus.json [production]
15:59 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on es1022.eqiad.wmnet with reason: T333961 [production]
15:59 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on es1022.eqiad.wmnet with reason: T333961 [production]
15:58 <jynus> restart es1021, several connections in a "stuck" state T333961 [production]
15:50 <dancy@deploy2002> Installation of scap version "4.48.0" completed for 592 hosts [production]
15:49 <dancy@deploy2002> Installing scap version "4.48.0" for 592 hosts [production]
15:31 <jynus> restart es1021, several connections in a "stuck" state T333961 [production]
15:25 <jynus@cumin1001> dbctl commit (dc=all): 'Depool es1021 reads', diff saved to https://phabricator.wikimedia.org/P46029 and previous config saved to /var/cache/conftool/dbconfig/20230404-152501-jynus.json [production]
15:23 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
15:19 <jiji@cumin1001> END (FAIL) - Cookbook sre.discovery.datacenter (exit_code=93) pool all active/active services in eqiad: eqiad row C switches upgrade - T331882 [production]
15:18 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:905648|external store: Depool es4 (cluster26) from writes for maintenance (T333961)]] (duration: 11m 30s) [production]
15:16 <jynus@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db1150.eqiad.wmnet with reason: pending s3 reprovisioning [production]