1601-1650 of 10000 results (91ms)
2023-04-04 ยง
15:49 <dancy@deploy2002> Installing scap version "4.48.0" for 592 hosts [production]
15:31 <jynus> restart es1021, several connections in a "stuck" state T333961 [production]
15:25 <jynus@cumin1001> dbctl commit (dc=all): 'Depool es1021 reads', diff saved to https://phabricator.wikimedia.org/P46029 and previous config saved to /var/cache/conftool/dbconfig/20230404-152501-jynus.json [production]
15:23 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
15:19 <jiji@cumin1001> END (FAIL) - Cookbook sre.discovery.datacenter (exit_code=93) pool all active/active services in eqiad: eqiad row C switches upgrade - T331882 [production]
15:18 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:905648|external store: Depool es4 (cluster26) from writes for maintenance (T333961)]] (duration: 11m 30s) [production]
15:16 <jynus@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db1150.eqiad.wmnet with reason: pending s3 reprovisioning [production]
15:16 <jynus@cumin1001> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db1150.eqiad.wmnet with reason: pending s3 reprovisioning [production]
15:12 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
15:08 <ladsgroup@deploy2002> ladsgroup and jynus: Backport for [[gerrit:905648|external store: Depool es4 (cluster26) from writes for maintenance (T333961)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
15:06 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:905648|external store: Depool es4 (cluster26) from writes for maintenance (T333961)]] [production]
14:54 <urbanecm> [urbanecm@mwmaint2002 /srv/mediawiki/php]$ mwscript extensions/CentralAuth/maintenance/migrateAccount.php --wiki=metawiki -u 'Translation Notification Bot (T255246)' --auto # T255246 [production]
14:43 <jiji@cumin1001> START - Cookbook sre.discovery.datacenter pool all active/active services in eqiad: eqiad row C switches upgrade - T331882 [production]
14:39 <jgiannelos@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifeeds: apply [production]
14:39 <jgiannelos@deploy2002> helmfile [codfw] START helmfile.d/services/wikifeeds: apply [production]
14:38 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifeeds: apply [production]
14:38 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifeeds: apply [production]
14:38 <jgiannelos@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifeeds: apply [production]
14:37 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
14:36 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
14:36 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
14:28 <vgutierrez> switch cp6008 (upload) and cp6016 (text) to use a single UDS socket between haproxy and varnish - T333965 [production]
14:21 <jynus> stop es1022 for debugging T333961 [production]
14:15 <Lucas_WMDE> UTC afternoon backport+config window done [production]
14:15 <lucaswerkmeister-wmde@deploy2002> Finished scap: Backport for [[gerrit:905598|Use HookContainer to register hooks inside hooks (T333926)]] (duration: 10m 50s) [production]
14:10 <stevemunene@puppetmaster1001> conftool action : set/pooled=yes; selector: name=aqs1018.eqiad.wmnet [production]
14:09 <stevemunene@puppetmaster1001> conftool action : set/pooled=yes; selector: name=aqs1013.eqiad.wmnet [production]
14:09 <stevemunene@puppetmaster1001> conftool action : set/pooled=yes; selector: name=aqs1012.eqiad.wmnet [production]
14:09 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 33 [production]
14:09 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 33 [production]
14:09 <stevemunene@puppetmaster1001> conftool action : set/pooled=yes; selector: name=datahubsearch1003.eqiad.wmnet [production]
14:05 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde: Backport for [[gerrit:905598|Use HookContainer to register hooks inside hooks (T333926)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
14:04 <lucaswerkmeister-wmde@deploy2002> Started scap: Backport for [[gerrit:905598|Use HookContainer to register hooks inside hooks (T333926)]] [production]
13:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool es1022 T333961', diff saved to https://phabricator.wikimedia.org/P46027 and previous config saved to /var/cache/conftool/dbconfig/20230404-134415-ladsgroup.json [production]
13:42 <Emperor> repool thanos-fe1003 re T331882 [production]
13:41 <Emperor> repool ms-fe1011 re T331882 [production]
13:38 <steve_munene> leave hdfs safemode T331882 [production]
13:38 <inflatador> reboot elastic2038 to clear soft lock [production]
13:34 <sukhe> run authdns-update for CR 905612, reverting depool of eqiad [production]
13:30 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=thumbor1006.eqiad.wmnet [production]
13:25 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-web: apply [production]
13:25 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-web: apply [production]
13:13 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=thumbor1006.eqiad.wmnet [production]
13:11 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=maps1009.eqiad.wmnet [production]
13:11 <XioNoX> asw2-c-eqiad> request system reboot all-members - T331882 [production]
13:10 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:905544|ckbwiktionary: Add logo (T331831)]] (duration: 07m 00s) [production]
13:05 <akosiaris@cumin1001> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all active/active services in eqiad: eqiad row C switches upgrade - T331882 [production]
13:03 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:905544|ckbwiktionary: Add logo (T331831)]] [production]
13:02 <ayounsi@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 227 hosts with reason: eqiad row C upgrade [production]
12:57 <ayounsi@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 227 hosts with reason: eqiad row C upgrade [production]