4701-4750 of 10000 results (76ms)
2023-01-30 ยง
14:17 <root@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 34 hosts with reason: Primary switchover s4 T328022 [production]
14:12 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db2105 (re)pooling @ 25%: Maint over', diff saved to https://phabricator.wikimedia.org/P43488 and previous config saved to /var/cache/conftool/dbconfig/20230130-141203-ladsgroup.json [production]
14:07 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2109 (T328255)', diff saved to https://phabricator.wikimedia.org/P43487 and previous config saved to /var/cache/conftool/dbconfig/20230130-140710-ladsgroup.json [production]
13:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db2105 (re)pooling @ 10%: Maint over', diff saved to https://phabricator.wikimedia.org/P43486 and previous config saved to /var/cache/conftool/dbconfig/20230130-135659-ladsgroup.json [production]
13:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2109 (T328255)', diff saved to https://phabricator.wikimedia.org/P43485 and previous config saved to /var/cache/conftool/dbconfig/20230130-135632-ladsgroup.json [production]
13:56 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2109.codfw.wmnet with reason: Maintenance [production]
13:56 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2109.codfw.wmnet with reason: Maintenance [production]
13:47 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2105.codfw.wmnet with reason: Maintenance [production]
13:47 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2105.codfw.wmnet with reason: Maintenance [production]
13:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2105 (T328255)', diff saved to https://phabricator.wikimedia.org/P43484 and previous config saved to /var/cache/conftool/dbconfig/20230130-134406-ladsgroup.json [production]
13:44 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2105.codfw.wmnet with reason: Maintenance [production]
13:43 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2105.codfw.wmnet with reason: Maintenance [production]
13:31 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@5c58f8f] (codfw): Disable traffic mirroring from codfw to eqiad (duration: 01m 23s) [production]
13:29 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@5c58f8f] (codfw): Disable traffic mirroring from codfw to eqiad [production]
13:29 <godog> bounce logstash on logstash1025 -- GC unhappy causing kafka lag [production]
13:29 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@5c58f8f] (eqiad): Disable traffic mirroring from codfw to eqiad (duration: 01m 13s) [production]
13:28 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@5c58f8f] (eqiad): Disable traffic mirroring from codfw to eqiad [production]
13:27 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T328255)', diff saved to https://phabricator.wikimedia.org/P43483 and previous config saved to /var/cache/conftool/dbconfig/20230130-132701-ladsgroup.json [production]
13:23 <awight@deploy1002> Finished scap: Backport for [[gerrit:884496|Revert "Enable kartographer external data parse time fetch for all wikis" (T323113)]] (duration: 08m 34s) [production]
13:21 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@5c58f8f] (eqiad): Disable traffic mirroring from codfw to eqiad (duration: 00m 11s) [production]
13:21 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@5c58f8f] (eqiad): Disable traffic mirroring from codfw to eqiad [production]
13:21 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@5c58f8f] (codfw): Disable traffic mirroring from codfw to eqiad (duration: 00m 22s) [production]
13:20 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@5c58f8f] (codfw): Disable traffic mirroring from codfw to eqiad [production]
13:16 <awight@deploy1002> awight: Backport for [[gerrit:884496|Revert "Enable kartographer external data parse time fetch for all wikis" (T323113)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet [production]
13:14 <awight@deploy1002> Started scap: Backport for [[gerrit:884496|Revert "Enable kartographer external data parse time fetch for all wikis" (T323113)]] [production]
13:11 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P43482 and previous config saved to /var/cache/conftool/dbconfig/20230130-131155-ladsgroup.json [production]
13:00 <jgiannelos@deploy1002> helmfile [eqiad] DONE helmfile.d/services/wikifeeds: apply [production]
12:59 <jgiannelos@deploy1002> helmfile [eqiad] START helmfile.d/services/wikifeeds: apply [production]
12:58 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts bast3004.wikimedia.org [production]
12:58 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:58 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast3004.wikimedia.org decommissioned, removing all IPs except the asset tag one - jmm@cumin2002" [production]
12:57 <jgiannelos@deploy1002> helmfile [codfw] DONE helmfile.d/services/wikifeeds: apply [production]
12:56 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P43481 and previous config saved to /var/cache/conftool/dbconfig/20230130-125648-ladsgroup.json [production]
12:56 <jgiannelos@deploy1002> helmfile [codfw] START helmfile.d/services/wikifeeds: apply [production]
12:55 <jgiannelos@deploy1002> helmfile [staging] DONE helmfile.d/services/wikifeeds: apply [production]
12:55 <jgiannelos@deploy1002> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
12:55 <awight@deploy1002> scap failed: CalledProcessError Command '/usr/local/bin/mwscript mergeMessageFileList.php --wiki=aawiki --force-version "1.40.0-wmf.20" --list-file="/srv/mediawiki-staging/wmf-config/extension-list" --output="/tmp/tmp.2oaGSEpQR1"' returned non-zero exit status 255. (duration: 00m 00s) [production]
12:55 <awight@deploy1002> Started scap: Backport for [[gerrit:884496|Revert "Enable kartographer external data parse time fetch for all wikis" (T323113)]] [production]
12:46 <awight@deploy1002> Finished deploy [kartotherian/deploy@5c58f8f]: Roll back kartotherian (duration: 01m 27s) [production]
12:45 <awight@deploy1002> Started deploy [kartotherian/deploy@5c58f8f]: Roll back kartotherian [production]
12:41 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T328255)', diff saved to https://phabricator.wikimedia.org/P43479 and previous config saved to /var/cache/conftool/dbconfig/20230130-124142-ladsgroup.json [production]
12:30 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2177 (T328255)', diff saved to https://phabricator.wikimedia.org/P43478 and previous config saved to /var/cache/conftool/dbconfig/20230130-123004-ladsgroup.json [production]
12:29 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2177.codfw.wmnet with reason: Maintenance [production]
12:29 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2177.codfw.wmnet with reason: Maintenance [production]
12:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2156 (T328255)', diff saved to https://phabricator.wikimedia.org/P43477 and previous config saved to /var/cache/conftool/dbconfig/20230130-122943-ladsgroup.json [production]
12:25 <awight@deploy1002> Finished deploy [kartotherian/deploy@42a07d3]: Disable traffic mirroring from codfw to eqiad (duration: 02m 44s) [production]
12:25 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: bast3004.wikimedia.org decommissioned, removing all IPs except the asset tag one - jmm@cumin2002" [production]
12:23 <awight@deploy1002> Started deploy [kartotherian/deploy@42a07d3]: Disable traffic mirroring from codfw to eqiad [production]
12:22 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
12:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P43476 and previous config saved to /var/cache/conftool/dbconfig/20230130-121437-ladsgroup.json [production]