3051-3100 of 10000 results (122ms)
2024-11-27 ยง
16:05 <fabfur@cumin1002> START - Cookbook sre.dns.netbox [production]
16:03 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1178', diff saved to https://phabricator.wikimedia.org/P71217 and previous config saved to /var/cache/conftool/dbconfig/20241127-160330-ladsgroup.json [production]
15:48 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1178 (T370903)', diff saved to https://phabricator.wikimedia.org/P71216 and previous config saved to /var/cache/conftool/dbconfig/20241127-154823-ladsgroup.json [production]
15:41 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1027.eqiad.wmnet with OS bullseye [production]
15:41 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1026.eqiad.wmnet with OS bullseye [production]
15:33 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1178 (T370903)', diff saved to https://phabricator.wikimedia.org/P71215 and previous config saved to /var/cache/conftool/dbconfig/20241127-153316-ladsgroup.json [production]
15:33 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
15:32 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
15:32 <ecarg@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
15:31 <ecarg@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
15:30 <ecarg@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
15:30 <ecarg@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
15:28 <ecarg@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
15:27 <ecarg@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
15:22 <ecarg@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
15:22 <ecarg@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
15:21 <ecarg@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
15:20 <ecarg@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
15:09 <ecarg@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
15:08 <ecarg@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
15:08 <Krinkle> krinkle@webperf2003: `sudo apt-get install kafkacat` (matching webperf1003, for ad-hoc debugging) [production]
15:05 <kart_> Updated recommendation-api to 2024-11-27-142924-production (T380838, T379036, T380699) [production]
15:04 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ml-etcd1003.eqiad.wmnet to plain [production]
15:03 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ml-etcd1003.eqiad.wmnet to plain [production]
15:02 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1022.eqiad.wmnet [production]
15:02 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1022.eqiad.wmnet [production]
15:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ml-etcd1003.eqiad.wmnet to drbd [production]
14:59 <kartik@deploy2002> helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
14:58 <kartik@deploy2002> helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
14:51 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ml-etcd1003.eqiad.wmnet to drbd [production]
14:48 <kartik@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
14:43 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1022.eqiad.wmnet [production]
14:39 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1022.eqiad.wmnet [production]
14:35 <moritzm> rebalance magru01 following switch of VMs back to DRBD T376737 [production]
14:33 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on doh[7001-7002].wikimedia.org with reason: site is depooled, maintenance [production]
14:33 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 6:00:00 on doh[7001-7002].wikimedia.org with reason: site is depooled, maintenance [production]
14:33 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1097309|[GrowthExperiments] Undefine wgGEDatabaseCluster (T354939)]] (duration: 12m 21s) [production]
14:30 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh7001.wikimedia.org to drbd [production]
14:26 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
14:26 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:1097309|[GrowthExperiments] Undefine wgGEDatabaseCluster (T354939)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:25 <fnegri@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on cloudvirt1061.eqiad.wmnet with reason: cloudvirt1061 needs maintenance T380673 [production]
14:25 <fnegri@cumin1002> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on cloudvirt1061.eqiad.wmnet with reason: cloudvirt1061 needs maintenance T380673 [production]
14:24 <urbanecm> Purge https://en.wikipedia.org/static/images/mobile/copyright/wikiquote-wordmark-az.svg (T380974) [production]
14:21 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host wdqs1027.eqiad.wmnet with OS bullseye [production]
14:21 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host wdqs1026.eqiad.wmnet with OS bullseye [production]
14:20 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1097309|[GrowthExperiments] Undefine wgGEDatabaseCluster (T354939)]] [production]
14:20 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1098076|Enable ParserMigration compact indicator on all wikis (T363484)]], [[gerrit:1093405|Deploy Parsoid Read Views to de/ru wikivoyage and dagwiki (T375394 T380401)]], [[gerrit:1098019|Updated wordmark for Azerbaijani Wikiquote (T380974)]] (duration: 17m 20s) [production]
14:20 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of doh7001.wikimedia.org to drbd [production]
14:19 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of durum7001.magru.wmnet to drbd [production]
14:13 <urbanecm@deploy2002> urbanecm, cscott, nmw03: Continuing with sync [production]