251-300 of 10000 results (104ms)
2025-04-30 ยง
13:23 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2042.codfw.wmnet [production]
13:23 <jnuche@deploy1003> Installing scap version "4.158.0" for 2 host(s) [production]
13:20 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2213.codfw.wmnet with reason: Maintenance [production]
13:20 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1230.eqiad.wmnet with reason: Maintenance [production]
13:17 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2042.codfw.wmnet [production]
13:17 <stevemunene@deploy1003> Finished deploy [analytics/refinery@ea1cff2] (thin): Regular analytics weekly train THIN [analytics/refinery@ea1cff2c] (duration: 01m 24s) [production]
13:16 <mvernon@cumin1002> START - Cookbook sre.hosts.reboot-cluster [production]
13:16 <lucaswerkmeister-wmde@deploy1003> Finished scap sync-world: Backport for [[gerrit:1140129|mswikisource: add Karya (Work) and Gerbang (Portal) namespaces (T392984)]] (duration: 12m 10s) [production]
13:16 <stevemunene@deploy1003> Started deploy [analytics/refinery@ea1cff2] (thin): Regular analytics weekly train THIN [analytics/refinery@ea1cff2c] [production]
13:13 <stevemunene@deploy1003> Finished deploy [analytics/refinery@ea1cff2]: Regular analytics weekly train [analytics/refinery@ea1cff2c] (duration: 03m 25s) [production]
13:12 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2042.codfw.wmnet [production]
13:11 <XioNoX> adjust fundraising NAT policies - T392843 [production]
13:10 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P75704 and previous config saved to /var/cache/conftool/dbconfig/20250430-131032-fceratto.json [production]
13:10 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2041.codfw.wmnet [production]
13:10 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2041.codfw.wmnet [production]
13:09 <lucaswerkmeister-wmde@deploy1003> anzx, lucaswerkmeister-wmde: Continuing with sync [production]
13:09 <stevemunene@deploy1003> Started deploy [analytics/refinery@ea1cff2]: Regular analytics weekly train [analytics/refinery@ea1cff2c] [production]
13:09 <stevemunene@deploy1003> Finished deploy [analytics/refinery@ea1cff2] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@ea1cff2c] (duration: 01m 35s) [production]
13:09 <lucaswerkmeister-wmde@deploy1003> anzx, lucaswerkmeister-wmde: Backport for [[gerrit:1140129|mswikisource: add Karya (Work) and Gerbang (Portal) namespaces (T392984)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:08 <stevemunene> deploying refinery at 1138395: Add rki.wikipedia to pageview allowlist | https://gerrit.wikimedia.org/r/c/analytics/refinery/+/1138395 T392499 [production]
13:07 <stevemunene> Deploying Refinery at 1136103: Add mad.wikisource to pageview allowlist | https://gerrit.wikimedia.org/r/c/analytics/refinery/+/1136103 T391767 [production]
13:07 <stevemunene@deploy1003> Started deploy [analytics/refinery@ea1cff2] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@ea1cff2c] [production]
13:04 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2041.codfw.wmnet [production]
13:04 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow7001.magru.wmnet [production]
13:04 <lucaswerkmeister-wmde@deploy1003> Started scap sync-world: Backport for [[gerrit:1140129|mswikisource: add Karya (Work) and Gerbang (Portal) namespaces (T392984)]] [production]
13:01 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2041.codfw.wmnet [production]
13:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow7001.magru.wmnet [production]
12:59 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2040.codfw.wmnet [production]
12:58 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2040.codfw.wmnet [production]
12:57 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow6001.drmrs.wmnet [production]
12:55 <damilare> config revision changed from 817b0c94 to 45e49fec [production]
12:55 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P75703 and previous config saved to /var/cache/conftool/dbconfig/20250430-125525-fceratto.json [production]
12:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow6001.drmrs.wmnet [production]
12:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2040.codfw.wmnet [production]
12:50 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:50 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:50 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2040.codfw.wmnet [production]
12:49 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:49 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:49 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow5002.eqsin.wmnet [production]
12:48 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2039.codfw.wmnet [production]
12:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2039.codfw.wmnet [production]
12:44 <godog> reboot alert2002 [production]
12:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow5002.eqsin.wmnet [production]
12:43 <filippo@cumin1002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1:00:00 on alert2002.wikimedia.org with reason: new kernel [production]
12:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2039.codfw.wmnet [production]
12:42 <filippo@cumin1002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 4:00:00 on alert2002.wikimedia.org with reason: kernel [production]
12:41 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow4002.ulsfo.wmnet [production]
12:40 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T392806)', diff saved to https://phabricator.wikimedia.org/P75702 and previous config saved to /var/cache/conftool/dbconfig/20250430-124018-fceratto.json [production]
12:38 <filippo@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2002.wikimedia.org [production]