2025-04-30
13:48 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1183', diff saved to https://phabricator.wikimedia.org/P75708 and previous config saved to /var/cache/conftool/dbconfig/20250430-134805-fceratto.json | [production]
13:47 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2044.codfw.wmnet | [production]
13:46 | <mvernon@cumin1002> | START - Cookbook sre.hosts.reboot-single for host moss-be1003.eqiad.wmnet | [production]
13:44 | <mvernon@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host moss-be1002.eqiad.wmnet | [production]
13:43 | <hnowlan@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply | [production]
13:43 | <hnowlan@deploy1003> | helmfile [eqiad] START helmfile.d/services/mw-cron: apply | [production]
13:38 | <mvernon@cumin1002> | START - Cookbook sre.hosts.reboot-single for host moss-be1002.eqiad.wmnet | [production]
13:36 | <mvernon@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host moss-be1001.eqiad.wmnet | [production]
13:36 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2044.codfw.wmnet | [production]
13:35 | <urandom> | invoking `nodetool garbagecollect` on sessionstore1004 — T392989, T390514 | [production]
13:35 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2043.codfw.wmnet | [production]
13:35 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2043.codfw.wmnet | [production]
13:33 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2229.codfw.wmnet with reason: Maintenance | [production]
13:33 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1183 (T392806)', diff saved to https://phabricator.wikimedia.org/P75707 and previous config saved to /var/cache/conftool/dbconfig/20250430-133258-fceratto.json | [production]
13:32 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1173.eqiad.wmnet with reason: Maintenance | [production]
13:31 | <mvernon@cumin1002> | START - Cookbook sre.hosts.reboot-single for host moss-be1001.eqiad.wmnet | [production]
13:29 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2043.codfw.wmnet | [production]
13:29 | <mvernon@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-cluster (exit_code=0) | [production]
13:27 | <Lucas_WMDE> | lucaswerkmeister-wmde@deploy1003 ~ $ mwscript-k8s --comment=T392984 --follow -- namespaceDupes mswikisource --fix | tee T392984 | [production]
13:26 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Depooling db1183 (T392806)', diff saved to https://phabricator.wikimedia.org/P75706 and previous config saved to /var/cache/conftool/dbconfig/20250430-132604-fceratto.json | [production]
13:25 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1183.eqiad.wmnet with reason: Maintenance | [production]
13:25 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1161 (T392806)', diff saved to https://phabricator.wikimedia.org/P75705 and previous config saved to /var/cache/conftool/dbconfig/20250430-132539-fceratto.json | [production]
13:24 | <jnuche@deploy1003> | Installation of scap version "4.158.0" completed for 2 hosts | [production]
13:24 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2043.codfw.wmnet | [production]
13:23 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2042.codfw.wmnet | [production]
13:23 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2042.codfw.wmnet | [production]
13:23 | <jnuche@deploy1003> | Installing scap version "4.158.0" for 2 host(s) | [production]
13:20 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2213.codfw.wmnet with reason: Maintenance | [production]
13:20 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1230.eqiad.wmnet with reason: Maintenance | [production]
13:17 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2042.codfw.wmnet | [production]
13:17 | <stevemunene@deploy1003> | Finished deploy [analytics/refinery@ea1cff2] (thin): Regular analytics weekly train THIN [analytics/refinery@ea1cff2c] (duration: 01m 24s) | [production]
13:16 | <mvernon@cumin1002> | START - Cookbook sre.hosts.reboot-cluster | [production]
13:16 | <lucaswerkmeister-wmde@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1140129|mswikisource: add Karya (Work) and Gerbang (Portal) namespaces (T392984)]] (duration: 12m 10s) | [production]
13:16 | <stevemunene@deploy1003> | Started deploy [analytics/refinery@ea1cff2] (thin): Regular analytics weekly train THIN [analytics/refinery@ea1cff2c] | [production]
13:13 | <stevemunene@deploy1003> | Finished deploy [analytics/refinery@ea1cff2]: Regular analytics weekly train [analytics/refinery@ea1cff2c] (duration: 03m 25s) | [production]
13:12 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2042.codfw.wmnet | [production]
13:11 | <XioNoX> | adjust fundraising NAT policies - T392843 | [production]
13:10 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P75704 and previous config saved to /var/cache/conftool/dbconfig/20250430-131032-fceratto.json | [production]
13:10 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2041.codfw.wmnet | [production]
13:10 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2041.codfw.wmnet | [production]
13:09 | <lucaswerkmeister-wmde@deploy1003> | anzx, lucaswerkmeister-wmde: Continuing with sync | [production]
13:09 | <stevemunene@deploy1003> | Started deploy [analytics/refinery@ea1cff2]: Regular analytics weekly train [analytics/refinery@ea1cff2c] | [production]
13:09 | <stevemunene@deploy1003> | Finished deploy [analytics/refinery@ea1cff2] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@ea1cff2c] (duration: 01m 35s) | [production]
13:09 | <lucaswerkmeister-wmde@deploy1003> | anzx, lucaswerkmeister-wmde: Backport for [[gerrit:1140129|mswikisource: add Karya (Work) and Gerbang (Portal) namespaces (T392984)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production]
13:08 | <stevemunene> | deploying refinery at 1138395: Add rki.wikipedia to pageview allowlist | https://gerrit.wikimedia.org/r/c/analytics/refinery/+/1138395 T392499 | [production]
13:07 | <stevemunene> | Deploying Refinery at 1136103: Add mad.wikisource to pageview allowlist | https://gerrit.wikimedia.org/r/c/analytics/refinery/+/1136103 T391767 | [production]
13:07 | <stevemunene@deploy1003> | Started deploy [analytics/refinery@ea1cff2] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@ea1cff2c] | [production]
13:04 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2041.codfw.wmnet | [production]
13:04 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow7001.magru.wmnet | [production]
13:04 | <lucaswerkmeister-wmde@deploy1003> | Started scap sync-world: Backport for [[gerrit:1140129|mswikisource: add Karya (Work) and Gerbang (Portal) namespaces (T392984)]] | [production]