2025-06-23
09:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1221 (T396130)', diff saved to https://phabricator.wikimedia.org/P78600 and previous config saved to /var/cache/conftool/dbconfig/20250623-091123-marostegui.json |
[production] |
09:08 |
<taavi> |
restrict logging in to tools-sgebastion-10 (aka login-buster) T397459 |
[tools] |
09:07 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1220.eqiad.wmnet with reason: Maintenance |
[production] |
09:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1220', diff saved to https://phabricator.wikimedia.org/P78599 and previous config saved to /var/cache/conftool/dbconfig/20250623-090619-root.json |
[production] |
09:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1224 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78598 and previous config saved to /var/cache/conftool/dbconfig/20250623-090305-root.json |
[production] |
08:56 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P78597 and previous config saved to /var/cache/conftool/dbconfig/20250623-085616-marostegui.json |
[production] |
08:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1224 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P78596 and previous config saved to /var/cache/conftool/dbconfig/20250623-084800-root.json |
[production] |
08:44 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be2007.codfw.wmnet |
[production] |
08:41 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P78595 and previous config saved to /var/cache/conftool/dbconfig/20250623-084108-marostegui.json |
[production] |
08:39 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1224', diff saved to https://phabricator.wikimedia.org/P78594 and previous config saved to /var/cache/conftool/dbconfig/20250623-083954-root.json |
[production] |
08:39 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1224.eqiad.wmnet with reason: Maintenance |
[production] |
08:38 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host thanos-be2007.codfw.wmnet |
[production] |
08:36 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-be2006.codfw.wmnet |
[production] |
08:31 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host thanos-be2006.codfw.wmnet |
[production] |
08:26 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1221 (T396130)', diff saved to https://phabricator.wikimedia.org/P78592 and previous config saved to /var/cache/conftool/dbconfig/20250623-082600-marostegui.json |
[production] |
08:20 |
<btullis> |
applying views fix to an-redacteddb1001 for T397508 |
[analytics] |
08:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1221 (T396130)', diff saved to https://phabricator.wikimedia.org/P78591 and previous config saved to /var/cache/conftool/dbconfig/20250623-081920-marostegui.json |
[production] |
08:19 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1015,1019].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
08:18 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1221.eqiad.wmnet with reason: Maintenance |
[production] |
08:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1199 (T396130)', diff saved to https://phabricator.wikimedia.org/P78590 and previous config saved to /var/cache/conftool/dbconfig/20250623-081839-marostegui.json |
[production] |
08:15 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db[1217,1228].eqiad.wmnet with reason: Maintenance |
[production] |
08:09 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2160.codfw.wmnet with reason: Maintenance |
[production] |
08:09 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2233.codfw.wmnet with reason: Maintenance |
[production] |
08:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P78588 and previous config saved to /var/cache/conftool/dbconfig/20250623-080332-marostegui.json |
[production] |
07:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P78587 and previous config saved to /var/cache/conftool/dbconfig/20250623-074824-marostegui.json |
[production] |
07:43 |
<stevemunene> |
restart 'hadoop-hdfs-zkfc.service' on an-master1003 T374922 |
[analytics] |
07:42 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
07:41 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
07:41 |
<stevemunene> |
restart 'hadoop-hdfs-zkfc.service' on an-master1004 T374922 |
[analytics] |
07:37 |
<hashar@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1161622|ApiQueryZFunctionReference: Return an actual empty array instead of [false] (T396978)]], [[gerrit:1154121|captureSpeedtest: Drop PHP 7 check, no longer needed]], [[gerrit:1156351|diffConfig: Add a quick list of affected wikis to the end of the output]] (duration: 41m 07s) |
[production] |
07:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1199 (T396130)', diff saved to https://phabricator.wikimedia.org/P78586 and previous config saved to /var/cache/conftool/dbconfig/20250623-073316-marostegui.json |
[production] |
07:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2039 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78585 and previous config saved to /var/cache/conftool/dbconfig/20250623-073145-root.json |
[production] |
07:31 |
<stevemunene@cumin1002> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. |
[production] |
07:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1199 (T396130)', diff saved to https://phabricator.wikimedia.org/P78584 and previous config saved to /var/cache/conftool/dbconfig/20250623-072542-marostegui.json |
[production] |
07:25 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1199.eqiad.wmnet with reason: Maintenance |
[production] |
07:25 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1190 (T396130)', diff saved to https://phabricator.wikimedia.org/P78583 and previous config saved to /var/cache/conftool/dbconfig/20250623-072519-marostegui.json |
[production] |
07:25 |
<stevemunene@cumin1002> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. |
[production] |
07:24 |
<hashar@deploy1003> |
hashar, jforrester: Continuing with sync |
[production] |
07:23 |
<stevemunene> |
roll restart zookeeper analytics cluster to pick up removed hosts T374922 |
[analytics] |
07:20 |
<James_F> |
Zuul: [mediawiki/extensions/EventLogging] Add CodeEditor Phan dependency, for T346540 |
[releng] |
07:20 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1228.eqiad.wmnet with reason: Maintenance |
[production] |
07:20 |
<stevemunene> |
remove an-conf100[1-3] from the zookeeper analytics cluster T374922 |
[analytics] |
07:18 |
<hashar@deploy1003> |
hashar, jforrester: Backport for [[gerrit:1161622|ApiQueryZFunctionReference: Return an actual empty array instead of [false] (T396978)]], [[gerrit:1154121|captureSpeedtest: Drop PHP 7 check, no longer needed]], [[gerrit:1156351|diffConfig: Add a quick list of affected wikis to the end of the output]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be ver |
[production] |
07:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2039 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78582 and previous config saved to /var/cache/conftool/dbconfig/20250623-071639-root.json |
[production] |
07:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78581 and previous config saved to /var/cache/conftool/dbconfig/20250623-071618-root.json |
[production] |
07:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1190', diff saved to https://phabricator.wikimedia.org/P78580 and previous config saved to /var/cache/conftool/dbconfig/20250623-071011-marostegui.json |
[production] |
07:06 |
<marostegui> |
Failover m5 from db1228 to db1164 - T397413 |
[production] |
07:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2039 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P78579 and previous config saved to /var/cache/conftool/dbconfig/20250623-070134-root.json |
[production] |
07:01 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78578 and previous config saved to /var/cache/conftool/dbconfig/20250623-070112-root.json |
[production] |
06:58 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2235].codfw.wmnet,db[1164,1217,1228].eqiad.wmnet with reason: m5 master switch T397413 |
[production] |