|
2026-04-29
ยง
|
| 08:37 |
<urbanecm@deploy1003> |
mwscript-k8s job started: extensions/Translate/scripts/moveTranslatableBundle.php --wiki=mediawikiwiki Wikimedia_Apps/Team/Android/TriviaGame 'Wikimedia Apps/Team/Android/Which' came 'first? Game' 'Martin Urbanec (WMF)' '--reason=per [[:phab:T423845]]' # T423845 |
[production] |
| 08:29 |
<elukey@deploy1003> |
helmfile [staging] DONE helmfile.d/services/wikifunctions: sync |
[production] |
| 08:29 |
<elukey@deploy1003> |
helmfile [staging] START helmfile.d/services/wikifunctions: sync |
[production] |
| 08:28 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool pool db2189: after reimage to trixie |
[production] |
| 08:24 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool pool db1233: after reimage to trixie |
[production] |
| 08:24 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2194.codfw.wmnet with reason: host reimage |
[production] |
| 08:24 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2189.codfw.wmnet with OS trixie |
[production] |
| 08:21 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1233.eqiad.wmnet with OS trixie |
[production] |
| 08:21 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2194.codfw.wmnet with reason: host reimage |
[production] |
| 08:18 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1175.eqiad.wmnet with reason: host reimage |
[production] |
| 08:18 |
<Emperor> |
re-enable puppet in apus/codfw for TLS key rollover T424674 (no change, incident took over) |
[production] |
| 08:16 |
<Emperor> |
disable puppet in apus/codfw for TLS key rollover T424674 |
[production] |
| 08:14 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1175.eqiad.wmnet with reason: host reimage |
[production] |
| 08:09 |
<dpogorzelski@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
| 08:08 |
<a-pizzata@deploy1003> |
Finished deploy [analytics/refinery@d6a17a0] (thin): Regular analytics weekly train THIN [analytics/refinery@d6a17a0a] (duration: 01m 54s) |
[production] |
| 08:06 |
<a-pizzata@deploy1003> |
Started deploy [analytics/refinery@d6a17a0] (thin): Regular analytics weekly train THIN [analytics/refinery@d6a17a0a] |
[production] |
| 08:02 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2189.codfw.wmnet with reason: host reimage |
[production] |
| 07:59 |
<a-pizzata@deploy1003> |
Finished deploy [analytics/refinery@d6a17a0]: Regular analytics weekly train [analytics/refinery@d6a17a0a] (duration: 04m 12s) |
[production] |
| 07:59 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host db2194.codfw.wmnet with OS trixie |
[production] |
| 07:59 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1233.eqiad.wmnet with reason: host reimage |
[production] |
| 07:58 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host db1175.eqiad.wmnet with OS trixie |
[production] |
| 07:57 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2194: Reimage to Trixie |
[production] |
| 07:57 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool db2194: Reimage to Trixie |
[production] |
| 07:57 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2194.codfw.wmnet with reason: Reimage to Trixie |
[production] |
| 07:56 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db2227: after reimage to trixie |
[production] |
| 07:56 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1175: Reimage to Trixie |
[production] |
| 07:56 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool db1175: Reimage to Trixie |
[production] |
| 07:55 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1175.eqiad.wmnet with reason: Reimage to Trixie |
[production] |
| 07:55 |
<a-pizzata@deploy1003> |
Started deploy [analytics/refinery@d6a17a0]: Regular analytics weekly train [analytics/refinery@d6a17a0a] |
[production] |
| 07:55 |
<a-pizzata@deploy1003> |
Finished deploy [analytics/refinery@d6a17a0] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@d6a17a0a] (duration: 01m 57s) |
[production] |
| 07:53 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2189.codfw.wmnet with reason: host reimage |
[production] |
| 07:53 |
<a-pizzata@deploy1003> |
Started deploy [analytics/refinery@d6a17a0] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@d6a17a0a] |
[production] |
| 07:52 |
<dpogorzelski@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
| 07:52 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1198: after reimage to trixie |
[production] |
| 07:52 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1233.eqiad.wmnet with reason: host reimage |
[production] |
| 07:39 |
<ryankemper> |
T422860 [cloudelastic] Restarted opensearch services on `cloudelastic1011` and `cloudelastic1012` (needed to pick up missing opensearch plugins, which have already been fixed in puppet) (note: this was done ~2h ago; logged in wrong channel) |
[production] |
| 07:38 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.reimage for host cloudelastic1007.eqiad.wmnet with OS trixie |
[production] |
| 07:34 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host db1233.eqiad.wmnet with OS trixie |
[production] |
| 07:34 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host db2189.codfw.wmnet with OS trixie |
[production] |
| 07:33 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2189: Reimage to Trixie |
[production] |
| 07:33 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1233: Reimage to Trixie |
[production] |
| 07:32 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool db1233: Reimage to Trixie |
[production] |
| 07:32 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1233.eqiad.wmnet with reason: Reimage to Trixie |
[production] |
| 07:32 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool db2189: Reimage to Trixie |
[production] |
| 07:32 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2189.codfw.wmnet with reason: Reimage to Trixie |
[production] |
| 07:31 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
| 07:30 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker1039.eqiad.wmnet |
[production] |
| 07:30 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker1039.eqiad.wmnet |
[production] |
| 07:25 |
<jayme@cumin1003> |
END (FAIL) - Cookbook sre.k8s.pool-depool-node (exit_code=99) depool for host wikikube-worker1039.eqiad.wmnet |
[production] |
| 07:24 |
<dcausse@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1278509|Completion: fix near match field name (T417648)]], [[gerrit:1269464|search: add alt. completion indices to test keyword tokenizer (1/2) (T420427)]] (duration: 07m 07s) |
[production] |