|
2026-03-19
§
|
| 08:12 |
<aklapper@deploy2002> |
rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.20 refs T413811 |
[production] |
| 08:08 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti1048.eqiad.wmnet |
[production] |
| 07:36 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1048.eqiad.wmnet |
[production] |
| 07:17 |
<aokoth@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
| 07:17 |
<aokoth@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply |
[production] |
| 07:16 |
<aokoth@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
| 07:16 |
<aokoth@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply |
[production] |
| 07:14 |
<aokoth@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
| 07:14 |
<aokoth@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/services/miscweb: apply |
[production] |
| 04:53 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
| 00:06 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon2003.codfw.wmnet |
[production] |
| 00:02 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host kafkamon2003.codfw.wmnet |
[production] |
| 00:01 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon1003.eqiad.wmnet |
[production] |
|
2026-03-18
§
|
| 23:58 |
<mutante> |
releases2003 - kill 782 (stunnel4) - systemctl start stunnel4 - fix T420246 T420388 T420411 |
[production] |
| 23:57 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host kafkamon1003.eqiad.wmnet |
[production] |
| 23:49 |
<eevans@cumin1003> |
END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev |
[production] |
| 23:23 |
<eevans@cumin1003> |
START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev |
[production] |
| 23:08 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5017.* |
[production] |
| 23:02 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5020.* |
[production] |
| 23:01 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5028.* |
[production] |
| 22:40 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 22:16 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5020.eqsin.wmnet with OS trixie |
[production] |
| 22:08 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage |
[production] |
| 22:04 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage |
[production] |
| 21:51 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-logging-eqiad |
[production] |
| 21:49 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on A:dnsbox |
[production] |
| 21:49 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot finished rebooting dns7002.wikimedia.org |
[production] |
| 21:44 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage |
[production] |
| 21:41 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5027.* |
[production] |
| 21:40 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage |
[production] |
| 21:31 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:30 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:30 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot begin reboot of dns7002.wikimedia.org |
[production] |
| 21:29 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5027.eqsin.wmnet with OS trixie |
[production] |
| 21:27 |
<jforrester@deploy2002> |
mwscript-k8s job started: extensions/WikimediaMaintenance/maintenance/addWiki.php --wiki=abstractwiki # T411723 addWiki.php run |
[production] |
| 21:26 |
<jforrester@deploy2002> |
mwscript-k8s job started: extensions/WikimediaMaintenance/maintenance/addWiki.php --wiki=abstractwiki # T411723 addWiki.php run |
[production] |
| 21:24 |
<jforrester@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] (duration: 06m 44s) |
[production] |
| 21:20 |
<jforrester@deploy2002> |
jforrester: Continuing with sync |
[production] |
| 21:20 |
<jforrester@deploy2002> |
jforrester: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:17 |
<jforrester@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] |
[production] |
| 21:16 |
<cdobbins@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5017.eqsin.wmnet with OS trixie |
[production] |
| 21:15 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot finished rebooting dns7001.wikimedia.org |
[production] |
| 21:12 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5020.eqsin.wmnet with OS trixie |
[production] |
| 21:08 |
<jdlrobson@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] (duration: 11m 20s) |
[production] |
| 21:07 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:07 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:04 |
<jdlrobson@deploy2002> |
jdlrobson, harroyo-wmf: Continuing with sync |
[production] |
| 20:59 |
<jdlrobson@deploy2002> |
jdlrobson, harroyo-wmf: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 20:59 |
<herron@cumin1003> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-logging-eqiad |
[production] |
| 20:58 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot begin reboot of dns7001.wikimedia.org |
[production] |