|
2026-03-18
|
| 23:58 |
<mutante> |
releases2003 - kill 782 (stunnel4) - systemctl start stunnel4 - fix T420246 T420388 T420411 |
[production] |
| 23:57 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host kafkamon1003.eqiad.wmnet |
[production] |
| 23:49 |
<eevans@cumin1003> |
END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev |
[production] |
| 23:23 |
<eevans@cumin1003> |
START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev |
[production] |
| 23:08 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5017.* |
[production] |
| 23:02 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5020.* |
[production] |
| 23:01 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5028.* |
[production] |
| 22:40 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 22:16 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5020.eqsin.wmnet with OS trixie |
[production] |
| 22:08 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage |
[production] |
| 22:04 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage |
[production] |
| 21:51 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-logging-eqiad |
[production] |
| 21:49 |
<sukhe@cumin1003> |
END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on A:dnsbox |
[production] |
| 21:49 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot finished rebooting dns7002.wikimedia.org |
[production] |
| 21:44 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage |
[production] |
| 21:41 |
<brett@puppetserver1001> |
conftool action : set/pooled=yes; selector: name=cp5027.* |
[production] |
| 21:40 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage |
[production] |
| 21:31 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:30 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:30 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot begin reboot of dns7002.wikimedia.org |
[production] |
| 21:29 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5027.eqsin.wmnet with OS trixie |
[production] |
| 21:27 |
<jforrester@deploy2002> |
mwscript-k8s job started: extensions/WikimediaMaintenance/maintenance/addWiki.php --wiki=abstractwiki # T411723 addWiki.php run |
[production] |
| 21:26 |
<jforrester@deploy2002> |
mwscript-k8s job started: extensions/WikimediaMaintenance/maintenance/addWiki.php --wiki=abstractwiki # T411723 addWiki.php run |
[production] |
| 21:24 |
<jforrester@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] (duration: 06m 44s) |
[production] |
| 21:20 |
<jforrester@deploy2002> |
jforrester: Continuing with sync |
[production] |
| 21:20 |
<jforrester@deploy2002> |
jforrester: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 21:17 |
<jforrester@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] |
[production] |
| 21:16 |
<cdobbins@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5017.eqsin.wmnet with OS trixie |
[production] |
| 21:15 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot finished rebooting dns7001.wikimedia.org |
[production] |
| 21:12 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5020.eqsin.wmnet with OS trixie |
[production] |
| 21:11 |
<dcausse> |
T403775: reindexing all wikis to enable new sorting options |
[deployment-prep] |
| 21:11 |
<dcausse> |
T403775: reindexing all wikis to enable new sorting options |
[releng] |
| 21:08 |
<jdlrobson@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] (duration: 11m 20s) |
[production] |
| 21:07 |
<dcausse> |
restarting opensearch on deployment-cirrussearch(12|13|14) instances to pickup new plugin versions |
[deployment-prep] |
| 21:07 |
<dcausse> |
restarting opensearch on deployment-cirrussearch(12|13|14) instances to pickup new plugin versions |
[releng] |
| 21:07 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:07 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie |
[production] |
| 21:04 |
<jdlrobson@deploy2002> |
jdlrobson, harroyo-wmf: Continuing with sync |
[production] |
| 20:59 |
<jdlrobson@deploy2002> |
jdlrobson, harroyo-wmf: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 20:59 |
<herron@cumin1003> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-logging-eqiad |
[production] |
| 20:58 |
<sukhe@cumin1003> |
cookbooks.sre.dns.roll-reboot begin reboot of dns7001.wikimedia.org |
[production] |
| 20:58 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5027.eqsin.wmnet with reason: host reimage |
[production] |
| 20:57 |
<wmbot~jeanfred@tools-bastion-15> |
Deploy f9dfb79 (Add /healthz endpoint for automatic health check) |
[tools.integraality] |
| 20:57 |
<jdlrobson@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] |
[production] |
| 20:52 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-logging-codfw |
[production] |
| 20:51 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cp5027.eqsin.wmnet with reason: host reimage |
[production] |
| 20:51 |
<jhathaway@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mx-in1001.wikimedia.org with reason: T419960 |
[production] |