1-50 of 10000 results (95ms)
2026-03-19 §
04:53 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
00:06 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon2003.codfw.wmnet [production]
00:02 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host kafkamon2003.codfw.wmnet [production]
00:01 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon1003.eqiad.wmnet [production]
2026-03-18 §
23:58 <mutante> releases2003 - kill 782 (stunnel4) - systemctl start stunnel4 - fix T420246 T420388 T420411 [production]
23:57 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host kafkamon1003.eqiad.wmnet [production]
23:49 <eevans@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-reboot (exit_code=0) rolling reboot on A:cassandra-dev [production]
23:23 <eevans@cumin1003> START - Cookbook sre.cassandra.roll-reboot rolling reboot on A:cassandra-dev [production]
23:08 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5017.* [production]
23:02 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5020.* [production]
23:01 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5028.* [production]
22:40 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5028.eqsin.wmnet with OS trixie [production]
22:16 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5020.eqsin.wmnet with OS trixie [production]
22:08 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage [production]
22:04 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5028.eqsin.wmnet with reason: host reimage [production]
21:51 <herron@cumin1003> END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-logging-eqiad [production]
21:49 <sukhe@cumin1003> END (PASS) - Cookbook sre.dns.roll-reboot (exit_code=0) rolling reboot on A:dnsbox [production]
21:49 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot finished rebooting dns7002.wikimedia.org [production]
21:44 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage [production]
21:41 <brett@puppetserver1001> conftool action : set/pooled=yes; selector: name=cp5027.* [production]
21:40 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5020.eqsin.wmnet with reason: host reimage [production]
21:31 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie [production]
21:30 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie [production]
21:30 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot begin reboot of dns7002.wikimedia.org [production]
21:29 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5027.eqsin.wmnet with OS trixie [production]
21:27 <jforrester@deploy2002> mwscript-k8s job started: extensions/WikimediaMaintenance/maintenance/addWiki.php --wiki=abstractwiki # T411723 addWiki.php run [production]
21:26 <jforrester@deploy2002> mwscript-k8s job started: extensions/WikimediaMaintenance/maintenance/addWiki.php --wiki=abstractwiki # T411723 addWiki.php run [production]
21:24 <jforrester@deploy2002> Finished scap sync-world: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] (duration: 06m 44s) [production]
21:20 <jforrester@deploy2002> jforrester: Continuing with sync [production]
21:20 <jforrester@deploy2002> jforrester: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
21:17 <jforrester@deploy2002> Started scap sync-world: Backport for [[gerrit:1255034|Revert "OrchestratorRequest: Switch evaluations to v2 endpoint" (T418887)]], [[gerrit:1247650|Create Abstract Wikipedia (T411725 T411726)]] [production]
21:16 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5017.eqsin.wmnet with OS trixie [production]
21:15 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot finished rebooting dns7001.wikimedia.org [production]
21:12 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5020.eqsin.wmnet with OS trixie [production]
21:08 <jdlrobson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] (duration: 11m 20s) [production]
21:07 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie [production]
21:07 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie [production]
21:04 <jdlrobson@deploy2002> jdlrobson, harroyo-wmf: Continuing with sync [production]
20:59 <jdlrobson@deploy2002> jdlrobson, harroyo-wmf: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:59 <herron@cumin1003> START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-logging-eqiad [production]
20:58 <sukhe@cumin1003> cookbooks.sre.dns.roll-reboot begin reboot of dns7001.wikimedia.org [production]
20:58 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5027.eqsin.wmnet with reason: host reimage [production]
20:57 <jdlrobson@deploy2002> Started scap sync-world: Backport for [[gerrit:1255013|Guard for JS null deref on empty Parsoid sections (T419721)]], [[gerrit:1254889|Reapply "hcaptcha: Enforce hCaptcha on API edits coming from the MobileFrontend" (T419125)]] [production]
20:52 <herron@cumin1003> END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling reboot on A:kafka-logging-codfw [production]
20:51 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5027.eqsin.wmnet with reason: host reimage [production]
20:51 <jhathaway@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mx-in1001.wikimedia.org with reason: T419960 [production]
20:50 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp5020.eqsin.wmnet with OS trixie [production]
20:50 <jhathaway@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mx-in2001.wikimedia.org with reason: T419960 [production]
20:49 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5028.eqsin.wmnet with OS trixie [production]
20:48 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5028.eqsin.wmnet with OS trixie [production]