601-650 of 10000 results (97ms)
2025-10-28 ยง
13:17 <sukhe@cumin1003> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host lvs2011.codfw.wmnet [production]
13:14 <brouberol@deploy2002> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
13:14 <brouberol@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
13:06 <sukhe@cumin1003> START - Cookbook sre.hosts.reboot-single for host lvs2011.codfw.wmnet [production]
13:00 <mvernon@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling reboot on A:swift-fe-codfw [production]
12:53 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts es2026.codfw.wmnet [production]
12:53 <fceratto@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:53 <fceratto@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2026.codfw.wmnet decommissioned, removing all IPs except the asset tag one - fceratto@cumin1003" [production]
12:49 <sukhe@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host pybal-test2003.codfw.wmnet [production]
12:46 <sukhe@cumin1003> START - Cookbook sre.hosts.reboot-single for host pybal-test2003.codfw.wmnet [production]
12:45 <sukhe@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs2011.codfw.wmnet with reason: reboot [production]
12:24 <fceratto@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2026.codfw.wmnet decommissioned, removing all IPs except the asset tag one - fceratto@cumin1003" [production]
12:04 <Msz2001> Deployed changes to Suggested Investigations [production]
11:48 <fceratto@cumin1003> START - Cookbook sre.dns.netbox [production]
11:44 <mvernon@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling reboot on A:swift-fe-codfw [production]
11:42 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host sretest2010 [production]
11:42 <fceratto@cumin1003> START - Cookbook sre.hosts.decommission for hosts es2026.codfw.wmnet [production]
11:41 <elukey@cumin2002> START - Cookbook sre.hosts.powercycle for host sretest2010 [production]
11:40 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host ml-serve2001 [production]
11:30 <elukey@cumin2002> START - Cookbook sre.hosts.powercycle for host ml-serve2001 [production]
11:00 <zabe@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply [production]
10:58 <zabe@deploy2002> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
10:50 <moritzm> installing openjdk-17 security updates [production]
10:10 <klausman@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:55 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb1012.eqiad.wmnet [production]
09:52 <klausman@cumin1003> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:51 <klausman@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:49 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host rdb1012.eqiad.wmnet [production]
09:48 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb1014.eqiad.wmnet [production]
09:42 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host rdb1014.eqiad.wmnet [production]
09:42 <cgoubert@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:39 <cgoubert@cumin1003> START - Cookbook sre.dns.netbox [production]
09:39 <cgoubert@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:36 <cgoubert@cumin1003> START - Cookbook sre.dns.netbox [production]
09:34 <klausman@cumin1003> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:20 <jmm@cumin2002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: OpenJDK security updates - jmm@cumin2002 [production]
09:08 <kharlan@deploy2002> Finished scap sync-world: Backport for [[gerrit:1199231|hCaptcha: Enable on loginwiki (T408428)]] (duration: 16m 35s) [production]
09:05 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:02 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
08:59 <jmm@cumin2002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:cassandra-dev: OpenJDK security updates - jmm@cumin2002 [production]
08:58 <kharlan@deploy2002> kharlan: Continuing with sync [production]
08:56 <kharlan@deploy2002> kharlan: Backport for [[gerrit:1199231|hCaptcha: Enable on loginwiki (T408428)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:53 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host ml-serve2001 [production]
08:53 <elukey@cumin2002> START - Cookbook sre.hosts.powercycle for host ml-serve2001 [production]
08:52 <kharlan@deploy2002> Started scap sync-world: Backport for [[gerrit:1199231|hCaptcha: Enable on loginwiki (T408428)]] [production]
08:49 <kharlan@deploy2002> Finished scap sync-world: Backport for [[gerrit:1199026|CheckUser: Enable SI on metawiki and loginwiki (T408428)]] (duration: 46m 57s) [production]
08:33 <kharlan@deploy2002> kharlan: Continuing with sync [production]
08:29 <elukey@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host ml-serve2001.codfw.wmnet [production]
08:29 <elukey@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host ml-serve2001.codfw.wmnet [production]
08:28 <moritzm> installing openjdk-11 security updates [production]