2251-2300 of 10000 results (147ms)
2025-10-28 ยง
15:09 <swfrench@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
15:06 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab2002.codfw.wmnet with reason: reboot for kernel [production]
15:05 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1004.eqiad.wmnet with reason: reboot for kernel [production]
14:59 <dancy@deploy2002> Installation of scap version "4.218.0" completed for 2 hosts [production]
14:57 <dancy@deploy2002> Installing scap version "4.218.0" for 2 host(s) [production]
14:52 <elukey@puppetserver1001> conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=eqiad [production]
14:45 <hashar> Restarted CI Jenkins [production]
14:42 <hashar> Restarting Gerrit [production]
13:43 <derick@deploy2002> Finished scap sync-world: Backport for [[gerrit:1199291|Remove hCaptcha site key from private/readme.php]] (duration: 08m 58s) [production]
13:39 <derick@deploy2002> mszwarc, derick: Continuing with sync [production]
13:38 <derick@deploy2002> mszwarc, derick: Backport for [[gerrit:1199291|Remove hCaptcha site key from private/readme.php]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:34 <derick@deploy2002> Started scap sync-world: Backport for [[gerrit:1199291|Remove hCaptcha site key from private/readme.php]] [production]
13:32 <derick@deploy2002> Finished scap sync-world: Backport for [[gerrit:1199074|Make wgVectorMaxWidthOptions specify Special:Userlogin correctly (T408447)]] (duration: 10m 56s) [production]
13:29 <bking@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
13:29 <bking@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
13:29 <bking@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
13:29 <bking@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
13:26 <derick@deploy2002> derick, matmarex: Continuing with sync [production]
13:25 <derick@deploy2002> derick, matmarex: Backport for [[gerrit:1199074|Make wgVectorMaxWidthOptions specify Special:Userlogin correctly (T408447)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:21 <derick@deploy2002> Started scap sync-world: Backport for [[gerrit:1199074|Make wgVectorMaxWidthOptions specify Special:Userlogin correctly (T408447)]] [production]
13:17 <sukhe@cumin1003> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host lvs2011.codfw.wmnet [production]
13:14 <brouberol@deploy2002> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
13:14 <brouberol@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
13:06 <sukhe@cumin1003> START - Cookbook sre.hosts.reboot-single for host lvs2011.codfw.wmnet [production]
13:00 <mvernon@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling reboot on A:swift-fe-codfw [production]
12:53 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts es2026.codfw.wmnet [production]
12:53 <fceratto@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:53 <fceratto@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2026.codfw.wmnet decommissioned, removing all IPs except the asset tag one - fceratto@cumin1003" [production]
12:49 <sukhe@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host pybal-test2003.codfw.wmnet [production]
12:46 <sukhe@cumin1003> START - Cookbook sre.hosts.reboot-single for host pybal-test2003.codfw.wmnet [production]
12:45 <sukhe@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs2011.codfw.wmnet with reason: reboot [production]
12:24 <fceratto@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: es2026.codfw.wmnet decommissioned, removing all IPs except the asset tag one - fceratto@cumin1003" [production]
12:04 <Msz2001> Deployed changes to Suggested Investigations [production]
11:48 <fceratto@cumin1003> START - Cookbook sre.dns.netbox [production]
11:44 <mvernon@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling reboot on A:swift-fe-codfw [production]
11:42 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host sretest2010 [production]
11:42 <fceratto@cumin1003> START - Cookbook sre.hosts.decommission for hosts es2026.codfw.wmnet [production]
11:41 <elukey@cumin2002> START - Cookbook sre.hosts.powercycle for host sretest2010 [production]
11:40 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.powercycle (exit_code=0) for host ml-serve2001 [production]
11:30 <elukey@cumin2002> START - Cookbook sre.hosts.powercycle for host ml-serve2001 [production]
11:00 <zabe@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply [production]
10:58 <zabe@deploy2002> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
10:50 <moritzm> installing openjdk-17 security updates [production]
10:10 <klausman@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:55 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb1012.eqiad.wmnet [production]
09:52 <klausman@cumin1003> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:51 <klausman@cumin1003> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Roll-restart for Java security updates - klausman@cumin1003 [production]
09:49 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host rdb1012.eqiad.wmnet [production]
09:48 <cgoubert@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host rdb1014.eqiad.wmnet [production]
09:42 <cgoubert@cumin1003> START - Cookbook sre.hosts.reboot-single for host rdb1014.eqiad.wmnet [production]