201-250 of 10000 results (137ms)
2026-03-30 ยง
13:12 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host pki-root1002.eqiad.wmnet [production]
13:10 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4005.ulsfo.wmnet [production]
13:09 <kharlan@deploy1003> kharlan: Continuing with sync [production]
13:07 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1264578|hCaptcha: Add APCu cache layer to health checker (T421204 T412947)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:05 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host pki-root1002.eqiad.wmnet [production]
13:05 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1264578|hCaptcha: Add APCu cache layer to health checker (T421204 T412947)]] [production]
13:05 <jayme> disabling puppet on A:wikiube-worker-eqiad for T420436 [production]
12:34 <moritzm> failover Ganeti master in ulsfo to ganeti4008 [production]
12:03 <topranks> apply transport-in policy to core router transport peerings to prefer local anycast routes [production]
12:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4006.ulsfo.wmnet [production]
12:00 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4006.ulsfo.wmnet [production]
11:54 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4006.ulsfo.wmnet [production]
11:52 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4006.ulsfo.wmnet [production]
11:51 <godog> bounce neutron-l3-agent on cloundnet1005 - T421054 [production]
11:06 <btullis@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
11:05 <btullis@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
11:05 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
11:04 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
10:37 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host bast4006.wikimedia.org with OS bookworm [production]
10:15 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on bast4006.wikimedia.org with reason: host reimage [production]
10:09 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on bast4006.wikimedia.org with reason: host reimage [production]
09:46 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast4006.wikimedia.org with OS bookworm [production]
09:42 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host bast4006.wikimedia.org with OS trixie [production]
09:19 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 42 [production]
09:17 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 42 [production]
09:15 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 12200 [production]
09:14 <ayounsi@cumin1003> START - Cookbook sre.network.peering with action 'email' for AS: 12200 [production]
09:11 <tappof> prometheus[12]008: reboot (T419960) [production]
09:10 <tappof> prometheus[12]006: reboot (T419960) [production]
08:56 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast4006.wikimedia.org with OS trixie [production]
08:52 <XioNoX> push pfw policy - T421556 [production]
08:51 <tappof> prometheus[12]007: reboot (T419960) [production]
08:38 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
08:38 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
08:37 <tappof> prometheus[12]005: reboot (T419960) [production]
08:34 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host bast4006.wikimedia.org with OS trixie [production]
08:17 <javiermonton@deploy1003> Finished scap sync-world: Backport for [[gerrit:1261377|stream: mediawiki.page_html_content_change (T421341)]] (duration: 35m 10s) [production]
08:03 <javiermonton@deploy1003> javiermonton: Continuing with sync [production]
08:00 <javiermonton@deploy1003> javiermonton: Backport for [[gerrit:1261377|stream: mediawiki.page_html_content_change (T421341)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:54 <godog> deploy rabbitmq changes to allow cli communication - T420923 [production]
07:48 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast4006.wikimedia.org with OS trixie [production]
07:48 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host bast4006.wikimedia.org with OS trixie [production]
07:48 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast4006.wikimedia.org with OS trixie [production]
07:45 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host bast4006.wikimedia.org with OS trixie [production]
07:45 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast4006.wikimedia.org with OS trixie [production]
07:42 <javiermonton@deploy1003> Started scap sync-world: Backport for [[gerrit:1261377|stream: mediawiki.page_html_content_change (T421341)]] [production]
07:38 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host bast4006.wikimedia.org with OS trixie [production]
07:24 <tappof> prometheus7002: switch to nftables and reboot (T419960) [production]
07:18 <tappof> prometheus6002: switch to nftables and reboot (T419960) [production]
07:11 <tappof> prometheus5002: switch to nftables and reboot (T419960) [production]