1201-1250 of 10000 results (32ms)
2026-03-30 ยง
13:31 <moritzm> rebalance Ganeti cluster in ulsfo following the completion of the migration to routed Ganeti T421044 [production]
13:30 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1264590|instrument(ReviseTone): record start of copyedit session (T419181)]], [[gerrit:1261477|Replace WANObjectCache with new MemcachedWrapper concept (T419666)]], [[gerrit:1262199|Fix match case for setting minute, week or month TTL on OrchestratorRequest (T421475)]] [production]
13:30 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host aux-k8s-etcd1004.eqiad.wmnet [production]
13:19 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4005.ulsfo.wmnet [production]
13:19 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aux-k8s-etcd1003.eqiad.wmnet [production]
13:18 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4005.ulsfo.wmnet [production]
13:17 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1264578|hCaptcha: Add APCu cache layer to health checker (T421204 T412947)]] (duration: 11m 56s) [production]
13:15 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host aux-k8s-etcd1003.eqiad.wmnet [production]
13:12 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4005.ulsfo.wmnet [production]
13:12 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host pki-root1002.eqiad.wmnet [production]
13:10 <hashar> gerrit: abandon mediawiki/core changes that are 2+years old and are attached to a task (`Bug: Txxxx`) [releng]
13:10 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4005.ulsfo.wmnet [production]
13:09 <kharlan@deploy1003> kharlan: Continuing with sync [production]
13:07 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1264578|hCaptcha: Add APCu cache layer to health checker (T421204 T412947)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:05 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host pki-root1002.eqiad.wmnet [production]
13:05 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1264578|hCaptcha: Add APCu cache layer to health checker (T421204 T412947)]] [production]
13:05 <jayme> disabling puppet on A:wikiube-worker-eqiad for T420436 [production]
12:54 <wmbot~lucaswerkmeister@tools-bastion-15> deployed 7b4b75736e (l10n updates: ja, ru) [tools.wd-image-positions]
12:34 <moritzm> failover Ganeti master in ulsfo to ganeti4008 [production]
12:05 <dcaro> removing wal from prometheus nodes to restart them [tools]
12:03 <topranks> apply transport-in policy to core router transport peerings to prefer local anycast routes [production]
12:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4006.ulsfo.wmnet [production]
12:00 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4006.ulsfo.wmnet [production]
11:54 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4006.ulsfo.wmnet [production]
11:53 <godog> bounce neutron-l3-agent on cloudnet1005 [admin]
11:52 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4006.ulsfo.wmnet [production]
11:51 <godog> bounce neutron-l3-agent on cloundnet1005 - T421054 [production]
11:37 <hashar> Reloaded Zuul to to add 3 persons to the allow list [releng]
11:20 <filippo@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for all services [admin]
11:06 <filippo@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for all services [admin]
11:06 <btullis@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
11:05 <btullis@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
11:05 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
11:04 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
11:01 <filippo@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.rabbitmq.rebuild_rabbit_cluster (exit_code=0) on deployment eqiad1 [admin]
10:57 <filippo@cloudcumin1001> START - Cookbook wmcs.openstack.rabbitmq.rebuild_rabbit_cluster on deployment eqiad1 [admin]
10:43 <James_F> Docker: Re-pushing to try to create quibble-coverage 1.16.0-s2 [releng]
10:37 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host bast4006.wikimedia.org with OS bookworm [production]
10:36 <filippo@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for all services [admin]
10:21 <filippo@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for all services [admin]
10:15 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on bast4006.wikimedia.org with reason: host reimage [production]
10:09 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on bast4006.wikimedia.org with reason: host reimage [production]
09:58 <filippo@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.rabbitmq.rebuild_rabbit_cluster (exit_code=99) on deployment eqiad1 [admin]
09:57 <filippo@cloudcumin1001> START - Cookbook wmcs.openstack.rabbitmq.rebuild_rabbit_cluster on deployment eqiad1 [admin]
09:53 <wmftkbot> Test Kitchen edge-unique experiments (poll 45906) - adds: synth-aa-test-traffic-impact-2, synth-aa-test-traffic-impact-1, synth-aa-test-traffic-impact-3; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
09:46 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast4006.wikimedia.org with OS bookworm [production]
09:42 <filippo@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,designate [admin]
09:42 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host bast4006.wikimedia.org with OS trixie [production]
09:41 <filippo@cloudcumin1001> START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for service: project,designate [admin]
09:30 <filippo@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) on deployment eqiad1 for service: project,nova,neutron,designate [admin]