351-400 of 10000 results (114ms)
2025-04-29 §
10:33 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4006.ulsfo.wmnet [production]
10:33 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4006.ulsfo.wmnet [production]
10:31 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs4010.ulsfo.wmnet [production]
10:31 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs4010.ulsfo.wmnet} and A:liberica [production]
10:31 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs4010.ulsfo.wmnet} and A:liberica [production]
10:30 <tappof@cumin1002> START - Cookbook sre.hosts.reboot-single for host grafana1002.eqiad.wmnet [production]
10:27 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4006.ulsfo.wmnet [production]
10:25 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti4006.ulsfo.wmnet [production]
10:18 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs5004.eqsin.wmnet [production]
10:14 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs5004.eqsin.wmnet [production]
09:59 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs5004.eqsin.wmnet} and A:liberica [production]
09:58 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs5004.eqsin.wmnet} and A:liberica [production]
09:50 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs5005.eqsin.wmnet [production]
09:50 <tappof@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host grafana2001.codfw.wmnet [production]
09:47 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs5005.eqsin.wmnet [production]
09:46 <tappof@cumin1002> START - Cookbook sre.hosts.reboot-single for host grafana2001.codfw.wmnet [production]
09:46 <tappof@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host grafana2001.codfw.wmnet [production]
09:46 <tappof@cumin1002> START - Cookbook sre.hosts.reboot-single for host grafana2001.codfw.wmnet [production]
09:45 <fabfur> uploading haproxykafka 0.3.7 to reprepro (T387454) [production]
09:44 <TheresNoTime> Ran fixStuckGlobalRename.php for T392873 — job (re)started OK [production]
09:44 <tappof@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host grafana2001.codfw.wmnet [production]
09:44 <tappof@cumin1002> START - Cookbook sre.hosts.reboot-single for host grafana2001.codfw.wmnet [production]
09:41 <vgutierrez> re-arming keyholder in acmechief and acmechief-test instances [production]
09:39 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs5005.eqsin.wmnet} and A:liberica [production]
09:37 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs5005.eqsin.wmnet} and A:liberica [production]
09:34 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs5006.eqsin.wmnet [production]
09:31 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs5006.eqsin.wmnet [production]
09:30 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs5006.eqsin.wmnet} and A:liberica [production]
09:29 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs5006.eqsin.wmnet} and A:liberica [production]
09:21 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs7001.magru.wmnet [production]
09:18 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs7001.magru.wmnet [production]
09:11 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs7001.magru.wmnet} and A:liberica [production]
09:10 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs7001.magru.wmnet} and A:liberica [production]
08:52 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs7002.magru.wmnet [production]
08:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2050.codfw.wmnet [production]
08:45 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reboot-single for host lvs7002.magru.wmnet [production]
08:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2050.codfw.wmnet [production]
08:42 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2049.codfw.wmnet [production]
08:41 <marostegui@cumin1002> dbctl commit (dc=all): 'es2033 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P75601 and previous config saved to /var/cache/conftool/dbconfig/20250429-084116-root.json [production]
08:39 <godog> bounce prometheus-statsd-exporter on stat1011 - T389344 [production]
08:38 <marostegui@cumin1002> dbctl commit (dc=all): 'es1033 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P75600 and previous config saved to /var/cache/conftool/dbconfig/20250429-083855-root.json [production]
08:37 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs7002.magru.wmnet} and A:liberica [production]
08:36 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling P{lvs7002.magru.wmnet} and A:liberica [production]
08:36 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2049.codfw.wmnet [production]
08:30 <marostegui@cumin1002> START - Cookbook sre.mysql.pool db1188 slowly with 10 steps - Pool db1188.eqiad.wmnet in after cloning [production]
08:28 <moritzm> installing wget security updates [production]
08:26 <marostegui@cumin1002> dbctl commit (dc=all): 'es2033 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P75598 and previous config saved to /var/cache/conftool/dbconfig/20250429-082611-root.json [production]
08:25 <fabfur> rolling restart haproxykafka on A:cp to apply new configuration https://gerrit.wikimedia.org/r/c/operations/puppet/+/1136679 (T382571) [production]
08:24 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 8 hosts with reason: Maintenance [production]
08:23 <marostegui@cumin1002> dbctl commit (dc=all): 'es1033 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P75597 and previous config saved to /var/cache/conftool/dbconfig/20250429-082349-root.json [production]