2025-05-02
21:38 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1054.eqiad.wmnet with OS bookworm [production]
21:23 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1053.eqiad.wmnet with OS bookworm [production]
20:34 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host ganeti1053.eqiad.wmnet with OS bookworm [production]
20:31 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:29 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:27 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1053.eqiad.wmnet with OS bookworm [production]
20:23 <tzatziki> removed 3 files for legal compliance [production]
20:18 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host ganeti1054.eqiad.wmnet with OS bookworm [production]
20:16 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1054.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:15 <tzatziki> removed 1 file for legal compliance [production]
20:11 <tzatziki> removed 1 file for legal compliance [production]
20:09 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host ganeti1054.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
20:09 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1054.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:57 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host ganeti1054.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:41 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host ganeti1053.eqiad.wmnet with OS bookworm [production]
19:38 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
19:36 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host ganeti1053.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
17:35 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1168.eqiad.wmnet [production]
17:27 <stevemunene@cumin1002> START - Cookbook sre.hosts.reboot-single for host an-worker1168.eqiad.wmnet [production]
17:26 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1167.eqiad.wmnet [production]
17:19 <stevemunene@cumin1002> START - Cookbook sre.hosts.reboot-single for host an-worker1167.eqiad.wmnet [production]
17:17 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1166.eqiad.wmnet [production]
17:09 <stevemunene@cumin1002> START - Cookbook sre.hosts.reboot-single for host an-worker1166.eqiad.wmnet [production]
16:53 <sukhe@dns1004> END - running authdns-update [production]
16:51 <sukhe@dns1004> START - running authdns-update [production]
16:47 <pt1979@cumin2002> END (PASS) - Cookbook sre.network.provision (exit_code=0) for device lsw1-f1-codfw.mgmt.codfw.wmnet [production]
16:28 <mvernon@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 18:00:00 on ms-fe1016.eqiad.wmnet with reason: not yet in prod [production]
16:28 <mvernon@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 18:00:00 on ms-fe1015.eqiad.wmnet with reason: not yet in prod [production]
16:26 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:24 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
16:24 <pt1979@cumin2002> START - Cookbook sre.network.provision for device lsw1-f1-codfw.mgmt.codfw.wmnet [production]
15:45 <stevemunene@cumin1002> START - Cookbook sre.hosts.reboot-single for host an-worker1166.eqiad.wmnet [production]
15:11 <herron> power cycling prometheus200[78] via rac [production]
15:06 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1168.eqiad.wmnet [production]
15:05 <jgleeson> SmashPig changed from 9b3c4587 to ddf64519 [production]
15:04 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1168.eqiad.wmnet [production]
15:03 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1167.eqiad.wmnet [production]
15:01 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1167.eqiad.wmnet [production]
15:01 <bking@cumin2002> conftool action : set/pooled=yes:weight=10; selector: name=cirrussearch2076.codfw.wmnet|cirrussearch2080.codfw.wmnet|cirrussearch2081.codfw.wmnet|cirrussearch2083.codfw.wmnet|cirrussearch2084.codfw.wmnet|cirrussearch2092.codfw.wmnet|cirrussearch2093.codfw.wmnet|cirrussearch2100.codfw.wmnet|cirrussearch2106.codfw.wmnet|cirrussearch2108.codfw.wmnet [production]
15:01 <stevemunene@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1166.eqiad.wmnet [production]
14:55 <stevemunene@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1166.eqiad.wmnet [production]
14:48 <dancy@deploy1003> Installation of scap version "4.159.0" completed for 2 hosts [production]
14:46 <dancy@deploy1003> Installing scap version "4.159.0" for 2 host(s) [production]
14:10 <inflatador> bking@localhost set search_codfw num_concurrent_incoming_recoveries from 20 back down to 4 after migration T391350 [production]
13:49 <moritzm> imported ruby-defaults 1:3.3~wmf13u1 to component/puppet7 for trixie-wikimedia T392790 [production]
13:40 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2008.wikimedia.org [production]
13:37 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host testvm2008.wikimedia.org [production]
13:25 <urandom> invoked manual `garbagecollect`, Cassandra sessionstore — T390514 [production]
13:21 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host testvm2007.codfw.wmnet [production]
13:17 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host testvm2007.codfw.wmnet [production]