2025-04-17
20:45 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1249.eqiad.wmnet with reason: Maintenance [production]
20:45 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1248 (T391056)', diff saved to https://phabricator.wikimedia.org/P75233 and previous config saved to /var/cache/conftool/dbconfig/20250417-204528-fceratto.json [production]
20:38 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch2058.codfw.wmnet with reason: host reimage [production]
20:37 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1183.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
20:34 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cirrussearch2058.codfw.wmnet with reason: host reimage [production]
20:30 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P75232 and previous config saved to /var/cache/conftool/dbconfig/20250417-203021-fceratto.json [production]
20:25 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1181.eqiad.wmnet with OS bullseye [production]
20:25 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on an-worker1178.eqiad.wmnet with reason: host reimage [production]
20:25 <vriley@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1178.eqiad.wmnet with reason: host reimage [production]
20:18 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host cirrussearch2058 [production]
20:18 <bking@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cirrussearch2058 [production]
20:18 <bking@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host cirrussearch2058 [production]
20:18 <bking@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2058.codfw.wmnet 205.16.192.10.in-addr.arpa 5.0.2.0.6.1.0.0.2.9.1.0.0.1.0.0.2.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
20:18 <bking@cumin2002> START - Cookbook sre.dns.wipe-cache cirrussearch2058.codfw.wmnet 205.16.192.10.in-addr.arpa 5.0.2.0.6.1.0.0.2.9.1.0.0.1.0.0.2.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
20:18 <bking@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:18 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host cirrussearch2058 - bking@cumin2002" [production]
20:18 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host cirrussearch2058 - bking@cumin2002" [production]
20:15 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1248', diff saved to https://phabricator.wikimedia.org/P75231 and previous config saved to /var/cache/conftool/dbconfig/20250417-201515-fceratto.json [production]
20:13 <bking@cumin2002> START - Cookbook sre.dns.netbox [production]
20:10 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1178.eqiad.wmnet with OS bullseye [production]
20:09 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1178.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
20:09 <bking@cumin2002> START - Cookbook sre.hosts.move-vlan for host cirrussearch2058 [production]
20:08 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host cirrussearch2058.codfw.wmnet with OS bullseye [production]
20:07 <bking@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch2058.codfw.wmnet on all recursors [production]
20:07 <bking@cumin2002> START - Cookbook sre.dns.wipe-cache cirrussearch2058.codfw.wmnet on all recursors [production]
20:07 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from elastic2058 to cirrussearch2058 [production]
20:06 <bking@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cirrussearch2058 [production]
20:06 <bking@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host cirrussearch2058 [production]
20:06 <bking@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:06 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic2058 to cirrussearch2058 - bking@cumin2002" [production]
20:05 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic2058 to cirrussearch2058 - bking@cumin2002" [production]
20:02 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1181.eqiad.wmnet with reason: host reimage [production]
20:00 <bking@cumin2002> START - Cookbook sre.dns.netbox [production]
20:00 <bking@cumin2002> START - Cookbook sre.hosts.rename from elastic2058 to cirrussearch2058 [production]
20:00 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1248 (T391056)', diff saved to https://phabricator.wikimedia.org/P75230 and previous config saved to /var/cache/conftool/dbconfig/20250417-200008-fceratto.json [production]
19:59 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 [production]
19:59 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 [production]
19:58 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (3 nodes at a time) for ElasticSearch cluster search_codfw: reimage row B - bking@cumin2002 - T388610 [production]
19:58 <vriley@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1181.eqiad.wmnet with reason: host reimage [production]
19:55 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1248 (T391056)', diff saved to https://phabricator.wikimedia.org/P75229 and previous config saved to /var/cache/conftool/dbconfig/20250417-195506-fceratto.json [production]
19:54 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1248.eqiad.wmnet with reason: Maintenance [production]
19:54 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1247 (T391056)', diff saved to https://phabricator.wikimedia.org/P75228 and previous config saved to /var/cache/conftool/dbconfig/20250417-195442-fceratto.json [production]
19:50 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1178.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
19:50 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1178.eqiad.wmnet with OS bullseye [production]
19:44 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1181.eqiad.wmnet with OS bullseye [production]
19:43 <vriley@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1181.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
19:42 <vriley@cumin1002> START - Cookbook sre.hosts.provision for host an-worker1181.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
19:39 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P75226 and previous config saved to /var/cache/conftool/dbconfig/20250417-193935-fceratto.json [production]
19:36 <vriley@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1178.eqiad.wmnet with OS bullseye [production]
19:35 <vriley@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1178.eqiad.wmnet with OS bullseye [production]