5501-5550 of 10000 results (77ms)
2023-03-23 §
10:21 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache irc2002.wikimedia.org on all recursors [production]
10:21 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:21 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM irc2002.wikimedia.org - jmm@cumin2002" [production]
10:18 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main2005.codfw.wmnet with reason: host reimage [production]
10:15 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main2005.codfw.wmnet with reason: host reimage [production]
10:10 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM irc2002.wikimedia.org - jmm@cumin2002" [production]
10:08 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
10:08 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host irc2002.wikimedia.org [production]
10:01 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-main2005.codfw.wmnet with OS bullseye [production]
09:57 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2005.codfw.wmnet with reason: stop kafka and reimage [production]
09:57 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2005.codfw.wmnet with reason: stop kafka and reimage [production]
09:47 <moritzm> uploaded prometheus-druid-exporter 0.8-2 for bullseye-wikimedia T332584 T332589 [production]
08:20 <elukey> clean up docker and reboot kubernetes2024 to enable overlay2 - T332803 [production]
08:11 <vgutierrez> testing HAProxy 2.6.11 in cp4044 - T332796 [production]
08:08 <vgutierrez> fetch haproxy 2.6.11 in apt.wm.o thirdparty/haproxy26 for bullseye & buster [production]
08:04 <vgutierrez> rolling rollback to HAProxy 2.6.9 in cache text cluster - T332796 [production]
07:54 <elukey> clean up docker and reboot kubernetes2023 to enable overlay2 - T332803 [production]
07:50 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubernetes2023.codfw.wmnet with reason: Restart docker with overlay [production]
07:49 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kubernetes2023.codfw.wmnet with reason: Restart docker with overlay [production]
07:49 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubernetes2024.codfw.wmnet with reason: Restart docker with overlay [production]
07:49 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kubernetes2024.codfw.wmnet with reason: Restart docker with overlay [production]
07:42 <elukey> clean up docker on kubernetes1024 (cordon + stop kubelet + docker + clean /var/lib/docker/*) and reboot to enable overlay2 - T332803 [production]
07:38 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubernetes1024.eqiad.wmnet with reason: Restart docker with overlay [production]
07:37 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kubernetes1024.eqiad.wmnet with reason: Restart docker with overlay [production]
07:23 <marostegui@cumin1001> dbctl commit (dc=all): 'es2029 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P45928 and previous config saved to /var/cache/conftool/dbconfig/20230323-072315-root.json [production]
07:08 <marostegui@cumin1001> dbctl commit (dc=all): 'es2029 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P45927 and previous config saved to /var/cache/conftool/dbconfig/20230323-070811-root.json [production]
06:53 <marostegui@cumin1001> dbctl commit (dc=all): 'es2029 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P45926 and previous config saved to /var/cache/conftool/dbconfig/20230323-065306-root.json [production]
06:38 <marostegui@cumin1001> dbctl commit (dc=all): 'es2029 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P45925 and previous config saved to /var/cache/conftool/dbconfig/20230323-063800-root.json [production]
06:22 <marostegui@cumin1001> dbctl commit (dc=all): 'es2029 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P45924 and previous config saved to /var/cache/conftool/dbconfig/20230323-062255-root.json [production]
06:07 <marostegui@cumin1001> dbctl commit (dc=all): 'es2029 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P45923 and previous config saved to /var/cache/conftool/dbconfig/20230323-060750-root.json [production]
05:37 <denisse@cumin1001> END (FAIL) - Cookbook sre.ganeti.reimage (exit_code=99) for host doc2002.codfw.wmnet with OS bullseye [production]
05:34 <stevemunene@cumin1001> END (FAIL) - Cookbook sre.ganeti.reimage (exit_code=99) for host an-test-client1002.eqiad.wmnet with OS bullseye [production]
04:25 <denisse@cumin1001> START - Cookbook sre.ganeti.reimage for host doc2002.codfw.wmnet with OS bullseye [production]
02:07 <denisse@cumin1001> END (FAIL) - Cookbook sre.ganeti.reimage (exit_code=99) for host doc2002.codfw.wmnet with OS bullseye [production]
02:00 <mutante> rsyncing ~4GB files for static-codereview.wikimedia.org from old to newer VMs for T331896 - no automatic sync / deploy for these [production]
01:05 <denisse@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "doc1003 - denisse@cumin1001 - T332812" [production]
01:03 <denisse@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "doc1003 - denisse@cumin1001 - T332812" [production]
00:57 <denisse@cumin1001> START - Cookbook sre.ganeti.reimage for host doc2002.codfw.wmnet with OS bullseye [production]
00:57 <denisse@cumin1001> END (ERROR) - Cookbook sre.ganeti.reimage (exit_code=97) for host doc2002.codfw.wmnet with OS bullseye [production]
00:57 <denisse@cumin1001> START - Cookbook sre.ganeti.reimage for host doc2002.codfw.wmnet with OS bullseye [production]
00:27 <denisse@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host doc2002.codfw.wmnet [production]
00:10 <denisse@cumin1001> END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host doc1003.eqiad.wmnet with OS bullseye [production]
2023-03-22 §
23:59 <denisse@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doc1003.eqiad.wmnet with reason: host reimage [production]
23:56 <denisse@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on doc1003.eqiad.wmnet with reason: host reimage [production]
23:46 <denisse@cumin1001> START - Cookbook sre.ganeti.reimage for host doc1003.eqiad.wmnet with OS bullseye [production]
23:34 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) doc2002.codfw.wmnet on all recursors [production]
23:34 <denisse@cumin1001> START - Cookbook sre.dns.wipe-cache doc2002.codfw.wmnet on all recursors [production]
23:34 <denisse@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
23:33 <denisse@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doc2002.codfw.wmnet - denisse@cumin1001" [production]
23:32 <denisse@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM doc2002.codfw.wmnet - denisse@cumin1001" [production]