2023-08-25
08:04 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host bast5004.wikimedia.org with OS bookworm [production]
08:03 <jayme@deploy1002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-eqiad-services/jaeger: apply [production]
08:03 <jayme@deploy1002> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-eqiad-services/jaeger: apply [production]
07:58 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetboard1002.eqiad.wmnet [production]
07:54 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host puppetboard1002.eqiad.wmnet [production]
07:53 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host puppetboard2002.codfw.wmnet [production]
07:49 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host puppetboard2002.codfw.wmnet [production]
07:30 <jelto@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
07:29 <jelto@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
07:19 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host failoid1002.eqiad.wmnet [production]
07:15 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host failoid1002.eqiad.wmnet [production]
07:13 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host failoid2002.codfw.wmnet [production]
07:12 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host bast5004.wikimedia.org with OS bookworm [production]
07:11 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM bast5004.wikimedia.org - jmm@cumin2002" [production]
07:10 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM bast5004.wikimedia.org - jmm@cumin2002" [production]
07:10 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) bast5004.wikimedia.org on all recursors [production]
07:10 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache bast5004.wikimedia.org on all recursors [production]
07:10 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:10 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM bast5004.wikimedia.org - jmm@cumin2002" [production]
07:09 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host failoid2002.codfw.wmnet [production]
07:09 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM bast5004.wikimedia.org - jmm@cumin2002" [production]
07:07 <jayme@deploy1002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
07:07 <jayme@deploy1002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
07:06 <moritzm> installing cups security updates [production]
07:05 <jayme@deploy1002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
07:05 <jayme@deploy1002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
07:04 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
07:04 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host bast5004.wikimedia.org [production]
06:58 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-rw2001.wikimedia.org [production]
06:55 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ldap-rw2001.wikimedia.org [production]
06:53 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-rw1001.wikimedia.org [production]
06:49 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ldap-rw1001.wikimedia.org [production]
05:47 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db2140 (re)pooling @ 100%: Maint over', diff saved to https://phabricator.wikimedia.org/P51427 and previous config saved to /var/cache/conftool/dbconfig/20230825-054701-ladsgroup.json [production]
05:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db2140 (re)pooling @ 75%: Maint over', diff saved to https://phabricator.wikimedia.org/P51426 and previous config saved to /var/cache/conftool/dbconfig/20230825-053156-ladsgroup.json [production]
05:28 <marostegui> failover m3-master to dbproxy1020 [production]
05:16 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db2140 (re)pooling @ 25%: Maint over', diff saved to https://phabricator.wikimedia.org/P51425 and previous config saved to /var/cache/conftool/dbconfig/20230825-051651-ladsgroup.json [production]
05:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'db2140 (re)pooling @ 10%: Maint over', diff saved to https://phabricator.wikimedia.org/P51424 and previous config saved to /var/cache/conftool/dbconfig/20230825-050147-ladsgroup.json [production]
2023-08-24
23:10 <bblack> geodns: DE+GB mapped back to esams (were temporarily on drmrs) [production]
22:15 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2025.codfw.wmnet with OS bullseye [production]
21:59 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes2025.codfw.wmnet with OS bullseye [production]
21:43 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2025.codfw.wmnet with OS bullseye [production]
21:43 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes2025.codfw.wmnet with OS bullseye [production]
21:38 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2025.codfw.wmnet with OS bullseye [production]
21:29 <bking@deploy1002> Finished deploy [wdqs/wdqs@16e3dcf]: allow list changes T343856 0.3.125 (duration: 00m 15s) [production]
21:29 <bking@deploy1002> Started deploy [wdqs/wdqs@16e3dcf]: allow list changes T343856 0.3.125 [production]
21:28 <bking@deploy1002> Finished deploy [wdqs/wdqs@16e3dcf]: allow list changes T343856 0.3.125 (duration: 08m 18s) [production]
21:25 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1219 (T344589)', diff saved to https://phabricator.wikimedia.org/P51422 and previous config saved to /var/cache/conftool/dbconfig/20230824-212554-ladsgroup.json [production]
21:23 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes2025.codfw.wmnet with OS bullseye [production]
21:19 <bking@deploy1002> Started deploy [wdqs/wdqs@16e3dcf]: allow list changes T343856 0.3.125 [production]
21:18 <bking@deploy1002> Finished deploy [wdqs/wdqs@16e3dcf]: allow list changes T343856 0.3.125 (duration: 02m 17s) [production]