7951-8000 of 10000 results (75ms)
2022-01-13 ยง
11:27 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 25%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P18723 and previous config saved to /var/cache/conftool/dbconfig/20220113-112749-root.json [production]
11:26 <btullis@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM kafka-test1010.eqiad.wmnet [production]
11:26 <_joe_> update scap everywhere T298986 [production]
11:25 <oblivian@deploy1002> Finished deploy [restbase/deploy@0848b15]: scap testing (duration: 00m 09s) [production]
11:25 <oblivian@deploy1002> Started deploy [restbase/deploy@0848b15]: scap testing [production]
11:24 <oblivian@deploy1002> Finished deploy [restbase/deploy@0848b15]: (no justification provided) (duration: 00m 09s) [production]
11:23 <oblivian@deploy1002> Started deploy [restbase/deploy@0848b15]: (no justification provided) [production]
11:20 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM testreduce1001.eqiad.wmnet [production]
11:18 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2022.codfw.wmnet with OS bullseye [production]
11:16 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM testreduce1001.eqiad.wmnet [production]
11:12 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 20%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P18722 and previous config saved to /var/cache/conftool/dbconfig/20220113-111245-root.json [production]
11:11 <btullis@cumin1001> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM kafka-test1009.eqiad.wmnet [production]
11:11 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM netbox1001.wikimedia.org [production]
11:08 <btullis@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM kafka-test1009.eqiad.wmnet [production]
11:03 <moritzm> rebooting netbox1001 (running netbox.wikimedia.org) [production]
11:03 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM netbox1001.wikimedia.org [production]
11:03 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main1001.eqiad.wmnet with OS buster [production]
11:02 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM netboxdb1001.eqiad.wmnet [production]
10:59 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM netboxdb1001.eqiad.wmnet [production]
10:58 <btullis@cumin1001> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM kafka-test1008.eqiad.wmnet [production]
10:57 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 10%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P18721 and previous config saved to /var/cache/conftool/dbconfig/20220113-105741-root.json [production]
10:56 <btullis@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM kafka-test1008.eqiad.wmnet [production]
10:52 <hashar> Restarting Jenkins CI for plugins update T298691 [production]
10:47 <btullis@cumin1001> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM kafka-test1007.eqiad.wmnet [production]
10:46 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM search-loader1001.eqiad.wmnet [production]
10:45 <btullis@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM kafka-test1007.eqiad.wmnet [production]
10:43 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM search-loader1001.eqiad.wmnet [production]
10:42 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es2022.codfw.wmnet with OS bullseye [production]
10:42 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 5%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P18720 and previous config saved to /var/cache/conftool/dbconfig/20220113-104238-root.json [production]
10:31 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM irc1001.wikimedia.org [production]
10:29 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-main1001.eqiad.wmnet with OS buster [production]
10:29 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM irc1001.wikimedia.org [production]
10:27 <marostegui@cumin1001> dbctl commit (dc=all): 'es1022 (re)pooling @ 1%: repooling after reimage', diff saved to https://phabricator.wikimedia.org/P18719 and previous config saved to /var/cache/conftool/dbconfig/20220113-102734-root.json [production]
10:27 <moritzm> systemctl reset-failed ifup@ens5.service on lists1001 T273026 [production]
10:13 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM grafana1002.eqiad.wmnet [production]
10:10 <moritzm> rebooting grafana1002 (running grafana.wikimedia.org) [production]
10:10 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM grafana1002.eqiad.wmnet [production]
10:09 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1022.eqiad.wmnet with OS bullseye [production]
10:02 <mmandere> cp3052: upgrade varnish to 6.0.9-1wm1 T298758 [production]
10:02 <joal@deploy1002> Finished deploy [analytics/refinery@94ec386]: Hotfix analytics deploy [analytics/refinery@94ec386] (duration: 21m 47s) [production]
10:02 <elukey> run kafka preferred-replica-election on kafka-main1001 to force a rebalance of partition leaders (after kafka-main1002's reimage) [production]
10:00 <btullis@cumin1001> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM kafka-test1006.eqiad.wmnet [production]
09:59 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main1002.eqiad.wmnet with OS buster [production]
09:56 <btullis@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM kafka-test1006.eqiad.wmnet [production]
09:49 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es1022.eqiad.wmnet with OS bullseye [production]
09:46 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1022.eqiad.wmnet with OS bullseye [production]
09:42 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host es1022.eqiad.wmnet with OS bullseye [production]
09:40 <joal@deploy1002> Started deploy [analytics/refinery@94ec386]: Hotfix analytics deploy [analytics/refinery@94ec386] [production]
09:40 <joal@deploy1002> Finished deploy [analytics/refinery@94ec386] (thin): Hotfix analytics deploy THIN [analytics/refinery@94ec386] (duration: 00m 07s) [production]
09:40 <joal@deploy1002> Started deploy [analytics/refinery@94ec386] (thin): Hotfix analytics deploy THIN [analytics/refinery@94ec386] [production]