551-600 of 10000 results (70ms)
2022-12-05 ยง
17:21 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1035.eqiad.wmnet with OS bullseye [production]
17:02 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1033.eqiad.wmnet with reason: host reimage [production]
16:59 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1033.eqiad.wmnet with reason: host reimage [production]
16:59 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1034.eqiad.wmnet with reason: host reimage [production]
16:57 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cp5015.eqsin.wmnet [production]
16:57 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:57 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp5015.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
16:56 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp5015.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
16:56 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1035.eqiad.wmnet with reason: host reimage [production]
16:56 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1034.eqiad.wmnet with reason: host reimage [production]
16:53 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1035.eqiad.wmnet with reason: host reimage [production]
16:53 <sukhe@cumin2002> START - Cookbook sre.dns.netbox [production]
16:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2127.codfw.wmnet with reason: Maintenance [production]
16:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on db2127.codfw.wmnet with reason: Maintenance [production]
16:48 <sukhe@cumin2002> START - Cookbook sre.hosts.decommission for hosts cp5015.eqsin.wmnet [production]
16:44 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash1033.eqiad.wmnet with OS bullseye [production]
16:43 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp5015.eqsin.wmnet with reason: downtimed, to be depooled [production]
16:43 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on cp5015.eqsin.wmnet with reason: downtimed, to be depooled [production]
16:41 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash1034.eqiad.wmnet with OS bullseye [production]
16:40 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5015.eqsin.wmnet,service=varnish-fe [production]
16:40 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5015.eqsin.wmnet,service=ats-be [production]
16:40 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp5015.eqsin.wmnet,service=ats-tls [production]
16:40 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1010.eqiad.wmnet with OS bullseye [production]
16:38 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash1035.eqiad.wmnet with OS bullseye [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5027.eqsin.wmnet,service=varnish-fe [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5027.eqsin.wmnet,service=ats-tls [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5027.eqsin.wmnet,service=ats-be [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/weight=1; selector: name=cp5027.eqsin.wmnet,service=varnish-fe [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/weight=1; selector: name=cp5027.eqsin.wmnet,service=ats-tls [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/weight=100; selector: name=cp5027.eqsin.wmnet,service=ats-be [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5023.eqsin.wmnet,service=varnish-fe [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5023.eqsin.wmnet,service=ats-tls [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp5023.eqsin.wmnet,service=ats-be [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/weight=1; selector: name=cp5023.eqsin.wmnet,service=varnish-fe [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/weight=1; selector: name=cp5023.eqsin.wmnet,service=ats-tls [production]
16:38 <sukhe@puppetmaster1001> conftool action : set/weight=100; selector: name=cp5023.eqsin.wmnet,service=ats-be [production]
16:27 <klausman> restarted kube-apiserver on ml-staging-ctrl2001 to adress high latency [production]
16:14 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1010.eqiad.wmnet with reason: host reimage [production]
16:11 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1010.eqiad.wmnet with reason: host reimage [production]
16:06 <klausman> restarted kube-apiserver on ml-serve-ctrl1001 to adress high latency and large number of 504s [production]
16:06 <moritzm> installing glibc security updates on buster [production]
15:46 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash1010.eqiad.wmnet with OS bullseye [production]
15:45 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cp[5012,5014].eqsin.wmnet [production]
15:45 <sukhe@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:45 <sukhe@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[5012,5014].eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
15:44 <sukhe@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[5012,5014].eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" [production]
15:41 <sukhe@cumin2002> START - Cookbook sre.dns.netbox [production]
15:36 <moritzm> installing apache2 security updates on buster [production]
15:35 <sukhe@cumin2002> START - Cookbook sre.hosts.decommission for hosts cp[5012,5014].eqsin.wmnet [production]
15:30 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp[5012,5014].eqsin.wmnet with reason: downtimed, to be depooled [production]