5401-5450 of 10000 results (86ms)
2022-12-29 §
23:26 <ryankemper@cumin2002> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
23:25 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) [production]
23:24 <ryankemper@cumin2002> START - Cookbook sre.wdqs.data-reload [production]
23:22 <ryankemper@cumin1001> START - Cookbook sre.wdqs.data-reload [production]
09:19 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on an-worker1084.eqiad.wmnet with reason: Avoid IRC spam [production]
09:19 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on an-worker1084.eqiad.wmnet with reason: Avoid IRC spam [production]
2022-12-22 §
18:27 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-jumbo1015.eqiad.wmnet with OS bullseye [production]
18:27 <btullis@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1001" [production]
18:16 <btullis@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1001" [production]
18:03 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-jumbo1015.eqiad.wmnet with reason: host reimage [production]
18:00 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-jumbo1015.eqiad.wmnet with reason: host reimage [production]
17:25 <robh@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:25 <robh@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: atlas ulsfo decom - robh@cumin2002" [production]
17:24 <robh@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: atlas ulsfo decom - robh@cumin2002" [production]
17:22 <robh@cumin2002> START - Cookbook sre.dns.netbox [production]
16:56 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host cephosd1001.eqiad.wmnet with OS bullseye [production]
16:51 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-jumbo1015.eqiad.wmnet with OS bullseye [production]
16:51 <elukey@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
16:50 <elukey@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
16:23 <elukey@cumin1001> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) pool inference in codfw: maintenance [production]
16:18 <elukey@cumin1001> START - Cookbook sre.discovery.service-route pool inference in codfw: maintenance [production]
16:17 <elukey@cumin1001> END (FAIL) - Cookbook sre.discovery.service-route (exit_code=99) depool inference in eqiad: maintenance [production]
16:17 <elukey@cumin1001> START - Cookbook sre.discovery.service-route depool inference in eqiad: maintenance [production]
16:15 <elukey@cumin1001> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) depool inference in codfw: maintenance [production]
16:10 <elukey@cumin1001> START - Cookbook sre.discovery.service-route depool inference in codfw: maintenance [production]
16:09 <elukey@cumin1001> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) check inference: maintenance [production]
16:09 <elukey@cumin1001> START - Cookbook sre.discovery.service-route check inference: maintenance [production]
16:00 <aikochou@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
15:40 <btullis@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-jumbo1015.eqiad.wmnet with OS bullseye [production]
14:47 <akosiaris> truncate daemon.log.1 on maps1009 to free up disk space [production]
14:43 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-jumbo1015.eqiad.wmnet with OS bullseye [production]
14:42 <btullis@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-jumbo1015.eqiad.wmnet with OS bullseye [production]
13:46 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-jumbo1015.eqiad.wmnet with OS bullseye [production]
13:18 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-jumbo1014.eqiad.wmnet with OS bullseye [production]
13:18 <btullis@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1001" [production]
12:49 <btullis@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1001" [production]
12:37 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-jumbo1014.eqiad.wmnet with reason: host reimage [production]
12:34 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-jumbo1014.eqiad.wmnet with reason: host reimage [production]
11:32 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-jumbo1014.eqiad.wmnet with OS bullseye [production]
11:30 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-jumbo1013.eqiad.wmnet with OS bullseye [production]
11:30 <btullis@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1001" [production]
11:23 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 17806 [production]
11:14 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 17806 [production]
11:09 <btullis@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1001" [production]
10:57 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-jumbo1013.eqiad.wmnet with reason: host reimage [production]
10:54 <btullis@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-jumbo1013.eqiad.wmnet with reason: host reimage [production]
09:38 <btullis@cumin1001> START - Cookbook sre.hosts.reimage for host kafka-jumbo1013.eqiad.wmnet with OS bullseye [production]
08:11 <moritzm> installing libksba security updates [production]
07:43 <vgutierrez> restarting varnish on cp4052 to clear VarnishChildRestarted alert - T325797 [production]
07:22 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 56286 [production]