401-450 of 10000 results (20ms)
2024-11-15 §
09:28 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:28 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:28 <elukey@cumin2002> END (ERROR) - Cookbook sre.hosts.provision (exit_code=97) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:27 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:23 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:23 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:22 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:21 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
2024-11-14 §
17:13 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
17:13 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
15:24 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
15:24 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
15:07 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
15:02 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
2024-11-13 §
16:30 <elukey> reload nginx on registry* to pick up logging changes (log of X-Client-IP from the CDN) [production]
10:09 <elukey> disallow calls to /v2/_catalog from the outside internet on Docker Registry hosts - T378618 [production]
10:01 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2088.codfw.wmnet with OS bullseye [production]
10:00 <elukey@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - elukey@cumin1002" [production]
10:00 <elukey@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - elukey@cumin1002" [production]
09:41 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2088.codfw.wmnet with reason: host reimage [production]
09:38 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2088.codfw.wmnet with reason: host reimage [production]
09:25 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host ms-be2088.codfw.wmnet with OS bullseye [production]
09:11 <elukey@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2088.codfw.wmnet with OS bullseye [production]
09:01 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host ms-be2088.codfw.wmnet with OS bullseye [production]
08:49 <elukey@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2088.codfw.wmnet with OS bullseye [production]
08:32 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2088.codfw.wmnet with reason: host reimage [production]
08:27 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2088.codfw.wmnet with reason: host reimage [production]
08:14 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host ms-be2088.codfw.wmnet with OS bullseye [production]
2024-11-12 §
11:52 <elukey@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
09:17 <elukey@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
2024-11-11 §
16:19 <elukey> restart pybal on lvs2013 (primary) to pick up new kartotherian-k8s-ssl service [production]
16:17 <elukey> restart pybal on lvs2014 (secondary) to pick up new kartotherian-k8s-ssl service [production]
16:10 <elukey> restart pybal on lvs1019 (primary) to pick up new kartotherian-k8s-ssl service [production]
16:09 <elukey> restart pybal on lvs1020 (secondary) to pick up new kartotherian-k8s-ssl service [production]
15:55 <elukey@puppetserver1001> conftool action : set/pooled=yes:weight=10; selector: dc=codfw,cluster=maps,service=kartotherian-k8s-ssl [production]
15:55 <elukey@puppetserver1001> conftool action : set/pooled=yes:weight=10; selector: dc=eqiad,cluster=maps,service=kartotherian-k8s-ssl [production]
15:54 <elukey@puppetserver1001> conftool action : set/pooled=yes:weight=1; selector: cluster=codfw,service=kartotherian-k8s-ssl [production]
14:35 <elukey@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2088.codfw.wmnet with OS bullseye [production]
14:27 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host ms-be2088.codfw.wmnet with OS bullseye [production]
14:20 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2088.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
14:07 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host ms-be2088.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
12:23 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2083.codfw.wmnet with OS bullseye [production]
12:01 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2083.codfw.wmnet with reason: host reimage [production]
11:56 <elukey@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2083.codfw.wmnet with reason: host reimage [production]
11:44 <elukey@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2083.codfw.wmnet with OS bullseye [production]
11:43 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-be2083.codfw.wmnet with OS bullseye [production]
11:43 <elukey@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2083.codfw.wmnet with OS bullseye [production]
11:30 <elukey@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
11:06 <elukey@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]
10:55 <elukey@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . [production]