501-550 of 10000 results (27ms)
2024-12-19 §
02:28 <krinkle@deploy2002> Started deploy [statsv/statsv@2ee86ea]: Add dogstatsd support [production]
01:48 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1043.eqiad.wmnet with OS bookworm [production]
01:05 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host es1043.eqiad.wmnet with OS bookworm [production]
2024-12-18 §
23:41 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1043.eqiad.wmnet with OS bookworm [production]
22:58 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host es1043.eqiad.wmnet with OS bookworm [production]
21:31 <mfossati@deploy2002> Finished deploy [airflow-dags/platform_eng@a43cacf]: bump image suggestions, section topics, and SEAL (duration: 01m 43s) [production]
21:30 <mfossati@deploy2002> Started deploy [airflow-dags/platform_eng@a43cacf]: bump image suggestions, section topics, and SEAL [production]
20:44 <cjming@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply [production]
20:44 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply [production]
20:36 <cjming@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
20:36 <cjming@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
20:29 <otto@deploy2002> helmfile [codfw] DONE helmfile.d/services/eventgate-analytics-external: sync [production]
20:28 <otto@deploy2002> helmfile [codfw] START helmfile.d/services/eventgate-analytics-external: sync [production]
20:28 <otto@deploy2002> helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: sync [production]
20:27 <otto@deploy2002> helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: sync [production]
20:27 <otto@deploy2002> helmfile [staging] DONE helmfile.d/services/eventgate-analytics-external: sync [production]
20:27 <otto@deploy2002> helmfile [staging] START helmfile.d/services/eventgate-analytics-external: sync [production]
20:26 <ottomata> restarting eventgate-analytics-external to clear schema cache - T382113 | https://phabricator.wikimedia.org/T382113#10414005 [production]
19:28 <dancy@deploy2002> rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.8 refs T375667 [production]
18:55 <btullis@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" [production]
18:41 <wmbot~anticomposite@tools-bastion-13> kubectl rollout restart deployment flr # bot not processing files [tools.yifeibot]
18:40 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1069.eqiad.wmnet with reason: host reimage [production]
18:37 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1069.eqiad.wmnet with reason: host reimage [production]
18:25 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye [production]
18:25 <btullis@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1069.eqiad.wmnet with OS bullseye [production]
18:23 <btullis@cumin1002> END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1068.eqiad.wmnet [production]
18:21 <btullis@cumin1002> START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1068.eqiad.wmnet [production]
18:20 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye [production]
18:18 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1068.eqiad.wmnet with OS bullseye [production]
18:18 <btullis@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" [production]
18:18 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
18:16 <btullis@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" [production]
18:16 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1067.eqiad.wmnet with OS bullseye [production]
18:15 <btullis@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1067.eqiad.wmnet with OS bullseye [production]
18:13 <btullis@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1069.eqiad.wmnet with OS bullseye [production]
18:09 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1067.eqiad.wmnet with OS bullseye [production]
18:06 <lucaswerkmeister> add samtar and remove toolforge-standards-committee per T380537 [tools.bullseye]
18:05 <btullis@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1067.eqiad.wmnet with OS bullseye [production]
18:04 <lucaswerkmeister> sudo rm __pycache__/settings*.pyc # T380537 [tools.bullseye]
18:03 <lucaswerkmeister> sed -i -E "/(SPUR|SHODAN)_KEY/ s/'[^']*'/'expunged (T380537)'/" settings.py [tools.bullseye]
18:01 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1068.eqiad.wmnet with reason: host reimage [production]
18:01 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye [production]
18:00 <lucaswerkmeister> sudo install -m600 settings.py{,-before-T380537} [tools.bullseye]
18:00 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
17:59 <btullis@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1069 [production]
17:58 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1068.eqiad.wmnet with reason: host reimage [production]
17:58 <btullis@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host an-worker1069 [production]
17:57 <btullis@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:57 <btullis@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Re-commissioning an-presto1005 as an-worker1069 - btullis@cumin1002" [production]
17:57 <btullis@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Re-commissioning an-presto1005 as an-worker1069 - btullis@cumin1002" [production]