2024-11-15 §
11:14 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
11:12 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
11:12 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
11:06 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
11:06 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
11:05 <claime> homer 'cr*eqiad*' commit 'T377022' [production]
10:56 <wmbot~taavi@tools-bastion-12> $ toolforge envvars create RUST_LOG debug # attempting to debug where the bot is [tools.ircservserv]
10:36 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
10:36 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:36 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:34 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on pc1013.eqiad.wmnet with reason: T373037, host is not pooled [production]
09:34 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on pc1013.eqiad.wmnet with reason: T373037, host is not pooled [production]
09:31 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:28 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:28 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:28 <elukey@cumin2002> END (ERROR) - Cookbook sre.hosts.provision (exit_code=97) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:27 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:23 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:23 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:22 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:21 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host thanos-be2005.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
09:15 <aokoth@cumin1002> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Security Update [production]
08:48 <moritzm> installing Linux 6.1.115 kernel updates from Bookworm point release [production]
04:54 <rzl@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 12:00:00 on db1246.eqiad.wmnet with reason: depooled [production]
04:54 <rzl@cumin2002> START - Cookbook sre.hosts.downtime for 3 days, 12:00:00 on db1246.eqiad.wmnet with reason: depooled [production]
04:51 <rzl@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 12:00:00 on db1246.eqiad.wmnet with reason: depooled [production]
04:50 <rzl@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 12:00:00 on db1246.eqiad.wmnet with reason: depooled [production]
04:47 <rzl@cumin2002> dbctl commit (dc=all): 'db1246 depooled', diff saved to https://phabricator.wikimedia.org/P71052 and previous config saved to /var/cache/conftool/dbconfig/20241115-044705-rzl.json [production]
03:44 <ejegg> fundraising python tools upgraded from c6e2dbcc to b230f718 [production]
2024-11-14 §
23:17 <eileen> civicrm upgraded from 2a53f697 to d49a064d [production]
22:59 <eileen> civicrm upgraded from 2ab8334a to 2a53f697 [production]
22:37 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on cp4043.ulsfo.wmnet with reason: ATS upgrade 9.2.6 [production]
22:37 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on cp4043.ulsfo.wmnet with reason: ATS upgrade 9.2.6 [production]
22:30 <ryankemper> T376150 Depooled `wdqs20[18-20]` in preparation of merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/1088185 [production]
21:49 <aqu@deploy2002> Finished deploy [airflow-dags/analytics@7a66849]: Stage Refine: fix Airflow skip (duration: 00m 59s) [production]
21:48 <aqu@deploy2002> Started deploy [airflow-dags/analytics@7a66849]: Stage Refine: fix Airflow skip [production]
21:47 <aqu@deploy2002> Finished deploy [airflow-dags/analytics_test@7a66849]: Stage Refine: fix Airflow skip (duration: 00m 14s) [production]
21:47 <aqu@deploy2002> Started deploy [airflow-dags/analytics_test@7a66849]: Stage Refine: fix Airflow skip [production]
21:41 <dancy> Reverting scap to 4.101.1 in deployment-prep. [releng]
21:37 <dancy> Installed scap 4.123.0 in deployment-prep. [releng]
21:26 <aqu@deploy2002> Finished deploy [airflow-dags/analytics_test@2220747]: Stage Refine test fix (duration: 00m 16s) [production]
21:26 <aqu@deploy2002> Started deploy [airflow-dags/analytics_test@2220747]: Stage Refine test fix [production]
21:20 <cjming> end of UTC late backport window [production]
21:17 <cjming@deploy2002> Finished scap sync-world: Backport for [[gerrit:1082853|Redirect to wikis using subpages rather than namespaces too (T376923)]] (duration: 13m 44s) [production]
21:13 <cjming@deploy2002> cjming, pppery: Continuing with sync [production]
21:07 <cjming@deploy2002> cjming, pppery: Backport for [[gerrit:1082853|Redirect to wikis using subpages rather than namespaces too (T376923)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:04 <cjming@deploy2002> Started scap sync-world: Backport for [[gerrit:1082853|Redirect to wikis using subpages rather than namespaces too (T376923)]] [production]
20:47 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2139.codfw.wmnet with OS bookworm [production]
20:47 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
20:38 <bvibber@deploy2002> helmfile [codfw] DONE helmfile.d/services/chart-renderer: apply [production]