1801-1850 of 10000 results (115ms)
2024-10-13 §
10:22 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
10:22 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
10:22 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1214 (T367856)', diff saved to https://phabricator.wikimedia.org/P69701 and previous config saved to /var/cache/conftool/dbconfig/20241013-102205-ladsgroup.json [production]
10:06 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P69700 and previous config saved to /var/cache/conftool/dbconfig/20241013-100658-ladsgroup.json [production]
09:51 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P69699 and previous config saved to /var/cache/conftool/dbconfig/20241013-095151-ladsgroup.json [production]
09:36 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1214 (T367856)', diff saved to https://phabricator.wikimedia.org/P69698 and previous config saved to /var/cache/conftool/dbconfig/20241013-093644-ladsgroup.json [production]
2024-10-11 §
22:18 <btullis@cumin1002> END (PASS) - Cookbook sre.ceph.roll-restart-reboot-server (exit_code=0) rolling reboot on P{cephosd100[3-5]*} and (A:cephosd) [production]
21:38 <btullis@cumin1002> START - Cookbook sre.ceph.roll-restart-reboot-server rolling reboot on P{cephosd100[3-5]*} and (A:cephosd) [production]
21:36 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cephosd1002.eqiad.wmnet [production]
21:26 <btullis@cumin1002> START - Cookbook sre.hosts.reboot-single for host cephosd1002.eqiad.wmnet [production]
21:24 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cephosd1001.eqiad.wmnet [production]
21:14 <btullis@cumin1002> START - Cookbook sre.hosts.reboot-single for host cephosd1001.eqiad.wmnet [production]
16:57 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudlb2004-dev.codfw.wmnet with OS bookworm [production]
16:57 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:56 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin2002" [production]
16:49 <btullis@cumin1002> END (PASS) - Cookbook sre.ceph.roll-restart-reboot-server (exit_code=0) rolling reboot on A:cephosd [production]
16:40 <mfossati@deploy2002> Finished deploy [airflow-dags/platform_eng@c1d2914]: bump section topics to v0.16.0 (duration: 00m 42s) [production]
16:39 <mfossati@deploy2002> Started deploy [airflow-dags/platform_eng@c1d2914]: bump section topics to v0.16.0 [production]
16:38 <mfossati@deploy2002> Finished deploy [airflow-dags/platform_eng@c1d2914]: bump section topics to v0.16.0 (duration: 01m 06s) [production]
16:38 <mfossati@deploy2002> Started deploy [airflow-dags/platform_eng@c1d2914]: bump section topics to v0.16.0 [production]
16:37 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudlb2004-dev.codfw.wmnet with reason: host reimage [production]
16:34 <jhancock@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudlb2004-dev.codfw.wmnet with reason: host reimage [production]
16:16 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host cloudlb2004-dev.codfw.wmnet with OS bookworm [production]
16:14 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:14 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding cloudlb2004-dev to codfw - jhancock@cumin2002" [production]
16:14 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding cloudlb2004-dev to codfw - jhancock@cumin2002" [production]
16:11 <kcvelaga@deploy2002> Finished deploy [airflow-dags/analytics_product@1fb69c4]: T376456 (duration: 01m 15s) [production]
16:10 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
16:10 <kcvelaga@deploy2002> Started deploy [airflow-dags/analytics_product@1fb69c4]: T376456 [production]
15:41 <jhancock@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cloudlb2004-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTARTand with Dell SCP reboot policy FORCED [production]
15:40 <btullis@cumin1002> START - Cookbook sre.ceph.roll-restart-reboot-server rolling reboot on A:cephosd [production]
15:37 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:37 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add new entries for codfw cloudgw - cmooney@cumin1002" [production]
15:37 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add new entries for codfw cloudgw - cmooney@cumin1002" [production]
15:36 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host cloudlb2004-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTARTand with Dell SCP reboot policy FORCED [production]
15:34 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudlb2004-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTARTand with Dell SCP reboot policy FORCED [production]
15:34 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
15:32 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host cloudlb2004-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTARTand with Dell SCP reboot policy FORCED [production]
14:48 <eevans@deploy2002> helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply [production]
14:48 <eevans@deploy2002> helmfile [eqiad] START helmfile.d/services/data-gateway: apply [production]
14:47 <urandom> upgrading data-gateway to v1.0.10 [production]
14:46 <eevans@deploy2002> helmfile [codfw] DONE helmfile.d/services/data-gateway: apply [production]
14:46 <eevans@deploy2002> helmfile [codfw] START helmfile.d/services/data-gateway: apply [production]
14:39 <eevans@deploy2002> helmfile [staging] DONE helmfile.d/services/data-gateway: apply [production]
14:38 <eevans@deploy2002> helmfile [staging] START helmfile.d/services/data-gateway: apply [production]
14:31 <andrewtavis-wmde@deploy2002> Finished deploy [airflow-dags/wmde@c9a2532]: (no justification provided) (duration: 00m 25s) [production]
14:30 <andrewtavis-wmde@deploy2002> Started deploy [airflow-dags/wmde@c9a2532]: (no justification provided) [production]
13:59 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2175 (re)pooling @ 100%: T376988', diff saved to https://phabricator.wikimedia.org/P69695 and previous config saved to /var/cache/conftool/dbconfig/20241011-135903-arnaudb.json [production]
13:46 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudlb2004-dev.mgmt.codfw.wmnet with chassis set policy FORCE_RESTARTand with Dell SCP reboot policy FORCED [production]
13:43 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2175 (re)pooling @ 75%: T376988', diff saved to https://phabricator.wikimedia.org/P69694 and previous config saved to /var/cache/conftool/dbconfig/20241011-134357-arnaudb.json [production]