3251-3300 of 10000 results (95ms)
2023-04-07 §
18:19 <xcollazo@deploy2002> Finished deploy [airflow-dags/platform_eng@5c4ebda]: (no justification provided) (duration: 00m 35s) [production]
18:18 <xcollazo@deploy2002> Started deploy [airflow-dags/platform_eng@5c4ebda]: (no justification provided) [production]
17:02 <urandom> restart Cassandra, sessionstore1001-a (re-enabling CQL) — T327954 [production]
11:05 <aqu@deploy2002> Finished deploy [analytics/refinery@e70da10] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST 2nd try [analytics/refinery@e70da10] (duration: 01m 33s) [production]
11:03 <aqu@deploy2002> Started deploy [analytics/refinery@e70da10] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST 2nd try [analytics/refinery@e70da10] [production]
10:40 <aqu@deploy2002> Finished deploy [analytics/refinery@eb4c2b2] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST [analytics/refinery@eb4c2b2] (duration: 00m 06s) [production]
10:40 <aqu@deploy2002> Started deploy [analytics/refinery@eb4c2b2] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST [analytics/refinery@eb4c2b2] [production]
10:34 <aqu> About to deploy analytics/refinery in test cluster [production]
09:23 <ayounsi@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:23 <ayounsi@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: sonicmgmt - ayounsi@cumin1001" [production]
09:22 <ayounsi@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: sonicmgmt - ayounsi@cumin1001" [production]
09:20 <ayounsi@cumin1001> START - Cookbook sre.dns.netbox [production]
01:17 <urandom> rebooting sessionstore1001 — T327954 [production]
01:10 <urandom> rebooting sessionstore1001 — T327954 [production]
01:02 <urandom> rebooting sessionstore1001 — T327954 [production]
00:39 <urandom> rebooting sessionstore1001 — T327954 [production]
2023-04-06 §
22:05 <ejegg> SmashPig upgraded from 7c19151f to 24d700f4 [production]
22:04 <ejegg> payments-wiki upgraded from 75b068a1 to 0f15a101 [production]
21:52 <sbassett> Deployed updated mitigation for T333140 [production]
21:19 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:18 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:10 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180 (T333332)', diff saved to https://phabricator.wikimedia.org/P46154 and previous config saved to /var/cache/conftool/dbconfig/20230406-211054-ladsgroup.json [production]
21:05 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:04 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:02 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:02 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:00 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:00 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:59 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:57 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:55 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P46153 and previous config saved to /var/cache/conftool/dbconfig/20230406-205548-ladsgroup.json [production]
20:53 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:50 <eevans@cumin1001> conftool action : set/pooled=yes; selector: name=ms-fe1014.eqiad.wmnet [production]
20:49 <eevans@cumin1001> conftool action : set/pooled=yes; selector: name=ms-fe1013.eqiad.wmnet [production]
20:49 <eevans@cumin1001> conftool action : set/weight=40; selector: name=ms-fe1014.eqiad.wmnet [production]
20:49 <eevans@cumin1001> conftool action : set/weight=40; selector: name=ms-fe1013.eqiad.wmnet [production]
20:45 <eevans@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad [production]
20:44 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "remove info for new ssw as need to set back to planned to make homer happy - cmooney@cumin1001 - T322937" [production]
20:43 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "remove info for new ssw as need to set back to planned to make homer happy - cmooney@cumin1001 - T322937" [production]
20:41 <eevans@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad [production]
20:40 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P46152 and previous config saved to /var/cache/conftool/dbconfig/20230406-204041-ladsgroup.json [production]
20:25 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2180 (T333332)', diff saved to https://phabricator.wikimedia.org/P46151 and previous config saved to /var/cache/conftool/dbconfig/20230406-202535-ladsgroup.json [production]
20:23 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2180 (T333332)', diff saved to https://phabricator.wikimedia.org/P46150 and previous config saved to /var/cache/conftool/dbconfig/20230406-202319-ladsgroup.json [production]
20:23 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2180.codfw.wmnet with reason: Maintenance [production]
20:23 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db2180.codfw.wmnet with reason: Maintenance [production]
20:22 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2171:3316 (T333332)', diff saved to https://phabricator.wikimedia.org/P46149 and previous config saved to /var/cache/conftool/dbconfig/20230406-202256-ladsgroup.json [production]
20:16 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-fe1014.eqiad.wmnet [production]
20:15 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-fe1013.eqiad.wmnet [production]
20:09 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host ms-fe1014.eqiad.wmnet [production]
20:09 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host ms-fe1013.eqiad.wmnet [production]