production SAL

3251-3300 of 10000 results (91ms)

2023-04-07 §
18:19	<xcollazo@deploy2002>	Finished deploy [airflow-dags/platform_eng@5c4ebda]: (no justification provided) (duration: 00m 35s)	[production]
18:18	<xcollazo@deploy2002>	Started deploy [airflow-dags/platform_eng@5c4ebda]: (no justification provided)	[production]
17:02	<urandom>	restart Cassandra, sessionstore1001-a (re-enabling CQL) — T327954	[production]
11:05	<aqu@deploy2002>	Finished deploy [analytics/refinery@e70da10] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST 2nd try [analytics/refinery@e70da10] (duration: 01m 33s)	[production]
11:03	<aqu@deploy2002>	Started deploy [analytics/refinery@e70da10] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST 2nd try [analytics/refinery@e70da10]	[production]
10:40	<aqu@deploy2002>	Finished deploy [analytics/refinery@eb4c2b2] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST [analytics/refinery@eb4c2b2] (duration: 00m 06s)	[production]
10:40	<aqu@deploy2002>	Started deploy [analytics/refinery@eb4c2b2] (hadoop-test): Deploy analytics_refinery including last webrquest load scripts in TEST [analytics/refinery@eb4c2b2]	[production]
10:34	<aqu>	About to deploy analytics/refinery in test cluster	[production]
09:23	<ayounsi@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
09:23	<ayounsi@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: sonicmgmt - ayounsi@cumin1001"	[production]
09:22	<ayounsi@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: sonicmgmt - ayounsi@cumin1001"	[production]
09:20	<ayounsi@cumin1001>	START - Cookbook sre.dns.netbox	[production]
01:17	<urandom>	rebooting sessionstore1001 — T327954	[production]
01:10	<urandom>	rebooting sessionstore1001 — T327954	[production]
01:02	<urandom>	rebooting sessionstore1001 — T327954	[production]
00:39	<urandom>	rebooting sessionstore1001 — T327954	[production]
2023-04-06 §
22:05	<ejegg>	SmashPig upgraded from 7c19151f to 24d700f4	[production]
22:04	<ejegg>	payments-wiki upgraded from 75b068a1 to 0f15a101	[production]
21:52	<sbassett>	Deployed updated mitigation for T333140	[production]
21:19	<jclark@cumin1001>	START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:18	<jclark@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:10	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180 (T333332)', diff saved to https://phabricator.wikimedia.org/P46154 and previous config saved to /var/cache/conftool/dbconfig/20230406-211054-ladsgroup.json	[production]
21:05	<jclark@cumin1001>	START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:04	<jclark@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:02	<jclark@cumin1001>	START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:02	<jclark@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:00	<jclark@cumin1001>	START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
21:00	<jclark@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
20:59	<jclark@cumin1001>	START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
20:57	<jclark@cumin1001>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
20:55	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P46153 and previous config saved to /var/cache/conftool/dbconfig/20230406-205548-ladsgroup.json	[production]
20:53	<jclark@cumin1001>	START - Cookbook sre.hosts.provision for host cloudvirtlocal1001.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
20:50	<eevans@cumin1001>	conftool action : set/pooled=yes; selector: name=ms-fe1014.eqiad.wmnet	[production]
20:49	<eevans@cumin1001>	conftool action : set/pooled=yes; selector: name=ms-fe1013.eqiad.wmnet	[production]
20:49	<eevans@cumin1001>	conftool action : set/weight=40; selector: name=ms-fe1014.eqiad.wmnet	[production]
20:49	<eevans@cumin1001>	conftool action : set/weight=40; selector: name=ms-fe1013.eqiad.wmnet	[production]
20:45	<eevans@cumin1001>	END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad	[production]
20:44	<cmooney@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "remove info for new ssw as need to set back to planned to make homer happy - cmooney@cumin1001 - T322937"	[production]
20:43	<cmooney@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "remove info for new ssw as need to set back to planned to make homer happy - cmooney@cumin1001 - T322937"	[production]
20:41	<eevans@cumin1001>	START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad	[production]
20:40	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180', diff saved to https://phabricator.wikimedia.org/P46152 and previous config saved to /var/cache/conftool/dbconfig/20230406-204041-ladsgroup.json	[production]
20:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2180 (T333332)', diff saved to https://phabricator.wikimedia.org/P46151 and previous config saved to /var/cache/conftool/dbconfig/20230406-202535-ladsgroup.json	[production]
20:23	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2180 (T333332)', diff saved to https://phabricator.wikimedia.org/P46150 and previous config saved to /var/cache/conftool/dbconfig/20230406-202319-ladsgroup.json	[production]
20:23	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2180.codfw.wmnet with reason: Maintenance	[production]
20:23	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db2180.codfw.wmnet with reason: Maintenance	[production]
20:22	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2171:3316 (T333332)', diff saved to https://phabricator.wikimedia.org/P46149 and previous config saved to /var/cache/conftool/dbconfig/20230406-202256-ladsgroup.json	[production]
20:16	<eevans@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-fe1014.eqiad.wmnet	[production]
20:15	<eevans@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-fe1013.eqiad.wmnet	[production]
20:09	<eevans@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ms-fe1014.eqiad.wmnet	[production]
20:09	<eevans@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ms-fe1013.eqiad.wmnet	[production]