production SAL

2022-04-08
08:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P24282 and previous config saved to /var/cache/conftool/dbconfig/20220408-080335-ladsgroup.json	[production]
08:01	<jynus@cumin2002>	START - Cookbook sre.hosts.reimage for host db2151.codfw.wmnet with OS bullseye	[production]
07:59	<jynus@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1176.eqiad.wmnet with OS bullseye	[production]
07:54	<jynus@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbprov2001.codfw.wmnet with reason: host reimage	[production]
07:50	<jynus@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on dbprov2001.codfw.wmnet with reason: host reimage	[production]
07:50	<mmandere@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6003.drmrs.wmnet with reason: host reimage	[production]
07:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174 (T305300)', diff saved to https://phabricator.wikimedia.org/P24281 and previous config saved to /var/cache/conftool/dbconfig/20220408-074829-ladsgroup.json	[production]
07:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1174 (T305300)', diff saved to https://phabricator.wikimedia.org/P24280 and previous config saved to /var/cache/conftool/dbconfig/20220408-074723-ladsgroup.json	[production]
07:47	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
07:47	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
07:46	<mmandere@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp6003.drmrs.wmnet with reason: host reimage	[production]
07:45	<jynus@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1176.eqiad.wmnet with reason: host reimage	[production]
07:42	<jynus@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1176.eqiad.wmnet with reason: host reimage	[production]
07:42	<mmandere@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp6011.drmrs.wmnet with reason: host reimage	[production]
07:39	<mmandere@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp6011.drmrs.wmnet with reason: host reimage	[production]
07:36	<jynus@cumin2002>	START - Cookbook sre.hosts.reimage for host dbprov2001.codfw.wmnet with OS bullseye	[production]
07:34	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1169 (re)pooling @ 100%: After schema change', diff saved to https://phabricator.wikimedia.org/P24279 and previous config saved to /var/cache/conftool/dbconfig/20220408-073442-root.json	[production]
07:31	<jynus@cumin1001>	START - Cookbook sre.hosts.reimage for host db1176.eqiad.wmnet with OS bullseye	[production]
07:28	<mmandere@cumin1001>	START - Cookbook sre.hosts.reimage for host cp6003.drmrs.wmnet with OS buster	[production]
07:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1105:3311 (T298565)', diff saved to https://phabricator.wikimedia.org/P24278 and previous config saved to /var/cache/conftool/dbconfig/20220408-072615-ladsgroup.json	[production]
07:26	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1105.eqiad.wmnet with reason: Maintenance	[production]
07:26	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1105.eqiad.wmnet with reason: Maintenance	[production]
07:21	<mmandere>	depool cp6003 for reimage - T290005	[production]
07:21	<mmandere@cumin1001>	START - Cookbook sre.hosts.reimage for host cp6011.drmrs.wmnet with OS buster	[production]
07:19	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1169 (re)pooling @ 75%: After schema change', diff saved to https://phabricator.wikimedia.org/P24277 and previous config saved to /var/cache/conftool/dbconfig/20220408-071938-root.json	[production]
07:12	<mmandere>	depool cp6011 for reimage - T290005	[production]
07:04	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1169 (re)pooling @ 50%: After schema change', diff saved to https://phabricator.wikimedia.org/P24276 and previous config saved to /var/cache/conftool/dbconfig/20220408-070434-root.json	[production]
06:49	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1169 (re)pooling @ 25%: After schema change', diff saved to https://phabricator.wikimedia.org/P24275 and previous config saved to /var/cache/conftool/dbconfig/20220408-064930-root.json	[production]
06:38	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1140.eqiad.wmnet with reason: Maintenance	[production]
06:38	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1140.eqiad.wmnet with reason: Maintenance	[production]
06:34	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1169 (re)pooling @ 10%: After schema change', diff saved to https://phabricator.wikimedia.org/P24274 and previous config saved to /var/cache/conftool/dbconfig/20220408-063426-root.json	[production]
06:19	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1169 (re)pooling @ 5%: After schema change', diff saved to https://phabricator.wikimedia.org/P24273 and previous config saved to /var/cache/conftool/dbconfig/20220408-061922-root.json	[production]
05:10	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1169', diff saved to https://phabricator.wikimedia.org/P24272 and previous config saved to /var/cache/conftool/dbconfig/20220408-051044-root.json	[production]
02:30	<bking@cumin1001>	END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.REBOOT (3 nodes at a time) for ElasticSearch cluster search_eqiad: security updates - bking@cumin1001 - T304938	[production]
2022-04-07
22:18	<ejegg>	restarted fundraising scheduled jobs	[production]
22:08	<ejegg>	updated fundraising CiviCRM from 7b7b284d to a90a6709	[production]
22:05	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
22:01	<cmjohnson@cumin1001>	START - Cookbook sre.dns.netbox	[production]
21:46	<ejegg>	disabled fundraising scheduled jobs for CiviCRM upgrade	[production]
21:26	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1101.eqiad.wmnet with OS bullseye	[production]
21:24	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1100.eqiad.wmnet with OS bullseye	[production]
21:23	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1102.eqiad.wmnet with OS bullseye	[production]
21:19	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1099.eqiad.wmnet with OS bullseye	[production]
21:16	<cmjohnson@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on elastic1101.eqiad.wmnet with reason: host reimage	[production]
21:15	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1100.eqiad.wmnet with reason: host reimage	[production]
21:13	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1098.eqiad.wmnet with OS bullseye	[production]
21:13	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1102.eqiad.wmnet with reason: host reimage	[production]
21:10	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1099.eqiad.wmnet with reason: host reimage	[production]
21:09	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1097.eqiad.wmnet with OS bullseye	[production]
21:07	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1102.eqiad.wmnet with reason: host reimage	[production]