production SAL

951-1000 of 10000 results (25ms)

2020-09-28 §
09:02	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:00	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0)	[production]
09:00	<klausman@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:56	<dcausse>	T263970: recovering lost apifeature indices (copying eqiad indices -> codfw)	[production]
08:55	<elukey@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
08:53	<elukey@cumin1001>	END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1)	[production]
08:46	<godog>	swift codfw-prod: bump object weight for ms-be2057 - T261633	[production]
08:43	<elukey@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
08:43	<elukey@cumin1001>	END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1)	[production]
08:43	<elukey@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
08:42	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0)	[production]
08:37	<elukey>	decommission the hadoop test cluster (analytics1028->41)	[production]
08:36	<elukey@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
08:36	<elukey@cumin1001>	END (ERROR) - Cookbook sre.hosts.decommission (exit_code=97)	[production]
08:35	<elukey@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
08:34	<elukey@cumin1001>	END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1)	[production]
08:34	<elukey@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
08:32	<ema>	text@eqiad: rolling varnish upgrade to 6.0.6-1wm1 T263557	[production]
08:28	<kormat@cumin1001>	dbctl commit (dc=all): 'db2125 (re)pooling @ 100%: mobo replaced T260670', diff saved to https://phabricator.wikimedia.org/P12813 and previous config saved to /var/cache/conftool/dbconfig/20200928-082825-kormat.json	[production]
08:21	<ema>	upload@eqiad: rolling varnish upgrade to 6.0.6-1wm1 T263557	[production]
08:21	<kormat@cumin1001>	dbctl commit (dc=all): 'Remove db2113 from contributions/logpager/recentchanges*/watchlist T263842', diff saved to https://phabricator.wikimedia.org/P12812 and previous config saved to /var/cache/conftool/dbconfig/20200928-082114-kormat.json	[production]
08:13	<kormat@cumin1001>	dbctl commit (dc=all): 'db2125 (re)pooling @ 75%: mobo replaced T260670', diff saved to https://phabricator.wikimedia.org/P12811 and previous config saved to /var/cache/conftool/dbconfig/20200928-081321-kormat.json	[production]
08:07	<jayme>	restarting pybal on lvs3005 for switching to conf1005 - T196487	[production]
08:06	<jayme>	restarting pybal on lvs3006 for switching to conf1005 - T196487	[production]
08:02	<jayme>	restarting pybal on lvs3007 for switching to conf1005 - T196487	[production]
08:02	<elukey@cumin1001>	END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0)	[production]
07:58	<kormat@cumin1001>	dbctl commit (dc=all): 'db2125 (re)pooling @ 50%: mobo replaced T260670', diff saved to https://phabricator.wikimedia.org/P12810 and previous config saved to /var/cache/conftool/dbconfig/20200928-075817-kormat.json	[production]
07:54	<elukey@cumin1001>	START - Cookbook sre.hadoop.stop-cluster	[production]
07:43	<kormat@cumin1001>	dbctl commit (dc=all): 'db2125 (re)pooling @ 25%: mobo replaced T260670', diff saved to https://phabricator.wikimedia.org/P12809 and previous config saved to /var/cache/conftool/dbconfig/20200928-074313-kormat.json	[production]
07:29	<_joe_>	restarting pybal on the LVS primaries	[production]
07:24	<dcausse>	T263970: forcing allocation of enwiki_general_1587198756 (chi@eqiad)	[production]
07:18	<_joe_>	restarting pybal on the backup LVS in eqiad, codfw to pick up the new wikifeeds endpoint	[production]
07:17	<elukey@cumin1001>	END (PASS) - Cookbook sre.presto.roll-restart-workers (exit_code=0)	[production]
07:09	<elukey@cumin1001>	START - Cookbook sre.presto.roll-restart-workers	[production]
06:59	<marostegui@cumin1001>	dbctl commit (dc=all): 'Promote es2028 as es1 master in codfw T261717', diff saved to https://phabricator.wikimedia.org/P12806 and previous config saved to /var/cache/conftool/dbconfig/20200928-065938-marostegui.json	[production]
06:15	<marostegui>	Set innodb_change_buffering = inserts; on db2089 (s5), db2106 (s4), db2108 (s2), db2085 (s1), db2085 (s8), db2087 (s7), db2087 (s6), db2109 (s3) T263443	[production]
05:55	<marostegui>	Stop MySQL on es2013 before decommissioning it T263740	[production]
05:54	<marostegui@cumin1001>	dbctl commit (dc=all): 'Remove es2013 from dbctl T263740', diff saved to https://phabricator.wikimedia.org/P12805 and previous config saved to /var/cache/conftool/dbconfig/20200928-055410-marostegui.json	[production]
05:48	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool es2013 T263740', diff saved to https://phabricator.wikimedia.org/P12804 and previous config saved to /var/cache/conftool/dbconfig/20200928-054846-marostegui.json	[production]
05:22	<marostegui>	Decrease labsdb1011 weight	[production]
2020-09-27 §
06:36	<elukey>	powercycle analytics1048	[production]
2020-09-26 §
19:20	<chrisalbon>	sudo service uwsgi-ores restart	[production]
02:17	<dzahn@cumin1001>	END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0)	[production]
02:04	<cdanis@cumin2001>	conftool action : set/pooled=false; selector: dnsdisc=ores,name=eqiad	[production]
02:04	<cdanis@cumin2001>	conftool action : set/pooled=true; selector: dnsdisc=ores,name=codfw	[production]
01:56	<cdanis>	❌cdanis@cumin2001.codfw.wmnet ~ 🕙🍺 sudo cumin 'A:ores and A:codfw' 'systemctl restart celery-ores-worker.service uwsgi-ores.service '	[production]
01:48	<cdanis@cumin1001>	conftool action : set/pooled=false; selector: dnsdisc=ores,name=codfw	[production]
01:48	<cdanis@cumin1001>	conftool action : set/pooled=true; selector: dnsdisc=ores,name=eqiad	[production]
01:17	<cdanis>	❌cdanis@ores2001.codfw.wmnet ~ 🕤🍺 sudo systemctl restart uwsgi-ores.service	[production]
01:11	<cdanis>	✔️ cdanis@ores2001.codfw.wmnet ~ 🕘🍺 sudo systemctl restart celery-ores-worker.service	[production]