production SAL

3001-3050 of 10000 results (26ms)

2021-02-10 §
05:58	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1076 to clone db1162 T258361', diff saved to https://phabricator.wikimedia.org/P14277 and previous config saved to /var/cache/conftool/dbconfig/20210210-055846-marostegui.json	[production]
03:46	<ryankemper>	`ryankemper@wdqs1012:~$ sudo systemctl restart wdqs-blazegraph.service`	[production]
01:54	<krinkle@deploy1001>	Finished deploy [integration/docroot@0234db2]: Unbreak doc.wm.o (2) - Ib67da94fb1bdf0 (duration: 00m 06s)	[production]
01:54	<krinkle@deploy1001>	Started deploy [integration/docroot@0234db2]: Unbreak doc.wm.o (2) - Ib67da94fb1bdf0	[production]
01:43	<krinkle@deploy1001>	Finished deploy [integration/docroot@fddc7c9]: Unbreak doc.wm.o - Ibf28e02ec03 (duration: 00m 06s)	[production]
01:43	<krinkle@deploy1001>	Started deploy [integration/docroot@fddc7c9]: Unbreak doc.wm.o - Ibf28e02ec03	[production]
01:06	<milimetric@deploy1001>	Finished deploy [analytics/refinery@b539bf6] (thin): Job fixes after Hadoop upgrade (duration: 00m 06s)	[production]
01:06	<milimetric@deploy1001>	Started deploy [analytics/refinery@b539bf6] (thin): Job fixes after Hadoop upgrade	[production]
01:06	<milimetric@deploy1001>	Finished deploy [analytics/refinery@b539bf6]: Job fixes after Hadoop upgrade (duration: 10m 55s)	[production]
00:58	<mutante>	doc1001 - reloaded apache2	[production]
00:55	<milimetric@deploy1001>	Started deploy [analytics/refinery@b539bf6]: Job fixes after Hadoop upgrade	[production]
00:42	<Amir1>	changing frwiki to wmf.30 in mwdebug1002 to test T264391	[production]
00:33	<ladsgroup@deploy1001>	Synchronized php-1.36.0-wmf.30/extensions/FeaturedFeeds: [[gerrit:662965\|Fix issues with recent caching update]] (T264391) (duration: 01m 10s)	[production]
00:22	<twentyafterfour@deploy1001>	Finished scap: testwikis wikis to 1.36.0-wmf.30 (duration: 24m 10s)	[production]
00:01	<twentyafterfour>	train status: wmf.28 and wmf.29 are undeployed. wmf.27 is everywhere with the exception of testwikis which is at wmf.30 refs T271344	[production]
2021-02-09 §
23:58	<twentyafterfour@deploy1001>	Started scap: testwikis wikis to 1.36.0-wmf.30	[production]
23:56	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw2250.codfw.wmnet	[production]
23:55	<ryankemper>	Depooled `wdqs1005` - it's catching up on hours of lag	[production]
23:55	<twentyafterfour@deploy1001>	Finished scap: (no justification provided) (duration: 08m 43s)	[production]
23:53	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw2250.codfw.wmnet	[production]
23:50	<mutante>	mw1383,mw1385 - scap pull, php	[production]
23:48	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1296.eqiad.wmnet	[production]
23:47	<twentyafterfour>	running scap sync-world	[production]
23:47	<twentyafterfour@deploy1001>	Started scap: (no justification provided)	[production]
23:46	<twentyafterfour@deploy1001>	rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.27	[production]
23:40	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1296.eqiad.wmnet	[production]
23:33	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1380.eqiad.wmnet	[production]
23:32	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1380.eqiad.wmnet	[production]
23:28	<mutante>	mw1380 - powercycling after it did not come back from normal reboot during reimaging	[production]
23:23	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1372.eqiad.wmnet	[production]
23:18	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1372.eqiad.wmnet	[production]
23:05	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2250.codfw.wmnet with reason: REIMAGE	[production]
23:03	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2250.codfw.wmnet with reason: REIMAGE	[production]
22:57	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1296.eqiad.wmnet with reason: REIMAGE	[production]
22:54	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1296.eqiad.wmnet with reason: REIMAGE	[production]
22:49	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1372.eqiad.wmnet with reason: REIMAGE	[production]
22:46	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1372.eqiad.wmnet with reason: REIMAGE	[production]
22:34	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw2259.codfw.wmnet	[production]
22:31	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw2259.codfw.wmnet	[production]
22:29	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1373.eqiad.wmnet	[production]
22:28	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1373.eqiad.wmnet	[production]
22:26	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1298.eqiad.wmnet	[production]
22:23	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=mw1298.eqiad.wmnet	[production]
22:23	<legoktm@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: Enable GlobalWatchlist extension on testwiki (T260862) (duration: 02m 51s)	[production]
22:03	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2259.codfw.wmnet with reason: REIMAGE	[production]
22:01	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1380.eqiad.wmnet with reason: REIMAGE	[production]
22:00	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2259.codfw.wmnet with reason: REIMAGE	[production]
21:59	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1373.eqiad.wmnet with reason: REIMAGE	[production]
21:58	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1380.eqiad.wmnet with reason: REIMAGE	[production]
21:57	<dzahn@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw1373.eqiad.wmnet with reason: REIMAGE	[production]