production SAL

101-150 of 10000 results (113ms)

2026-02-20 §
10:48	<jgiannelos@deploy2002>	helmfile [codfw] START helmfile.d/services/mw-parsoid: apply	[production]
10:47	<jgiannelos@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply	[production]
10:47	<jgiannelos@deploy2002>	helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply	[production]
10:38	<jayme@cumin1003>	START - Cookbook sre.hosts.reimage for host kubestage2002.codfw.wmnet with OS trixie	[production]
10:37	<jayme@cumin1003>	START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on A:wikikube-staging-worker-codfw	[production]
10:34	<ammarpad@deploy2002>	mwscript-k8s job started: extensions/Translate/scripts/moveTranslatableBundle.php --wiki=metawiki --reason 'Requested at [[phab:T417862]]' 'Wikimedia Foundation/Advancement/Community Growth/Community Resources and Partnerships' 'Wikimedia Foundation/Advancement/Community Growth/Community Investment and Partnerships' Ammarpad # T417862	[production]
10:14	<jgiannelos@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/mw-parsoid: apply	[production]
10:13	<jgiannelos@deploy2002>	helmfile [eqiad] START helmfile.d/services/mw-parsoid: apply	[production]
09:41	<hashar>	Upgraded CI Jenkins from 2.528.3 to 2.541.2 # T417791	[production]
08:29	<brouberol@cumin1003>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cephosd1003.eqiad.wmnet	[production]
08:19	<brouberol@cumin1003>	START - Cookbook sre.hosts.reboot-single for host cephosd1003.eqiad.wmnet	[production]
07:18	<arnaudb@cumin1003>	END (PASS) - Cookbook sre.gerrit.sync-instances (exit_code=0) sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org	[production]
07:13	<arnaudb@cumin1003>	START - Cookbook sre.gerrit.sync-instances sync Gerrit data from gerrit2003.wikimedia.org to gerrit1003.wikimedia.org	[production]
01:49	<zabe@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1239497\|Start reading from new file tables on mediawikiwiki (T416548)]] (duration: 07m 17s)	[production]
01:45	<zabe@deploy2002>	zabe: Continuing with sync	[production]
01:44	<zabe@deploy2002>	zabe: Backport for [[gerrit:1239497\|Start reading from new file tables on mediawikiwiki (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
01:41	<zabe@deploy2002>	Started scap sync-world: Backport for [[gerrit:1239497\|Start reading from new file tables on mediawikiwiki (T416548)]]	[production]
00:33	<ryankemper>	[WDQS] Restarted blazegraph on `wdqs1014` as well. all 3 hosts were deadlocked	[production]
00:32	<ryankemper>	[WDQS] Restarted blazegraph on `wdqs101[1,3]`	[production]
2026-02-19 §
23:41	<marostegui@cumin1003>	dbctl commit (dc=all): 'Depooling db1241 (T415786)', diff saved to https://phabricator.wikimedia.org/P88911 and previous config saved to /var/cache/conftool/dbconfig/20260219-234101-marostegui.json	[production]
23:40	<marostegui@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1241.eqiad.wmnet with reason: Maintenance	[production]
23:40	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1238 (T415786)', diff saved to https://phabricator.wikimedia.org/P88910 and previous config saved to /var/cache/conftool/dbconfig/20260219-234036-marostegui.json	[production]
23:25	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P88909 and previous config saved to /var/cache/conftool/dbconfig/20260219-232528-marostegui.json	[production]
23:11	<marostegui@cumin1003>	dbctl commit (dc=all): 'Depooling db2210 (T415786)', diff saved to https://phabricator.wikimedia.org/P88908 and previous config saved to /var/cache/conftool/dbconfig/20260219-231101-marostegui.json	[production]
23:10	<marostegui@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2210.codfw.wmnet with reason: Maintenance	[production]
23:10	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2206 (T415786)', diff saved to https://phabricator.wikimedia.org/P88907 and previous config saved to /var/cache/conftool/dbconfig/20260219-231037-marostegui.json	[production]
23:10	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1238', diff saved to https://phabricator.wikimedia.org/P88906 and previous config saved to /var/cache/conftool/dbconfig/20260219-231020-marostegui.json	[production]
23:00	<ryankemper@cumin2002>	END (PASS) - Cookbook sre.hadoop.reboot-workers (exit_code=0) for Hadoop test cluster	[production]
22:55	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2206', diff saved to https://phabricator.wikimedia.org/P88904 and previous config saved to /var/cache/conftool/dbconfig/20260219-225529-marostegui.json	[production]
22:55	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1238 (T415786)', diff saved to https://phabricator.wikimedia.org/P88903 and previous config saved to /var/cache/conftool/dbconfig/20260219-225512-marostegui.json	[production]
22:40	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2206', diff saved to https://phabricator.wikimedia.org/P88902 and previous config saved to /var/cache/conftool/dbconfig/20260219-224020-marostegui.json	[production]
22:32	<egardner@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1240814\|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] (duration: 07m 31s)	[production]
22:27	<egardner@deploy2002>	egardner: Continuing with sync	[production]
22:26	<egardner@deploy2002>	egardner: Backport for [[gerrit:1240814\|Minerva TOC: Fix TOC instrumentation selectors (T415611)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
22:25	<marostegui@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db2206 (T415786)', diff saved to https://phabricator.wikimedia.org/P88901 and previous config saved to /var/cache/conftool/dbconfig/20260219-222512-marostegui.json	[production]
22:24	<egardner@deploy2002>	Started scap sync-world: Backport for [[gerrit:1240814\|Minerva TOC: Fix TOC instrumentation selectors (T415611)]]	[production]
22:24	<ryankemper>	T415696 Will be merging https://gerrit.wikimedia.org/r/c/operations/puppet/+/1237142 shortly, which will permanently decom the LDF endpoint for wdqs services	[production]
22:16	<ryankemper@cumin2002>	START - Cookbook sre.hadoop.reboot-workers for Hadoop test cluster	[production]
22:14	<ryankemper@cumin2002>	END (FAIL) - Cookbook sre.hadoop.reboot-workers (exit_code=99) for Hadoop test cluster	[production]
22:14	<ryankemper@cumin2002>	START - Cookbook sre.hadoop.reboot-workers for Hadoop test cluster	[production]
21:35	<cscott@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1240782\|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270\|Deploy PRV to 19 wikis (T417349)]] (duration: 10m 28s)	[production]
21:33	<jhathaway@dns1004>	END - running authdns-update	[production]
21:32	<jhathaway@dns1004>	START - running authdns-update	[production]
21:31	<cscott@deploy2002>	cscott, arlolra: Continuing with sync	[production]
21:26	<cscott@deploy2002>	cscott, arlolra: Backport for [[gerrit:1240782\|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270\|Deploy PRV to 19 wikis (T417349)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
21:24	<cscott@deploy2002>	Started scap sync-world: Backport for [[gerrit:1240782\|Enable parser survey for opted out users on some English-language wikis (T414852)]], [[gerrit:1239270\|Deploy PRV to 19 wikis (T417349)]]	[production]
21:16	<arlolra@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1240773\|Update Qids according to communication with communities (v20260219) (T417902)]], [[gerrit:1240779\|Fix finding joiner in the face of pwrapping (T411935)]] (duration: 07m 16s)	[production]
21:12	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2005.codfw.wmnet with OS bookworm	[production]
21:12	<jhancock@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apus-fe2004.codfw.wmnet with OS bookworm	[production]
21:12	<arlolra@deploy2002>	arlolra, aude: Continuing with sync	[production]