production SAL

2701-2750 of 10000 results (47ms)

2021-05-05 §
06:06	<marostegui@cumin1001>	dbctl commit (dc=all): 'Remove db1104 from API', diff saved to https://phabricator.wikimedia.org/P15725 and previous config saved to /var/cache/conftool/dbconfig/20210505-060636-marostegui.json	[production]
06:00	<marostegui>	Restart mysqld on x1 database primary master (db1103) T281212	[production]
05:38	<marostegui@cumin1001>	dbctl commit (dc=all): 'Slowly repool db1099:3311 into main traffic', diff saved to https://phabricator.wikimedia.org/P15724 and previous config saved to /var/cache/conftool/dbconfig/20210505-053841-marostegui.json	[production]
05:32	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repool db1106 into s1 vslow, remove db1099:3311', diff saved to https://phabricator.wikimedia.org/P15723 and previous config saved to /var/cache/conftool/dbconfig/20210505-053211-marostegui.json	[production]
05:29	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1096:3316 for schema change', diff saved to https://phabricator.wikimedia.org/P15722 and previous config saved to /var/cache/conftool/dbconfig/20210505-052943-marostegui.json	[production]
04:53	<eileen>	civicrm revision changed from e7c610fd87 to 8034e47008, config revision is 189788d452	[production]
03:58	<ryankemper>	T280563 `sudo -i cookbook sre.elasticsearch.rolling-operation search_codfw "codfw reboot" --reboot --nodes-per-run 3 --start-datetime 2021-04-29T23:04:29 --task-id T280563` on `ryankemper@cumin1001` tmux session `elastic_restarts`	[production]
03:58	<ryankemper@cumin1001>	START - Cookbook sre.elasticsearch.rolling-operation reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_codfw: codfw reboot - ryankemper@cumin1001 - T280563	[production]
03:56	<ryankemper>	T280563 Reboot of `eqiad` complete. Only ~half of `codfw` is remaining.	[production]
03:56	<ryankemper@cumin1001>	END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad reboot to apply sec updates - ryankemper@cumin1001 - T280563	[production]
03:54	<ryankemper>	T280382 `wdqs1011.eqiad.wmnet` has been re-imaged and had the appropriate wikidata/categories journal files transferred. `df -h` shows disk space is no longer an issue following the switch to `raid0`: `/dev/mapper/vg0-srv 2.7T 998G 1.6T 39% /srv`	[production]
03:52	<ryankemper@cumin1001>	START - Cookbook sre.elasticsearch.rolling-operation reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad reboot to apply sec updates - ryankemper@cumin1001 - T280563	[production]
03:51	<ryankemper>	T280382 [WDQS] `ryankemper@wdqs2007:~$ sudo depool` (need to monitor host to see if it becomes ssh unreachable again or if it was a one-off; also high update lag)	[production]
03:50	<ryankemper>	T280382 `wdqs2007.codfw.wmnet` has been re-imaged and had the appropriate wikidata/categories journal files transferred. `df -h` shows disk space is no longer an issue following the switch to `raid0`: `/dev/mapper/vg0-srv 2.7T 998G 1.6T 39% /srv`	[production]
03:07	<ryankemper@cumin1001>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0)	[production]
03:02	<ryankemper@cumin1001>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0)	[production]
02:59	<ryankemper@cumin1001>	END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad reboot to apply sec updates - ryankemper@cumin1001 - T280563	[production]
01:55	<ryankemper>	T281327 [Elastic] Unbanned `elastic2043` from cluster	[production]
01:50	<ryankemper@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
01:49	<ryankemper>	T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs2001.codfw.wmnet --dest wdqs2007.codfw.wmnet --reason "transferring fresh wikidata journal following reimage" --blazegraph_instance blazegraph` on `ryankemper@cumin1001` tmux session `reimage` (will likely fail due to underlying hw but we'll see)	[production]
01:47	<ryankemper@cumin1001>	END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97)	[production]
01:45	<ryankemper>	T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs1006.eqiad.wmnet --dest wdqs1011.eqiad.wmnet --reason "transferring fresh wikidata journal following reimage" --blazegraph_instance blazegraph` on `ryankemper@cumin1001` tmux session `reimage`	[production]
01:45	<ryankemper@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
01:44	<ryankemper@cumin1001>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0)	[production]
01:43	<ryankemper>	T280382 [WDQS] `racadm>>racadm serveraction powercycle` on `wdqs2007`	[production]
01:39	<ryankemper>	T280382 `sudo -i cookbook sre.wdqs.data-transfer --source wdqs1006.eqiad.wmnet --dest wdqs1011.eqiad.wmnet --reason "transferring fresh categories journal following reimage" --blazegraph_instance categories` on `ryankemper@cumin1001` tmux session `reimage`	[production]
01:39	<ryankemper@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
01:36	<ryankemper@cumin1001>	START - Cookbook sre.elasticsearch.rolling-operation reboot without plugin upgrade (3 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad reboot to apply sec updates - ryankemper@cumin1001 - T280563	[production]
00:29	<eileen>	civicrm revision changed from 94e321dbe0 to e7c610fd87, config revision is 189788d452	[production]
00:15	<ejegg>	updated payments-wiki from 44570561f2 to d449599540	[production]
00:08	<urbanecm@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: 3f6ea8c0e5a4dc667969f5847207902727625bbe: Growth: enwiki: Add list of mentors (T281896) (duration: 01m 10s)	[production]
00:00	<urbanecm@deploy1002>	Synchronized fc-list: 93970496da7678d896b7f812b3bb5f4cf0b691ad: update fc-list to current version on buster (T79424) (duration: 01m 09s)	[production]
2021-05-04 §
23:41	<urbanecm@deploy1002>	Synchronized wmf-config/config/enwiki.yaml: d29dbb2f435afe64f2fee15b430ee04d5d13c8d7: Enable Growth features on enwiki in the dark mode (T281896; 3/3) (duration: 01m 09s)	[production]
23:40	<urbanecm@deploy1002>	Synchronized dblists/growthexperiments.dblist: d29dbb2f435afe64f2fee15b430ee04d5d13c8d7: Enable Growth features on enwiki in the dark mode (T281896; 2/3) (duration: 01m 09s)	[production]
23:38	<urbanecm@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: d29dbb2f435afe64f2fee15b430ee04d5d13c8d7: Enable Growth features on enwiki in the dark mode (T281896; 1/3) (duration: 01m 09s)	[production]
23:31	<urbanecm@deploy1002>	Synchronized wmf-config/config/bgwiki.yaml: 5b4c516a1d0461065e27cacec5d2b1cb315a2c07: Enable Growth team features in dark mode on bgwiki (T280824; 3/3) (duration: 01m 09s)	[production]
23:30	<urbanecm@deploy1002>	sync-file aborted: 5b4c516a1d0461065e27cacec5d2b1cb315a2c07: Enable Growth team features in dark mode on bgwiki (T280824; 3/3) (duration: 00m 03s)	[production]
23:30	<urbanecm@deploy1002>	Synchronized dblists/growthexperiments.dblist: 5b4c516a1d0461065e27cacec5d2b1cb315a2c07: Enable Growth team features in dark mode on bgwiki (T280824; 2/3) (duration: 01m 09s)	[production]
23:28	<urbanecm@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: 5b4c516a1d0461065e27cacec5d2b1cb315a2c07: Enable Growth team features in dark mode on bgwiki (T280824; 1/3) (duration: 01m 09s)	[production]
23:26	<Urbanecm>	Create tables for GrowthExperiments extension on enwiki (T281896)	[production]
23:24	<Urbanecm>	Create tables for GrowthExperiments extension on bgwiki (T280824)	[production]
23:22	<urbanecm@deploy1002>	Synchronized wmf-config/CommonSettings.php: a3c24f322b754c9a94c260ee5df4b5ae4de27f22: Avoid using User::getGroups() and ::getEffectiveGroups() (T281823) (duration: 01m 10s)	[production]
23:13	<urbanecm@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: e467d92e5e257a3d2f9b05692db9accdd86ddb00: Add extendedconfirmed on ptwiki (T281926) (duration: 01m 10s)	[production]
23:06	<urbanecm@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: 012d6138741ea76c985453428111aeddfdec2271: Add extendedconfirmed on azwiki (T281860) (duration: 01m 10s)	[production]
22:49	<bblack@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cp5016.eqsin.wmnet with reason: REIMAGE	[production]
22:47	<bblack@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cp5015.eqsin.wmnet with reason: REIMAGE	[production]
22:46	<bblack@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5014.eqsin.wmnet with reason: REIMAGE	[production]
22:44	<bblack@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp5016.eqsin.wmnet with reason: REIMAGE	[production]
22:44	<bblack@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5013.eqsin.wmnet with reason: REIMAGE	[production]
22:42	<bblack@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp5015.eqsin.wmnet with reason: REIMAGE	[production]