production SAL

3001-3050 of 10000 results (33ms)

2020-10-09 §
11:38	<jayme@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'mobileapps' for release 'production' .	[production]
11:16	<jayme@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'mobileapps' for release 'production' .	[production]
11:13	<jayme@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'mathoid' for release 'production' .	[production]
11:13	<jayme@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'mobileapps' for release 'staging' .	[production]
10:52	<jayme@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'mathoid' for release 'staging' .	[production]
10:41	<gehel@cumin1001>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99)	[production]
10:17	<jayme@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' .	[production]
10:17	<jayme@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'eventgate-main' for release 'production' .	[production]
10:16	<jayme@deploy1001>	helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' .	[production]
10:11	<jayme@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'canary' .	[production]
10:11	<jayme@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-main' for release 'production' .	[production]
09:55	<jayme@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'eventgate-main' for release 'production' .	[production]
09:53	<jayme@deploy1001>	helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' .	[production]
09:47	<elukey>	roll restart of hadoop-yarn-nodemanager on all hadoop workers to pick up new settings	[production]
09:38	<jayme@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' .	[production]
09:38	<jayme@deploy1001>	helmfile [staging] Ran 'sync' command on namespace 'eventgate-analytics' for release 'canary' .	[production]
09:32	<kormat@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:32	<kormat@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:07	<XioNoX>	remove user from all network devices	[production]
08:22	<marostegui>	Restart dbstore1005 mysql to pick up new buffer pool sizes	[production]
08:11	<filippo@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
08:11	<filippo@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
07:36	<moritzm>	installing xen security updates for buster (libs only)	[production]
07:34	<filippo@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
07:34	<filippo@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
00:16	<dzahn@cumin1001>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0)	[production]
00:00	<dzahn@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
2020-10-08 §
23:42	<ryankemper>	`cloudelastic1006` done. Writes thawed, maintenance window lifted; restarts are done for `cloudelastic`	[production]
23:37	<ryankemper>	`cloudelastic1005` done	[production]
23:31	<ryankemper>	`cloudelastic1004` done	[production]
23:27	<ryankemper>	`cloudelastic1003` done	[production]
23:23	<ryankemper>	`cloudelastic1002` done	[production]
23:16	<tgr_>	Evening deploys done	[production]
23:16	<ryankemper>	`cloudelastic1001` is done restarting and cluster is green again. Proceeding to `cloudelastic1002`	[production]
23:16	<tgr@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:632797\|Enable logging of session cookie changes everywhere (T264793)]] (duration: 01m 01s)	[production]
23:04	<ryankemper>	Beginning cluster restarts one server at a time. For each server, the process is depool->restart elasticsearch services->wait for services to restart and then pool->wait for cluster to return to green status before starting next server	[production]
23:01	<ryankemper>	Writes are frozen for `cloudelastic`: `/usr/local/bin/mwscript extensions/CirrusSearch/maintenance/FreezeWritesToCluster.php --wiki=enwiki --cluster=cloudelastic` on `mwmaint2001` => `Applied cluster-wide freeze`	[production]
22:56	<ryankemper>	`sudo apt policy wmf-elasticsearch-search-plugins` shows correct state: `Installed: 6.5.4-4~stretch`	[production]
22:56	<ryankemper>	`sudo -E cumin -b 6 C:role::elasticsearch::cloudelastic 'DEBIAN_FRONTEND=noninteractive sudo apt-get -y -o Dpkg::Options::="--force-confdef" -o Dpkg::Options::="--force-confold" install wmf-elasticsearch-search-plugins'`	[production]
22:54	<ryankemper>	About to start plugin upgrade followed by restarts of `cloudelastic`. Maintenance window set for the next 2 hours on `cloudelastic100[1-6]`	[production]
21:54	<ebernhardson@deploy1001>	Finished deploy [wikimedia/discovery/analytics@a923949]: search_satisfaction: update druid datasource to match previous data (duration: 01m 04s)	[production]
21:53	<ebernhardson@deploy1001>	Started deploy [wikimedia/discovery/analytics@a923949]: search_satisfaction: update druid datasource to match previous data	[production]
21:52	<hashar@deploy1001>	Synchronized php-1.36.0-wmf.10/includes/session/SessionBackend.php: Deduplicate SessionBackend::logPersistenceChange calls - T264793 (duration: 01m 01s)	[production]
21:05	<volans@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
21:00	<volans@cumin1001>	START - Cookbook sre.dns.netbox	[production]
21:00	<volans@cumin1001>	END (ERROR) - Cookbook sre.dns.netbox (exit_code=97)	[production]
21:00	<volans@cumin1001>	START - Cookbook sre.dns.netbox	[production]
20:50	<volans@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
20:45	<volans@cumin1001>	START - Cookbook sre.dns.netbox	[production]
20:43	<volans>	deploying Netbox DNS zone consolidation - T264273	[production]