production SAL

2019-07-03
11:05	<moritzm>	rebooting krypton nodes to pick up MDS-enabled qemu	[production]
11:05	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
11:05	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
11:04	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
11:04	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
10:36	<Amir1>	start of ladsgroup@mwmaint1002:~$ foreachwikiindblist wiktionary extensions/Cognate/maintenance/populateCognatePages.php (T226358)	[production]
10:12	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
10:11	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
10:11	<moritzm>	rolling reboot of eventschema service hosts to pick up MDS-enabled qemu	[production]
10:00	<marostegui>	Drop secret and scratch_tokens columns from the private wiki list T226826	[production]
09:58	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:58	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
09:54	<moritzm>	rebooting netmon2001 for kernel security update	[production]
09:52	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:52	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
09:47	<moritzm>	rebooting debmonitor nodes to pick up MDS-enabled qemu	[production]
09:46	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:46	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
09:27	<moritzm>	rebooting failoid nodes to pick up MDS-enabled qemu	[production]
09:25	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:25	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
09:01	<moritzm>	rolling reboot of kubernetes masters in eqiad to pick up MDS-enabled qemu	[production]
08:44	<moritzm>	rolling reboot of kubernetes masters in codfw to pick up MDS-enabled qemu	[production]
08:44	<moritzm>	rolling reboot of kubernetes masters in codfw	[production]
08:43	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
08:43	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
07:45	<jmm@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
07:45	<jmm@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
07:34	<godog>	reenable puppet fleetwide	[production]
07:33	<marostegui>	Upgrade db2078 (s8 codfw master)	[production]
07:25	<marostegui>	Upgrade db2100 (snapshots on that host are finished)	[production]
07:24	<godog>	temporarily disable puppet to test/apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/520012	[production]
07:23	<moritzm>	updated buster installer d-i image to RC3	[production]
07:10	<marostegui>	Drop secret and scratch_tokens from labswiki (wikitech) and labstestwiki - T226826	[production]
07:06	<marostegui>	Drop secret and scratch_tokens from fishbowl wiki list T226826	[production]
07:05	<godog>	add 150G to graphite hosts lv, was at 94% utilization	[production]
06:55	<godog>	depool and roll-restart swift proxy - T209182	[production]
06:42	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Clarify db1069 status (duration: 00m 28s)	[production]
06:01	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Switchover x1 master eqiad from db1069 to db1120 T226358 (duration: 00m 27s)	[production]
06:00	<marostegui>	Starting x1 failover from db1069 to db1120 - T226358	[production]
06:00	<elukey>	move the zookeeper puppet submodule into operations/puppet - T226466	[production]
05:52	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
05:52	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
05:21	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
05:21	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
05:03	<vgutierrez>	restarting pybal on lvs4006	[production]
05:02	<marostegui>	Start pre-failover steps for x1 - T226358	[production]
04:47	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
04:47	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
04:34	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]