2020-06-29

08:03 <godog> prometheus eqiad -- lvextend --resizefs --size +200G vg-ssd/prometheus-ops [production]
08:02 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly pool db1135 into s1 T253217', diff saved to https://phabricator.wikimedia.org/P11685 and previous config saved to /var/cache/conftool/dbconfig/20200629-080253-marostegui.json [production]
07:46 <marostegui@cumin1001> dbctl commit (dc=all): 'Add db1135 (depooled) to s1 T253217', diff saved to https://phabricator.wikimedia.org/P11684 and previous config saved to /var/cache/conftool/dbconfig/20200629-074611-marostegui.json [production]
07:16 <XioNoX> push new pfw firewall rules - T256170 [production]
07:13 <marostegui> Deploy schema change on db1085 with replication to labs T253276 [production]
07:12 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1085', diff saved to https://phabricator.wikimedia.org/P11683 and previous config saved to /var/cache/conftool/dbconfig/20200629-071236-marostegui.json [production]
06:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove db1080 from MW', diff saved to https://phabricator.wikimedia.org/P11682 and previous config saved to /var/cache/conftool/dbconfig/20200629-065335-marostegui.json [production]
06:50 <elukey> execute gnt-instance remove an-launcher1001.eqiad.wmnet on ganeti1011 - T256363 [production]
06:47 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
06:46 <elukey@cumin1001> START - Cookbook sre.hosts.decommission [production]
06:45 <marostegui> Deploy MCR schema change on db1090:3312 [production]
06:35 <elukey> force puppet run on ores* to overcome celery OOMs on some nodes [production]
04:57 <marostegui> Stop MySQL on db1080 to clone db1135 T253217 [production]
04:56 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
04:53 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
2020-06-27

20:22 <qchris> Gerrit upgrade done. [production]
19:49 <mutante> removed 2620:0:861:3:208:80:154:136 from /etc/network/interfaces on gerrit1001, rebooting [production]
19:27 <mutante> rebooting gerrit1001 one more time [production]
19:24 <mutante> restarted ferm on gerrit1001 [production]
19:19 <mutante> rebooting gerrit1001 one more time [production]
19:05 <mutante> rebooting gerrit1001 [production]
18:58 <mutante> rebooting gerrit2001 [production]
18:49 <hashar> Enabling beta cluster update job (gerrit maintenance) https://integration.wikimedia.org/ci/view/Beta/job/beta-code-update-eqiad/ [production]
18:35 <qchris@deploy1001> Finished deploy [gerrit/gerrit@da40615]: Gerrit to v3.2.2-98-g98d827eaa3 on gerrit2001 (duration: 00m 10s) [production]
18:34 <qchris@deploy1001> Started deploy [gerrit/gerrit@da40615]: Gerrit to v3.2.2-98-g98d827eaa3 on gerrit2001 [production]
18:27 <qchris@deploy1001> Finished deploy [gerrit/gerrit@da40615]: Gerrit to v3.2.2-98-g98d827eaa3 on gerrit1001 (duration: 00m 08s) [production]
18:27 <qchris@deploy1001> Started deploy [gerrit/gerrit@da40615]: Gerrit to v3.2.2-98-g98d827eaa3 on gerrit1001 [production]
17:25 <hashar> Disabled beta cluster update job (gerrit maintenance) https://integration.wikimedia.org/ci/view/Beta/job/beta-code-update-eqiad/ [production]
17:19 <qchris> Stopping gerrit on gerrit1001 for the Gerrit upgrade [production]
17:14 <qchris> Duplicating reviewdb changes so we get a cheap and quick rollback [production]
17:11 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
17:11 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime [production]
17:11 <qchris> Disabling puppet on gerrit1001 for Gerrit upgrades + data migrations [production]
17:11 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
17:11 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime [production]
17:07 <qchris> Starting Gerrit upgrade to v3.2.2-98-g98d827eaa3 [production]
15:44 <qchris@deploy1001> Finished deploy [gerrit/gerrit@da40615]: Gerrit to v3.2.2-98-g98d827eaa3 on gerrit1002 (gerrit-test) (duration: 00m 08s) [production]
15:44 <qchris@deploy1001> Started deploy [gerrit/gerrit@da40615]: Gerrit to v3.2.2-98-g98d827eaa3 on gerrit1002 (gerrit-test) [production]
13:03 <qchris@deploy1001> Finished deploy [gerrit/gerrit@460e439]: Gerrit to v3.2.2-97-gcaf5020db1 on gerrit1002 (gerrit-test) (duration: 00m 08s) [production]
13:03 <qchris@deploy1001> Started deploy [gerrit/gerrit@460e439]: Gerrit to v3.2.2-97-gcaf5020db1 on gerrit1002 (gerrit-test) [production]
2020-06-26

18:42 <robh> all ulsfo onsite work completed as of 30 minutes ago [production]
17:52 <robh> msw2-ulsfo work done, all mgmt items confirmed back online and icinga alerts cleared, moving onto msw1-ulsfo (rack 22) and will lose all mgmt in that rack for next 10-20 minutes T256300 [production]
17:11 <robh> msw work in ulsfo via T256300 [production]
10:24 <ema> pool 5006 T256449 [production]
10:22 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1085', diff saved to https://phabricator.wikimedia.org/P11677 and previous config saved to /var/cache/conftool/dbconfig/20200626-102248-marostegui.json [production]
10:22 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1093', diff saved to https://phabricator.wikimedia.org/P11676 and previous config saved to /var/cache/conftool/dbconfig/20200626-102201-marostegui.json [production]
10:03 <ema> cp2039: restart purged T256444 [production]