production SAL

1301-1350 of 10000 results (25ms)

2020-05-19 §
13:09	<jayme>	updated helm: 2.16.7-1 -> 2.16.7-2 on deploy[1,2]001 and contint[1,2]001	[production]
13:09	<elukey@cumin1001>	START - Cookbook sre.ganeti.makevm	[production]
13:03	<kormat@cumin1001>	dbctl commit (dc=all): 'Pool db2136 into s4 T252985', diff saved to https://phabricator.wikimedia.org/P11233 and previous config saved to /var/cache/conftool/dbconfig/20200519-130313-kormat.json	[production]
12:40	<ariel@deploy1001>	Finished deploy [dumps/dumps@a329605]: make page content fixup script move inprog files into place if good (duration: 00m 04s)	[production]
12:40	<ariel@deploy1001>	Started deploy [dumps/dumps@a329605]: make page content fixup script move inprog files into place if good	[production]
12:37	<jayme>	imported helm 2.16.7-2 to main for buster-wikimedia, stretch-wikimedia, jessie-wikimedia	[production]
12:17	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0)	[production]
11:51	<jynus>	starting backups of es1, es2, es3 on eqiad into backup1002	[production]
11:41	<jynus@cumin1001>	dbctl commit (dc=all): 'Depool es1018, es1015, es1019', diff saved to https://phabricator.wikimedia.org/P11232 and previous config saved to /var/cache/conftool/dbconfig/20200519-114148-jynus.json	[production]
11:12	<marostegui>	Deploy schema change on db2124 (frwiki, jawiki, ruwiki) T238966	[production]
10:34	<mutante>	releases2001 - restarted failed jenkins	[production]
10:33	<mutante>	releases2001 - Failed to restart jenkins.service: The name org.freedesktop.PolicyKit1 was not provided by any .service files	[production]
10:32	<volans>	flushed all Netbox caches (manage.py invalidate all) - T253091	[production]
10:29	<volans>	start Netbox restore - T253091	[production]
10:18	<jayme@deploy1001>	helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' .	[production]
10:13	<akosiaris>	upgrade etherpad-lite to 1.8.4 on etherpad1002	[production]
09:58	<hnowlan>	roll-restart of eqiad restbase hosts for java security updates	[production]
09:58	<hnowlan@cumin1001>	START - Cookbook sre.cassandra.roll-restart	[production]
09:55	<jayme@deploy1001>	helmfile [EQIAD] Ran 'sync' command on namespace 'mathoid' for release 'production' .	[production]
09:55	<jayme@deploy1001>	helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'canary' .	[production]
09:55	<jayme@deploy1001>	helmfile [CODFW] Ran 'sync' command on namespace 'mathoid' for release 'production' .	[production]
09:54	<jayme@deploy1001>	helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' .	[production]
09:10	<godog>	eqiad-prod: decom ms-be101[678] - T252008	[production]
08:07	<XioNoX>	Push 596597: BGP: standardize fixed part of IX4/IX6 groups - eqsin	[production]
08:04	<XioNoX>	Push 596597: BGP: standardize fixed part of IX4/IX6 groups - esams	[production]
08:01	<XioNoX>	Push 596597: BGP: standardize fixed part of IX4/IX6 groups - eqiad	[production]
07:55	<volker-e@deploy1001>	Finished deploy [design/style-guide@37c67dd]: Deploy design/style-guide: (duration: 00m 06s)	[production]
07:54	<volker-e@deploy1001>	Started deploy [design/style-guide@37c67dd]: Deploy design/style-guide:	[production]
07:52	<XioNoX>	Push 596597: BGP: standardize fixed part of IX4/IX6 groups - *dfw	[production]
07:49	<XioNoX>	Push 596597: BGP: standardize fixed part of IX4/IX6 groups - ulsfo	[production]
07:45	<vgutierrez>	rolling upgrade to trafficserver 8.0.7-1wm10 with puppet disabled on cp hosts	[production]
07:09	<jynus>	starting es4 & es5 eqiad backups with low concurrency	[production]
06:35	<elukey@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0)	[production]
06:29	<elukey@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper	[production]
06:24	<elukey@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0)	[production]
06:17	<elukey@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper	[production]
05:57	<volker-e@deploy1001>	Finished deploy [design/style-guide@7bfbd2a]: Deploy design/style-guide: (duration: 00m 06s)	[production]
05:57	<volker-e@deploy1001>	Started deploy [design/style-guide@7bfbd2a]: Deploy design/style-guide:	[production]
05:03	<marostegui@cumin1001>	dbctl commit (dc=all): 'Set s2 and s8 as read-only=off for maintenance T251981', diff saved to https://phabricator.wikimedia.org/P11227 and previous config saved to /var/cache/conftool/dbconfig/20200519-050346-marostegui.json	[production]
05:00	<marostegui@cumin1001>	dbctl commit (dc=all): 'Set s2 and s8 as read-only for maintenance T251981', diff saved to https://phabricator.wikimedia.org/P11226 and previous config saved to /var/cache/conftool/dbconfig/20200519-050043-marostegui.json	[production]
04:27	<marostegui>	Repool labsdb1011 T249188	[production]
03:29	<volker-e@deploy1001>	Finished deploy [design/style-guide@4b4bc51]: Deploy design/style-guide: (duration: 00m 07s)	[production]
03:28	<volker-e@deploy1001>	Started deploy [design/style-guide@4b4bc51]: Deploy design/style-guide:	[production]
2020-05-18 §
23:50	<pt1979@cumin2001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
23:47	<pt1979@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
23:25	<pt1979@cumin2001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99)	[production]
23:23	<pt1979@cumin2001>	START - Cookbook sre.hosts.downtime	[production]
23:12	<ryankemper>	Restarted `wdqs-updater` across all wdqs nodes and restarted `wdqs-categories` across all nodes except 1010 (test wdqs server) and 1009 (automated deployment server)	[production]
22:55	<Krinkle>	Clear module_deps on dewiki (group2, old mw version, s5) to monitor regeneration	[production]
22:48	<Krinkle>	Clear module_deps on group0 (mostly s3) to monitor regeneration	[production]