production SAL

7951-8000 of 10000 results (47ms)

2020-07-01 §
10:14	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
10:14	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
10:09	<jayme>	draining and docker restart (one at a time) kubernetes[2001-2004].codfw.wmnet	[production]
09:52	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:52	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:46	<jayme>	cordoning kubernetes[2001-2004].codfw.wmnet,kubernetes[1001-1004].eqiad.wmnet - T256786	[production]
09:42	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:42	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:34	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:34	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:23	<jayme>	restarting dockerd on kubestage1002.eqiad.wmnet - T256786	[production]
09:15	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:15	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
09:08	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
09:08	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:53	<jayme>	draining kubernetes staging node kubestage1001.eqiad.wmnet - T256786	[production]
08:52	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
08:52	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:44	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
08:44	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:29	<XioNoX>	disable BGP to nfacct in eqiad - T256790	[production]
08:23	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
08:23	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:08	<jayme@deploy1001>	helmfile [STAGING] Ran 'sync' command on namespace 'mobileapps' for release 'staging' .	[production]
08:05	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
08:05	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
08:01	<vgutierrez>	rolling restart of esams cache nodes to catch up on kernel upgrades	[production]
07:42	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
07:42	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
07:40	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
07:40	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
07:39	<ema>	cp2041: restart purged, varnishkafka after librdkafka1 upgrade to 0.11.6-1.1wmf1 T256444	[production]
05:47	<_joe_>	restarting nfacctd on netflow1001, it's segfaulting	[production]
04:01	<krinkle@deploy1001>	Synchronized php-1.35.0-wmf.39/maintenance/findBadBlobs.php: I47c11190b665 (duration: 01m 08s)	[production]
00:14	<krinkle@deploy1001>	Synchronized private/PrivateSettings.php: T254795 - Set $wmgXhguiDBuser and $wmgXhguiDBpasswor (duration: 01m 06s)	[production]
2020-06-30 §
21:48	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0)	[production]
21:46	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single	[production]
21:45	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0)	[production]
21:43	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single	[production]
21:42	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0)	[production]
21:40	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single	[production]
21:40	<crusnov@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0)	[production]
21:38	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single	[production]
21:38	<crusnov@cumin1001>	END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99)	[production]
21:38	<crusnov@cumin1001>	START - Cookbook sre.hosts.reboot-single	[production]
19:19	<hashar@deploy1001>	rebuilt and synchronized wikiversions files: group 0 wikis to 1.35.0-wmf.39 # T254176	[production]
18:31	<cdanis>	T256790 ✔️ cdanis@netflow2001.codfw.wmnet ~ 🕝☕ sudo apt install valgrind	[production]
18:27	<tgr>	Morning deploys done	[production]
18:23	<tgr@deploy1001>	Synchronized php-1.35.0-wmf.39/extensions/ElectronPdfService/src/ElectronPdfServiceHooks.php: Backport: [[gerrit:608485\|Hotfix: "Undefined index: print" (T256761)]] (duration: 01m 05s)	[production]
18:11	<shdubsh>	restart varnishmtail,atsmtail,ncredirmtail on ncredir,cp hosts in codfw and eqsin	[production]