production SAL

3051-3100 of 10000 results (48ms)

2022-02-25 §
11:13	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp4025.ulsfo.wmnet with reason: host reimage	[production]
11:10	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp4025.ulsfo.wmnet with reason: host reimage	[production]
11:04	<moritzm>	added ganeti2029 to codfw Ganeti cluster T298998	[production]
10:54	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.reimage for host cp4025.ulsfo.wmnet with OS buster	[production]
10:43	<jmm@cumin2002>	START - Cookbook sre.ganeti.addnode for new host ganeti2029.codfw.wmnet to ganeti01.svc.codfw.wmnet	[production]
10:42	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2029.codfw.wmnet	[production]
10:41	<moritzm>	enabled virtualisation in BIOS for ganeti2029 T298998	[production]
10:33	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti2029.codfw.wmnet	[production]
10:27	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti2029.codfw.wmnet with reason: Enable virtualisation in BIOS	[production]
10:27	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti2029.codfw.wmnet with reason: Enable virtualisation in BIOS	[production]
10:22	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti2029.codfw.wmnet to ganeti01.svc.codfw.wmnet	[production]
10:22	<jmm@cumin2002>	START - Cookbook sre.ganeti.addnode for new host ganeti2029.codfw.wmnet to ganeti01.svc.codfw.wmnet	[production]
10:17	<vgutierrez>	rolling upgrade to HAProxy 2.4.13 on HAProxy cache nodes - T290005	[production]
09:34	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2029.codfw.wmnet	[production]
09:28	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti2029.codfw.wmnet	[production]
02:43	<cstone>	Donation Interface revision changed from a6a9b63e to 4638c0ec	[production]
2022-02-24 §
23:35	<ryankemper>	T302526 Deployed https://gerrit.wikimedia.org/r/765652 and ran puppet across wcqs*	[production]
22:06	<mutante>	static-bugzilla.wikimedia.org - kubernetes - deployed gerrit:765572 - first prod service behind a k8s ingress (T290966)	[production]
22:05	<mutante>	phabricator - disabled git repo - labs-tools-harvesting-data-refinery/repository/master/	[production]
21:50	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2086.codfw.wmnet with OS bullseye	[production]
21:45	<brennen>	end of UTC late backport & config window	[production]
21:43	<dancy@deploy1002>	Started scap: testing scap container image building	[production]
21:43	<tzatziki>	removing 1 file for legal compliance	[production]
21:42	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2085.codfw.wmnet with OS bullseye	[production]
21:41	<mutante>	phabricator - disabled git repo "frig" - outdated fundraising stuff, checked with fr-tech, not needed T296022	[production]
21:40	<brennen@deploy1002>	Synchronized php-1.38.0-wmf.23/includes: Backport: [[gerrit:765626\|Revert "Revert "Revert "Show message fallback keys when using &uselang=qqx"""]] (duration: 00m 57s)	[production]
21:39	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2086.codfw.wmnet with reason: host reimage	[production]
21:36	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2086.codfw.wmnet with reason: host reimage	[production]
21:34	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2085.codfw.wmnet with reason: host reimage	[production]
21:30	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2085.codfw.wmnet with reason: host reimage	[production]
21:29	<brennen@deploy1002>	Synchronized wmf-config/CirrusSearch-production.php: Config: [[gerrit:765577\|cirrus: Reduce write isolation to only cloudelastic (T295705)]] (duration: 00m 55s)	[production]
21:27	<mutante>	phabricator - disabling git repo rGEDS (Elasticdash) - only one commit from 2015 - T296022	[production]
21:19	<tzatziki>	removing 1 file for legal compliance	[production]
21:19	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2086.codfw.wmnet with OS bullseye	[production]
21:18	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2083.codfw.wmnet with OS bullseye	[production]
21:13	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2085.codfw.wmnet with OS bullseye	[production]
21:11	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2084.codfw.wmnet with OS bullseye	[production]
21:07	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2083.codfw.wmnet with reason: host reimage	[production]
21:05	<tzatziki>	removing 4 files for legal compilance	[production]
21:04	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2083.codfw.wmnet with reason: host reimage	[production]
21:02	<taavi@deploy1002>	Finished deploy [horizon/deploy@9d02cd6]: (no justification provided) (duration: 03m 18s)	[production]
21:01	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2084.codfw.wmnet with reason: host reimage	[production]
20:59	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2083.codfw.wmnet with OS bullseye	[production]
20:58	<taavi@deploy1002>	Started deploy [horizon/deploy@9d02cd6]: (no justification provided)	[production]
20:58	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2084.codfw.wmnet with reason: host reimage	[production]
20:51	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2084.codfw.wmnet with OS bullseye	[production]
20:14	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2084.codfw.wmnet with OS bullseye	[production]
20:10	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2083.codfw.wmnet with OS bullseye	[production]
20:04	<ryankemper>	T302526 `ryankemper@cumin1001:~$ sudo -E cumin -b 3 'wcqs*' 'enable-puppet "query_service: Simply jvm arg handling - T302526"; sudo run-puppet-agent'` in tmux `wcqs`	[production]
20:02	<ryankemper>	T302526 Depooled `wcqs1001`, ran puppet agent, and restarted `wcqs-blazegraph`. Service came up healthy, proceeding to rest of wcqs fleet	[production]