production SAL

1551-1600 of 10000 results (51ms)

2022-02-28 §
14:48	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2019.codfw.wmnet with OS bullseye	[production]
14:44	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
14:43	<klausman@cumin2001>	END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host ml-etcd-staging2001.codfw.wmnet	[production]
14:37	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage	[production]
14:35	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage	[production]
14:33	<klausman@cumin2001>	START - Cookbook sre.ganeti.makevm for new host ml-etcd-staging2001.codfw.wmnet	[production]
14:20	<elukey@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes2019.codfw.wmnet with OS bullseye	[production]
14:18	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2018.codfw.wmnet with OS bullseye	[production]
14:09	<kharlan@deploy1002>	helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply	[production]
14:09	<kharlan@deploy1002>	helmfile [staging] START helmfile.d/services/linkrecommendation: apply	[production]
14:07	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage	[production]
14:05	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage	[production]
14:03	<jelto>	update gitlab-ce to 14.7.4 on all GitLab hosts	[production]
14:00	<ebysans@deploy1002>	Finished deploy [airflow-dags/analytics@75e8eb7]: (no justification provided) (duration: 00m 14s)	[production]
14:00	<kharlan@deploy1002>	helmfile [staging] START helmfile.d/services/linkrecommendation: apply	[production]
14:00	<ebysans@deploy1002>	Started deploy [airflow-dags/analytics@75e8eb7]: (no justification provided)	[production]
13:51	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21600 and previous config saved to /var/cache/conftool/dbconfig/20220228-135158-ladsgroup.json	[production]
13:50	<elukey@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes2018.codfw.wmnet with OS bullseye	[production]
13:36	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111', diff saved to https://phabricator.wikimedia.org/P21599 and previous config saved to /var/cache/conftool/dbconfig/20220228-133653-ladsgroup.json	[production]
13:21	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111', diff saved to https://phabricator.wikimedia.org/P21598 and previous config saved to /var/cache/conftool/dbconfig/20220228-132148-ladsgroup.json	[production]
13:14	<moritzm>	restarting apache on puppet masters to pick up expat security update	[production]
13:06	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21597 and previous config saved to /var/cache/conftool/dbconfig/20220228-130644-ladsgroup.json	[production]
13:01	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1111.eqiad.wmnet with OS bullseye	[production]
12:46	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1111.eqiad.wmnet with reason: host reimage	[production]
12:44	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Your commit message', diff saved to https://phabricator.wikimedia.org/P21596 and previous config saved to /var/cache/conftool/dbconfig/20220228-124454-ladsgroup.json	[production]
12:44	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1111.eqiad.wmnet with reason: host reimage	[production]
12:35	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.reimage for host db1111.eqiad.wmnet with OS bullseye	[production]
12:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21594 and previous config saved to /var/cache/conftool/dbconfig/20220228-123008-ladsgroup.json	[production]
12:30	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1111.eqiad.wmnet with reason: Maintenance	[production]
12:30	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1111.eqiad.wmnet with reason: Maintenance	[production]
12:25	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5011.eqsin.wmnet with OS buster	[production]
12:24	<vgutierrez>	pool cp5011 running HAProxy as TLS termination layer - T290005 T271421	[production]
12:22	<vgutierrez>	vgutierrez@apt1001:~$ sudo -i reprepro --component thirdparty/haproxy24 update buster-wikimedia - T290005	[production]
12:20	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21593 and previous config saved to /var/cache/conftool/dbconfig/20220228-122039-ladsgroup.json	[production]
12:05	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P21592 and previous config saved to /var/cache/conftool/dbconfig/20220228-120535-ladsgroup.json	[production]
11:58	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5011.eqsin.wmnet with reason: host reimage	[production]
11:55	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp5011.eqsin.wmnet with reason: host reimage	[production]
11:50	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P21591 and previous config saved to /var/cache/conftool/dbconfig/20220228-115030-ladsgroup.json	[production]
11:42	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1114 (T302185)', diff saved to https://phabricator.wikimedia.org/P21590 and previous config saved to /var/cache/conftool/dbconfig/20220228-114230-ladsgroup.json	[production]
11:35	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21589 and previous config saved to /var/cache/conftool/dbconfig/20220228-113525-ladsgroup.json	[production]
11:29	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.reimage for host cp5011.eqsin.wmnet with OS buster	[production]
11:27	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1114', diff saved to https://phabricator.wikimedia.org/P21588 and previous config saved to /var/cache/conftool/dbconfig/20220228-112726-ladsgroup.json	[production]
11:17	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21587 and previous config saved to /var/cache/conftool/dbconfig/20220228-111700-ladsgroup.json	[production]
11:17	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1175.eqiad.wmnet with reason: Maintenance	[production]
11:16	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1175.eqiad.wmnet with reason: Maintenance	[production]
11:12	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1088.eqiad.wmnet with OS buster	[production]
11:12	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1114', diff saved to https://phabricator.wikimedia.org/P21586 and previous config saved to /var/cache/conftool/dbconfig/20220228-111221-ladsgroup.json	[production]
11:09	<vgutierrez>	pool cp1088 running HAProxy as TLS termination layer - T290005 T271421	[production]
10:57	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1114 (T302185)', diff saved to https://phabricator.wikimedia.org/P21585 and previous config saved to /var/cache/conftool/dbconfig/20220228-105716-ladsgroup.json	[production]
10:54	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance	[production]