production SAL

101-150 of 10000 results (44ms)

2022-02-28 §
15:25	<milimetric@deploy1002>	Finished deploy [analytics/refinery@84a0770] (thin): Add a few wikis to the sqoop list (duration: 00m 08s)	[production]
15:25	<milimetric@deploy1002>	Started deploy [analytics/refinery@84a0770] (thin): Add a few wikis to the sqoop list	[production]
15:23	<milimetric@deploy1002>	Finished deploy [analytics/refinery@84a0770]: Add a few wikis to the sqoop list (duration: 21m 18s)	[production]
15:18	<elukey@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=1) for host kubernetes2020.codfw.wmnet with OS bullseye	[production]
15:07	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2020.codfw.wmnet with reason: host reimage	[production]
15:06	<ntsako@deploy1002>	Finished deploy [airflow-dags/analytics@0a2ffb8]: (no justification provided) (duration: 00m 07s)	[production]
15:06	<ntsako@deploy1002>	Started deploy [airflow-dags/analytics@0a2ffb8]: (no justification provided)	[production]
15:04	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2020.codfw.wmnet with reason: host reimage	[production]
15:02	<krinkle@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: I616f56388eee9df21e (duration: 00m 49s)	[production]
15:02	<milimetric@deploy1002>	Started deploy [analytics/refinery@84a0770]: Add a few wikis to the sqoop list	[production]
14:53	<cmooney@cumin1001>	END (FAIL) - Cookbook sre.dns.netbox (exit_code=99)	[production]
14:50	<elukey@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes2020.codfw.wmnet with OS bullseye	[production]
14:48	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2019.codfw.wmnet with OS bullseye	[production]
14:44	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
14:43	<klausman@cumin2001>	END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host ml-etcd-staging2001.codfw.wmnet	[production]
14:37	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage	[production]
14:35	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2019.codfw.wmnet with reason: host reimage	[production]
14:33	<klausman@cumin2001>	START - Cookbook sre.ganeti.makevm for new host ml-etcd-staging2001.codfw.wmnet	[production]
14:20	<elukey@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes2019.codfw.wmnet with OS bullseye	[production]
14:18	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes2018.codfw.wmnet with OS bullseye	[production]
14:09	<kharlan@deploy1002>	helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply	[production]
14:09	<kharlan@deploy1002>	helmfile [staging] START helmfile.d/services/linkrecommendation: apply	[production]
14:07	<elukey@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage	[production]
14:05	<elukey@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes2018.codfw.wmnet with reason: host reimage	[production]
14:03	<jelto>	update gitlab-ce to 14.7.4 on all GitLab hosts	[production]
14:00	<ebysans@deploy1002>	Finished deploy [airflow-dags/analytics@75e8eb7]: (no justification provided) (duration: 00m 14s)	[production]
14:00	<kharlan@deploy1002>	helmfile [staging] START helmfile.d/services/linkrecommendation: apply	[production]
14:00	<ebysans@deploy1002>	Started deploy [airflow-dags/analytics@75e8eb7]: (no justification provided)	[production]
13:51	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21600 and previous config saved to /var/cache/conftool/dbconfig/20220228-135158-ladsgroup.json	[production]
13:50	<elukey@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes2018.codfw.wmnet with OS bullseye	[production]
13:36	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111', diff saved to https://phabricator.wikimedia.org/P21599 and previous config saved to /var/cache/conftool/dbconfig/20220228-133653-ladsgroup.json	[production]
13:21	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111', diff saved to https://phabricator.wikimedia.org/P21598 and previous config saved to /var/cache/conftool/dbconfig/20220228-132148-ladsgroup.json	[production]
13:14	<moritzm>	restarting apache on puppet masters to pick up expat security update	[production]
13:06	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21597 and previous config saved to /var/cache/conftool/dbconfig/20220228-130644-ladsgroup.json	[production]
13:01	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1111.eqiad.wmnet with OS bullseye	[production]
12:46	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1111.eqiad.wmnet with reason: host reimage	[production]
12:44	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Your commit message', diff saved to https://phabricator.wikimedia.org/P21596 and previous config saved to /var/cache/conftool/dbconfig/20220228-124454-ladsgroup.json	[production]
12:44	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1111.eqiad.wmnet with reason: host reimage	[production]
12:35	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.reimage for host db1111.eqiad.wmnet with OS bullseye	[production]
12:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1111 (T302185)', diff saved to https://phabricator.wikimedia.org/P21594 and previous config saved to /var/cache/conftool/dbconfig/20220228-123008-ladsgroup.json	[production]
12:30	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1111.eqiad.wmnet with reason: Maintenance	[production]
12:30	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1111.eqiad.wmnet with reason: Maintenance	[production]
12:25	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5011.eqsin.wmnet with OS buster	[production]
12:24	<vgutierrez>	pool cp5011 running HAProxy as TLS termination layer - T290005 T271421	[production]
12:22	<vgutierrez>	vgutierrez@apt1001:~$ sudo -i reprepro --component thirdparty/haproxy24 update buster-wikimedia - T290005	[production]
12:20	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175 (T300992)', diff saved to https://phabricator.wikimedia.org/P21593 and previous config saved to /var/cache/conftool/dbconfig/20220228-122039-ladsgroup.json	[production]
12:05	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P21592 and previous config saved to /var/cache/conftool/dbconfig/20220228-120535-ladsgroup.json	[production]
11:58	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5011.eqsin.wmnet with reason: host reimage	[production]
11:55	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp5011.eqsin.wmnet with reason: host reimage	[production]
11:50	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1175', diff saved to https://phabricator.wikimedia.org/P21591 and previous config saved to /var/cache/conftool/dbconfig/20220228-115030-ladsgroup.json	[production]