production SAL

2601-2650 of 10000 results (38ms)

2022-05-05 §
18:53	<herron@cumin1001>	END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka main-eqiad cluster: Reboot kafka nodes	[production]
18:53	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
18:51	<ladsgroup@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:789562\|Set cebwiki to read new in templatelinks migration (T306673)]] (duration: 00m 49s)	[production]
18:51	<mutante>	contitn1001 - apt-get remove --purge docker.io after docker-ce was installed by puppet for T300682 (different behaviour from contint2001 since it did not have /var/lib/docker)	[production]
18:47	<razzi@cumin1001>	END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka main-codfw cluster: Reboot kafka nodes	[production]
18:42	<mutante>	contitn2001 - apt-get remove --purge docker.io after docker-ce was installed by puppet for T300682	[production]
18:38	<robh@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host dumpsdata1006.eqiad.wmnet with OS bullseye	[production]
18:38	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
18:37	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
18:37	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
18:36	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
18:34	<ladsgroup@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:789558\|Stop writing to temp actor table in group0 (T275246)]] (duration: 00m 50s)	[production]
18:27	<mutante>	contint2001 - deleting /etc/apt/sources.list.d/repository_jenkins-thirdparty-ci.list is identical to thirdparty-ci.list . deleting the former to avoid duplicate definition warnings	[production]
18:18	<robh@cumin1001>	START - Cookbook sre.hosts.reimage for host dumpsdata1006.eqiad.wmnet with OS bullseye	[production]
18:13	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance	[production]
18:13	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance	[production]
18:13	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1157 (T307525)', diff saved to https://phabricator.wikimedia.org/P27756 and previous config saved to /var/cache/conftool/dbconfig/20220505-181314-ladsgroup.json	[production]
18:05	<mutante>	contint1001 - disabled puppet	[production]
17:58	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P27755 and previous config saved to /var/cache/conftool/dbconfig/20220505-175809-ladsgroup.json	[production]
17:43	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P27754 and previous config saved to /var/cache/conftool/dbconfig/20220505-174304-ladsgroup.json	[production]
17:36	<mutante>	phab1001 - apt-get remove subversion	[production]
17:27	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1157 (T307525)', diff saved to https://phabricator.wikimedia.org/P27753 and previous config saved to /var/cache/conftool/dbconfig/20220505-172758-ladsgroup.json	[production]
17:20	<mutante>	phabricator - believe it or not - disabling the last active SUBVERSION repository in Diffusion (https://phabricator.wikimedia.org/diffusion/TSVN)	[production]
17:11	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1157 (T307525)', diff saved to https://phabricator.wikimedia.org/P27752 and previous config saved to /var/cache/conftool/dbconfig/20220505-171140-ladsgroup.json	[production]
17:11	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1157.eqiad.wmnet with reason: Maintenance	[production]
17:11	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1157.eqiad.wmnet with reason: Maintenance	[production]
17:11	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1112 (T307525)', diff saved to https://phabricator.wikimedia.org/P27751 and previous config saved to /var/cache/conftool/dbconfig/20220505-171132-ladsgroup.json	[production]
16:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1112', diff saved to https://phabricator.wikimedia.org/P27750 and previous config saved to /var/cache/conftool/dbconfig/20220505-165627-ladsgroup.json	[production]
16:54	<herron@cumin1001>	START - Cookbook sre.kafka.reboot-workers for Kafka main-eqiad cluster: Reboot kafka nodes	[production]
16:47	<ebysans@deploy1002>	Finished deploy [airflow-dags/analytics@ebbdbb6]: (no justification provided) (duration: 00m 09s)	[production]
16:47	<ebysans@deploy1002>	Started deploy [airflow-dags/analytics@ebbdbb6]: (no justification provided)	[production]
16:41	<razzi@cumin1001>	START - Cookbook sre.kafka.reboot-workers for Kafka main-codfw cluster: Reboot kafka nodes	[production]
16:41	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1112', diff saved to https://phabricator.wikimedia.org/P27749 and previous config saved to /var/cache/conftool/dbconfig/20220505-164122-ladsgroup.json	[production]
16:38	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
16:35	<cmjohnson@cumin1001>	START - Cookbook sre.dns.netbox	[production]
16:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1112 (T307525)', diff saved to https://phabricator.wikimedia.org/P27748 and previous config saved to /var/cache/conftool/dbconfig/20220505-162617-ladsgroup.json	[production]
16:15	<akosiaris>	T307671 depool maps1007 from traffic per suggestion.	[production]
16:14	<akosiaris@cumin1001>	conftool action : set/pooled=no; selector: name=maps1007.eqiad.wmnet	[production]
16:12	<jelto@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab-runner2004.codfw.wmnet	[production]
16:07	<razzi@cumin1001>	END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) for AQS aqs cluster: Roll restart of all AQS's nodejs daemons.	[production]
16:05	<jelto@cumin1001>	START - Cookbook sre.hosts.reboot-single for host gitlab-runner2004.codfw.wmnet	[production]
16:04	<razzi@cumin1001>	START - Cookbook sre.aqs.roll-restart for AQS aqs cluster: Roll restart of all AQS's nodejs daemons.	[production]
16:03	<jelto@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab-runner2003.codfw.wmnet	[production]
16:00	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2008.codfw.wmnet	[production]
15:56	<jelto@cumin1001>	START - Cookbook sre.hosts.reboot-single for host gitlab-runner2003.codfw.wmnet	[production]
15:55	<jelto@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gitlab-runner2002.codfw.wmnet	[production]
15:52	<klausman@cumin1001>	START - Cookbook sre.hosts.reboot-single for host ml-serve2008.codfw.wmnet	[production]
15:52	<herron@cumin1001>	END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka main-codfw cluster: Reboot kafka nodes	[production]
15:50	<klausman@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2007.codfw.wmnet	[production]
15:48	<jelto@cumin1001>	START - Cookbook sre.hosts.reboot-single for host gitlab-runner2002.codfw.wmnet	[production]