production SAL

2024-10-14
19:29	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
19:29	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance	[production]
19:29	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1156.eqiad.wmnet with reason: Maintenance	[production]
19:29	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1156.eqiad.wmnet with reason: Maintenance	[production]
18:57	<aqu@deploy2002>	Finished deploy [airflow-dags/analytics@a1a70ce]: Deploy last version for Refine staging [airflow-dags@a1a70ce8] (duration: 00m 29s)	[production]
18:57	<aqu@deploy2002>	Started deploy [airflow-dags/analytics@a1a70ce]: Deploy last version for Refine staging [airflow-dags@a1a70ce8]	[production]
18:52	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance	[production]
18:52	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance	[production]
18:52	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1231 (T376905)', diff saved to https://phabricator.wikimedia.org/P69825 and previous config saved to /var/cache/conftool/dbconfig/20241014-185225-ladsgroup.json	[production]
18:47	<aqu@deploy2002>	Finished deploy [airflow-dags/analytics_test@a1a70ce]: Deploy last fixes on Refine staging [airflow-dags@a1a70ce8] (duration: 00m 13s)	[production]
18:47	<aqu@deploy2002>	Started deploy [airflow-dags/analytics_test@a1a70ce]: Deploy last fixes on Refine staging [airflow-dags@a1a70ce8]	[production]
18:37	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P69824 and previous config saved to /var/cache/conftool/dbconfig/20241014-183718-ladsgroup.json	[production]
18:22	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1231', diff saved to https://phabricator.wikimedia.org/P69823 and previous config saved to /var/cache/conftool/dbconfig/20241014-182211-ladsgroup.json	[production]
18:07	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1231 (T376905)', diff saved to https://phabricator.wikimedia.org/P69822 and previous config saved to /var/cache/conftool/dbconfig/20241014-180704-ladsgroup.json	[production]
17:06	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1231 (T376905)', diff saved to https://phabricator.wikimedia.org/P69821 and previous config saved to /var/cache/conftool/dbconfig/20241014-170647-ladsgroup.json	[production]
17:06	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1231.eqiad.wmnet with reason: Maintenance	[production]
17:06	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1231.eqiad.wmnet with reason: Maintenance	[production]
17:01	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance	[production]
17:01	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance	[production]
17:01	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69820 and previous config saved to /var/cache/conftool/dbconfig/20241014-170123-ladsgroup.json	[production]
16:51	<fnegri@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on cloudvirt1063.eqiad.wmnet with reason: cloudvirt1063 needs maintenance T375223	[production]
16:50	<fnegri@cumin1002>	START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on cloudvirt1063.eqiad.wmnet with reason: cloudvirt1063 needs maintenance T375223	[production]
16:46	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P69819 and previous config saved to /var/cache/conftool/dbconfig/20241014-164616-ladsgroup.json	[production]
16:31	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P69818 and previous config saved to /var/cache/conftool/dbconfig/20241014-163109-ladsgroup.json	[production]
16:16	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69817 and previous config saved to /var/cache/conftool/dbconfig/20241014-161602-ladsgroup.json	[production]
16:03	<sergi0>	Running `sgimeno@mwmaint2002:~$ foreachwiki userOptions.php --delete --old=1 growthexperiments-tour-newimpact-discovery` (T376461)	[production]
15:52	<aikochou@deploy2002>	helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revision-models' for release 'main' .	[production]
15:46	<aikochou@deploy2002>	helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' .	[production]
15:16	<isaranto@deploy2002>	helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' .	[production]
15:15	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1187 (T376905)', diff saved to https://phabricator.wikimedia.org/P69816 and previous config saved to /var/cache/conftool/dbconfig/20241014-151546-ladsgroup.json	[production]
15:15	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance	[production]
15:15	<isaranto@deploy2002>	helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' .	[production]
15:15	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance	[production]
15:15	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69815 and previous config saved to /var/cache/conftool/dbconfig/20241014-151521-ladsgroup.json	[production]
15:07	<elukey@deploy2002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'sync'.	[production]
15:06	<elukey@deploy2002>	helmfile [aux-k8s-eqiad] START helmfile.d/admin 'sync'.	[production]
15:05	<isaranto@deploy2002>	helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' .	[production]
15:00	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P69814 and previous config saved to /var/cache/conftool/dbconfig/20241014-150014-ladsgroup.json	[production]
14:45	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P69813 and previous config saved to /var/cache/conftool/dbconfig/20241014-144507-ladsgroup.json	[production]
14:43	<aikochou@deploy2002>	helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revision-models' for release 'main' .	[production]
14:43	<jayme@deploy1003>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:41	<jayme@deploy1003>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
14:41	<jayme@deploy1003>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:39	<jayme@deploy1003>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:30	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1180 (T376905)', diff saved to https://phabricator.wikimedia.org/P69812 and previous config saved to /var/cache/conftool/dbconfig/20241014-143000-ladsgroup.json	[production]
14:16	<stevemunene@cumin1002>	END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts an-worker1177.eqiad.wmnet	[production]
14:16	<stevemunene@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
14:16	<stevemunene@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1177.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002"	[production]
14:16	<stevemunene@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-worker1177.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - stevemunene@cumin1002"	[production]
14:12	<stevemunene@cumin1002>	START - Cookbook sre.dns.netbox	[production]