production SAL

4601-4650 of 10000 results (83ms)

2022-12-07 §
17:33	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1194.eqiad.wmnet with reason: Maintenance	[production]
17:33	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1194.eqiad.wmnet with reason: Maintenance	[production]
17:33	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181 (T322618)', diff saved to https://phabricator.wikimedia.org/P42472 and previous config saved to /var/cache/conftool/dbconfig/20221207-173329-ladsgroup.json	[production]
17:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1098:3316', diff saved to https://phabricator.wikimedia.org/P42471 and previous config saved to /var/cache/conftool/dbconfig/20221207-173057-ladsgroup.json	[production]
17:30	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2124', diff saved to https://phabricator.wikimedia.org/P42470 and previous config saved to /var/cache/conftool/dbconfig/20221207-173045-ladsgroup.json	[production]
17:27	<cmjohnson@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmjohnson@cumin1001"	[production]
17:26	<cmjohnson@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cmjohnson@cumin1001"	[production]
17:26	<cwhite@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host logstash1026.eqiad.wmnet with OS bullseye	[production]
17:25	<sukhe>	running homer for Gerrit: 865712	[production]
17:18	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P42469 and previous config saved to /var/cache/conftool/dbconfig/20221207-171822-ladsgroup.json	[production]
17:18	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2173 (T322618)', diff saved to https://phabricator.wikimedia.org/P42468 and previous config saved to /var/cache/conftool/dbconfig/20221207-171803-ladsgroup.json	[production]
17:17	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts lvs5002.eqsin.wmnet	[production]
17:17	<sukhe@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
17:17	<sukhe@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: lvs5002.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002"	[production]
17:15	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1098:3316 (T322618)', diff saved to https://phabricator.wikimedia.org/P42467 and previous config saved to /var/cache/conftool/dbconfig/20221207-171551-ladsgroup.json	[production]
17:15	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2124 (T322618)', diff saved to https://phabricator.wikimedia.org/P42466 and previous config saved to /var/cache/conftool/dbconfig/20221207-171538-ladsgroup.json	[production]
17:15	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1023.eqiad.wmnet with reason: host reimage	[production]
17:14	<sukhe@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: lvs5002.eqsin.wmnet decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002"	[production]
17:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Set db2103 with weight 0 T324692', diff saved to https://phabricator.wikimedia.org/P42465 and previous config saved to /var/cache/conftool/dbconfig/20221207-171416-ladsgroup.json	[production]
17:13	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1098:3316 (T322618)', diff saved to https://phabricator.wikimedia.org/P42464 and previous config saved to /var/cache/conftool/dbconfig/20221207-171342-ladsgroup.json	[production]
17:13	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1098.eqiad.wmnet with reason: Maintenance	[production]
17:13	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2124 (T322618)', diff saved to https://phabricator.wikimedia.org/P42463 and previous config saved to /var/cache/conftool/dbconfig/20221207-171326-ladsgroup.json	[production]
17:13	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1098.eqiad.wmnet with reason: Maintenance	[production]
17:13	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1096:3316 (T322618)', diff saved to https://phabricator.wikimedia.org/P42462 and previous config saved to /var/cache/conftool/dbconfig/20221207-171321-ladsgroup.json	[production]
17:13	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2124.codfw.wmnet with reason: Maintenance	[production]
17:13	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db2124.codfw.wmnet with reason: Maintenance	[production]
17:13	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2117 (T322618)', diff saved to https://phabricator.wikimedia.org/P42461 and previous config saved to /var/cache/conftool/dbconfig/20221207-171305-ladsgroup.json	[production]
17:13	<cmjohnson@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1024.eqiad.wmnet with reason: host reimage	[production]
17:12	<sukhe@cumin2002>	START - Cookbook sre.dns.netbox	[production]
17:12	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 38 hosts with reason: Primary switchover s1 T324692	[production]
17:11	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1:00:00 on 38 hosts with reason: Primary switchover s1 T324692	[production]
17:10	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1023.eqiad.wmnet with reason: host reimage	[production]
17:10	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1024.eqiad.wmnet with reason: host reimage	[production]
17:08	<sukhe@cumin2002>	START - Cookbook sre.hosts.decommission for hosts lvs5002.eqsin.wmnet	[production]
17:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P42460 and previous config saved to /var/cache/conftool/dbconfig/20221207-170316-ladsgroup.json	[production]
17:02	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2173', diff saved to https://phabricator.wikimedia.org/P42459 and previous config saved to /var/cache/conftool/dbconfig/20221207-170256-ladsgroup.json	[production]
17:01	<jiji@deploy1002>	Finished scap: Backport for [[gerrit:865123\|ProductionServices: Use redis_misc servers for LockManager (6/6) (T267581)]] (duration: 14m 46s)	[production]
16:58	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes1023.eqiad.wmnet with OS bullseye	[production]
16:58	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1096:3316', diff saved to https://phabricator.wikimedia.org/P42458 and previous config saved to /var/cache/conftool/dbconfig/20221207-165815-ladsgroup.json	[production]
16:57	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2117', diff saved to https://phabricator.wikimedia.org/P42457 and previous config saved to /var/cache/conftool/dbconfig/20221207-165758-ladsgroup.json	[production]
16:57	<cmjohnson@cumin1001>	START - Cookbook sre.hosts.reimage for host kubernetes1024.eqiad.wmnet with OS bullseye	[production]
16:56	<eevans@deploy1002>	helmfile [codfw] DONE helmfile.d/services/echostore: apply	[production]
16:55	<eevans@deploy1002>	helmfile [codfw] START helmfile.d/services/echostore: apply	[production]
16:55	<eevans@deploy1002>	helmfile [staging] DONE helmfile.d/services/echostore: apply	[production]
16:55	<eevans@deploy1002>	helmfile [staging] START helmfile.d/services/echostore: apply	[production]
16:48	<jiji@deploy1002>	jiji and jiji: Backport for [[gerrit:865123\|ProductionServices: Use redis_misc servers for LockManager (6/6) (T267581)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet	[production]
16:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1181 (T322618)', diff saved to https://phabricator.wikimedia.org/P42456 and previous config saved to /var/cache/conftool/dbconfig/20221207-164809-ladsgroup.json	[production]
16:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2173', diff saved to https://phabricator.wikimedia.org/P42455 and previous config saved to /var/cache/conftool/dbconfig/20221207-164748-ladsgroup.json	[production]
16:46	<jiji@deploy1002>	Started scap: Backport for [[gerrit:865123\|ProductionServices: Use redis_misc servers for LockManager (6/6) (T267581)]]	[production]
16:43	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1096:3316', diff saved to https://phabricator.wikimedia.org/P42454 and previous config saved to /var/cache/conftool/dbconfig/20221207-164308-ladsgroup.json	[production]