production SAL

601-650 of 10000 results (98ms)

2024-07-17 §
16:34	<klausman@deploy1002>	helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'.	[production]
16:34	<klausman@deploy1002>	helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'.	[production]
16:32	<klausman@deploy1002>	helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'.	[production]
16:31	<klausman@deploy1002>	helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'.	[production]
16:31	<klausman@deploy1002>	helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
16:31	<klausman@deploy1002>	helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'.	[production]
16:30	<klausman@deploy1002>	helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
16:30	<otto@deploy1002>	Finished deploy [analytics/refinery@8f00c85] (thin): THIN [analytics/refinery@8f00c859] (duration: 04m 08s)	[production]
16:29	<klausman@deploy1002>	helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'.	[production]
16:29	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P66755 and previous config saved to /var/cache/conftool/dbconfig/20240717-162952-arnaudb.json	[production]
16:26	<otto@deploy1002>	Started deploy [analytics/refinery@8f00c85] (thin): THIN [analytics/refinery@8f00c859]	[production]
16:21	<otto@deploy1002>	Finished deploy [analytics/refinery@8f00c85]: [analytics/refinery@8f00c859] (duration: 07m 59s)	[production]
16:14	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P66754 and previous config saved to /var/cache/conftool/dbconfig/20240717-161445-arnaudb.json	[production]
16:13	<otto@deploy1002>	Started deploy [analytics/refinery@8f00c85]: [analytics/refinery@8f00c859]	[production]
16:08	<inflatador>	bking@kafka-main1005 `kafka topics --create --topic ${TOPIC} --partitions 1 --replication-factor 3; kafka configs --entity-type topics --entity-name ${TOPIC} --alter --add-config retention.ms=2592000000` T367510	[production]
15:59	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1202 (T367781)', diff saved to https://phabricator.wikimedia.org/P66752 and previous config saved to /var/cache/conftool/dbconfig/20240717-155937-arnaudb.json	[production]
15:56	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Depooling db1202 (T367781)', diff saved to https://phabricator.wikimedia.org/P66751 and previous config saved to /var/cache/conftool/dbconfig/20240717-155628-arnaudb.json	[production]
15:56	<arnaudb@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1202.eqiad.wmnet with reason: Maintenance	[production]
15:56	<arnaudb@cumin1002>	START - Cookbook sre.hosts.downtime for 4:00:00 on db1202.eqiad.wmnet with reason: Maintenance	[production]
15:56	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1194 (T367781)', diff saved to https://phabricator.wikimedia.org/P66750 and previous config saved to /var/cache/conftool/dbconfig/20240717-155606-arnaudb.json	[production]
15:53	<otto@deploy1002>	Finished deploy [analytics/refinery@8f00c85] (hadoop-test): - take 2 - TEST [analytics/refinery@8f00c859] (duration: 03m 33s)	[production]
15:50	<otto@deploy1002>	Started deploy [analytics/refinery@8f00c85] (hadoop-test): - take 2 - TEST [analytics/refinery@8f00c859]	[production]
15:46	<otto@deploy1002>	Finished deploy [analytics/refinery@0b53772] (hadoop-test): TEST [analytics/refinery@0b53772e] (duration: 03m 27s)	[production]
15:42	<otto@deploy1002>	Started deploy [analytics/refinery@0b53772] (hadoop-test): TEST [analytics/refinery@0b53772e]	[production]
15:40	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P66748 and previous config saved to /var/cache/conftool/dbconfig/20240717-154059-arnaudb.json	[production]
15:38	<sukhe@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-low-traffic-eqiad and A:lvs	[production]
15:37	<sukhe@cumin1002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-low-traffic-eqiad and A:lvs	[production]
15:35	<sukhe@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs	[production]
15:35	<sukhe@cumin1002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-eqiad and A:lvs	[production]
15:33	<sukhe@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-low-traffic-codfw and A:lvs	[production]
15:32	<sukhe@cumin1002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-low-traffic-codfw and A:lvs	[production]
15:32	<topranks>	Adjust anycast route policy at Chicago Network POP cr2-eqord to announce anycast ranges T367439	[production]
15:30	<sukhe>	sudo cumin "A:lvs" "run-puppet-agent" to pick up apus change	[production]
15:29	<sukhe@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on A:lvs-secondary-codfw and A:lvs	[production]
15:28	<sukhe@cumin1002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on A:lvs-secondary-codfw and A:lvs	[production]
15:25	<arnaudb@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P66747 and previous config saved to /var/cache/conftool/dbconfig/20240717-152552-arnaudb.json	[production]
15:24	<jforrester@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply	[production]
15:23	<jforrester@deploy1002>	helmfile [eqiad] START helmfile.d/services/wikifunctions: apply	[production]
15:23	<jforrester@deploy1002>	helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply	[production]
15:22	<jforrester@deploy1002>	helmfile [codfw] START helmfile.d/services/wikifunctions: apply	[production]
15:21	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy2007.codfw.wmnet with OS bookworm	[production]
15:21	<pt1979@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
15:21	<jforrester@deploy1002>	helmfile [staging] DONE helmfile.d/services/wikifunctions: apply	[production]
15:21	<sukhe@cumin1002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) apus.discovery.wmnet on all recursors	[production]
15:20	<sukhe@cumin1002>	START - Cookbook sre.dns.wipe-cache apus.discovery.wmnet on all recursors	[production]
15:20	<jforrester@deploy1002>	helmfile [staging] START helmfile.d/services/wikifunctions: apply	[production]
15:19	<jgiannelos@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/changeprop: apply	[production]
15:18	<pt1979@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
15:18	<sukhe>	running authdns-update for CR 1054346	[production]
15:16	<jgiannelos@deploy1002>	helmfile [eqiad] START helmfile.d/services/changeprop: apply	[production]