production SAL

2023-09-19
12:44	<Emperor>	ms-be20[60-73] swift package updates T346730	[production]
12:22	<Emperor>	ms-be20[49-59] swift package updates T346730	[production]
12:19	<jebe@deploy1002>	Finished deploy [analytics/refinery@91bb4a0] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@91bb4a0] (duration: 02m 03s)	[production]
12:18	<Emperor>	ms-be2048 swift package updates T346730	[production]
12:17	<jebe@deploy1002>	Started deploy [analytics/refinery@91bb4a0] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@91bb4a0]	[production]
12:17	<jebe@deploy1002>	Finished deploy [analytics/refinery@91bb4a0] (thin): Regular analytics weekly train THIN [analytics/refinery@91bb4a0] (duration: 00m 05s)	[production]
12:17	<jebe@deploy1002>	Started deploy [analytics/refinery@91bb4a0] (thin): Regular analytics weekly train THIN [analytics/refinery@91bb4a0]	[production]
12:14	<Emperor>	ms-be2047 swift package updates T346730	[production]
12:12	<Emperor>	ms-be204{5,6} swift package updates T346730	[production]
12:10	<jebe@deploy1002>	Finished deploy [analytics/refinery@91bb4a0]: Regular analytics weekly train [analytics/refinery@91bb4a0] (duration: 06m 53s)	[production]
12:08	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
12:03	<jebe@deploy1002>	Started deploy [analytics/refinery@91bb4a0]: Regular analytics weekly train [analytics/refinery@91bb4a0]	[production]
11:51	<ayounsi@cumin1001>	END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox	[production]
11:51	<ayounsi@cumin1001>	START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox	[production]
11:50	<ayounsi@cumin1001>	END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary	[production]
11:48	<ayounsi@cumin1001>	START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary	[production]
11:21	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 100%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52530 and previous config saved to /var/cache/conftool/dbconfig/20230919-112156-root.json	[production]
11:09	<Emperor>	eqiad swift front-end swift package updates T346730	[production]
11:06	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 75%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52529 and previous config saved to /var/cache/conftool/dbconfig/20230919-110651-root.json	[production]
10:51	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 50%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52528 and previous config saved to /var/cache/conftool/dbconfig/20230919-105147-root.json	[production]
10:38	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1148.eqiad.wmnet with OS bullseye	[production]
10:36	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 25%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52527 and previous config saved to /var/cache/conftool/dbconfig/20230919-103642-root.json	[production]
10:34	<Emperor>	codfw swift front-end swift package updates T346730	[production]
10:25	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1147.eqiad.wmnet with OS bullseye	[production]
10:21	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 10%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52526 and previous config saved to /var/cache/conftool/dbconfig/20230919-102137-root.json	[production]
10:15	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1148.eqiad.wmnet with reason: host reimage	[production]
10:11	<stevemunene@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1148.eqiad.wmnet with reason: host reimage	[production]
10:06	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 5%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52525 and previous config saved to /var/cache/conftool/dbconfig/20230919-100632-root.json	[production]
10:01	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1147.eqiad.wmnet with reason: host reimage	[production]
09:58	<stevemunene@cumin1001>	START - Cookbook sre.hosts.reimage for host an-worker1148.eqiad.wmnet with OS bullseye	[production]
09:56	<stevemunene@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1147.eqiad.wmnet with reason: host reimage	[production]
09:51	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 3%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52524 and previous config saved to /var/cache/conftool/dbconfig/20230919-095127-root.json	[production]
09:48	<slyngshede@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host idm2001.wikimedia.org with OS bookworm	[production]
09:42	<stevemunene@cumin1001>	START - Cookbook sre.hosts.reimage for host an-worker1147.eqiad.wmnet with OS bullseye	[production]
09:40	<btullis@cumin1001>	START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons.	[production]
09:36	<marostegui@cumin1001>	dbctl commit (dc=all): 'db1134 (re)pooling @ 1%: Repooling after recloning db1128', diff saved to https://phabricator.wikimedia.org/P52523 and previous config saved to /var/cache/conftool/dbconfig/20230919-093622-root.json	[production]
09:12	<elukey@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-etcd2001.codfw.wmnet	[production]
09:08	<elukey@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-etcd2001.codfw.wmnet	[production]
09:03	<elukey@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-etcd2002.codfw.wmnet	[production]
08:59	<elukey@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-etcd2002.codfw.wmnet	[production]
08:47	<elukey@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ml-staging-etcd2003.codfw.wmnet	[production]
08:44	<godog>	bounce benthos@webrequest_live to clear out old metrics	[production]
08:43	<elukey@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM ml-staging-etcd2003.codfw.wmnet	[production]
08:41	<godog>	remove MediaWiki..growthexperiments.taskcount.link_recommendation. from graphite - T346371	[production]
08:39	<jmm@cumin2002>	END (PASS) - Cookbook sre.maps.roll-restart-reboot (exit_code=0) rolling restart_daemons on A:maps-replica-eqiad	[production]
08:36	<stevemunene@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1146.eqiad.wmnet with OS bullseye	[production]
08:34	<jmm@cumin2002>	START - Cookbook sre.maps.roll-restart-reboot rolling restart_daemons on A:maps-replica-eqiad	[production]
08:30	<jmm@cumin2002>	END (PASS) - Cookbook sre.maps.roll-restart-reboot (exit_code=0) rolling restart_daemons on A:maps-replica-codfw	[production]
08:29	<slyngshede@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on idm2001.wikimedia.org with reason: host reimage	[production]
08:26	<brouberol@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply	[production]