production SAL

2023-08-28
10:54	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2172 (T344589)', diff saved to https://phabricator.wikimedia.org/P51575 and previous config saved to /var/cache/conftool/dbconfig/20230828-105407-ladsgroup.json	[production]
10:54	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2172.codfw.wmnet with reason: Maintenance	[production]
10:53	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2172.codfw.wmnet with reason: Maintenance	[production]
10:53	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2155 (T344589)', diff saved to https://phabricator.wikimedia.org/P51574 and previous config saved to /var/cache/conftool/dbconfig/20230828-105342-ladsgroup.json	[production]
10:51	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1134 (T343718)', diff saved to https://phabricator.wikimedia.org/P51573 and previous config saved to /var/cache/conftool/dbconfig/20230828-105153-ladsgroup.json	[production]
10:50	<moritzm>	installing exim4 bugfix updates from Bookworm point release	[production]
10:50	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
10:50	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
10:50	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1149 (T344589)', diff saved to https://phabricator.wikimedia.org/P51572 and previous config saved to /var/cache/conftool/dbconfig/20230828-105002-ladsgroup.json	[production]
10:49	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti2016.codfw.wmnet	[production]
10:46	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2016.codfw.wmnet	[production]
10:46	<jelto@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/miscweb: apply	[production]
10:44	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2145 (T343718)', diff saved to https://phabricator.wikimedia.org/P51571 and previous config saved to /var/cache/conftool/dbconfig/20230828-104407-ladsgroup.json	[production]
10:44	<jelto@deploy1002>	helmfile [eqiad] START helmfile.d/services/miscweb: apply	[production]
10:44	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2145.codfw.wmnet with reason: Maintenance	[production]
10:43	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2145.codfw.wmnet with reason: Maintenance	[production]
10:42	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-api-ext: apply	[production]
10:42	<cgoubert@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-api-ext: apply	[production]
10:41	<cgoubert@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply	[production]
10:41	<cgoubert@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-api-ext: apply	[production]
10:39	<elukey@cumin1001>	START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:ml-serve-worker-eqiad	[production]
10:38	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1005.eqiad.wmnet with reason: Maintenance	[production]
10:38	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P51570 and previous config saved to /var/cache/conftool/dbconfig/20230828-103836-ladsgroup.json	[production]
10:38	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1005.eqiad.wmnet with reason: Maintenance	[production]
10:38	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1214 (T344589)', diff saved to https://phabricator.wikimedia.org/P51569 and previous config saved to /var/cache/conftool/dbconfig/20230828-103827-ladsgroup.json	[production]
10:38	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance es2027 (T344589)', diff saved to https://phabricator.wikimedia.org/P51568 and previous config saved to /var/cache/conftool/dbconfig/20230828-103826-ladsgroup.json	[production]
10:37	<jelto@deploy1002>	helmfile [codfw] DONE helmfile.d/services/miscweb: apply	[production]
10:35	<jelto@deploy1002>	helmfile [codfw] START helmfile.d/services/miscweb: apply	[production]
10:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1149', diff saved to https://phabricator.wikimedia.org/P51567 and previous config saved to /var/cache/conftool/dbconfig/20230828-103456-ladsgroup.json	[production]
10:31	<jelto@deploy1002>	helmfile [staging] DONE helmfile.d/services/miscweb: apply	[production]
10:30	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2015.codfw.wmnet	[production]
10:30	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2015.codfw.wmnet	[production]
10:29	<jelto@deploy1002>	helmfile [staging] START helmfile.d/services/miscweb: apply	[production]
10:23	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P51566 and previous config saved to /var/cache/conftool/dbconfig/20230828-102330-ladsgroup.json	[production]
10:23	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1214', diff saved to https://phabricator.wikimedia.org/P51565 and previous config saved to /var/cache/conftool/dbconfig/20230828-102320-ladsgroup.json	[production]
10:23	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance es2027', diff saved to https://phabricator.wikimedia.org/P51564 and previous config saved to /var/cache/conftool/dbconfig/20230828-102320-ladsgroup.json	[production]
10:23	<elukey@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/changeprop: sync	[production]
10:23	<elukey@deploy1002>	helmfile [eqiad] START helmfile.d/services/changeprop: sync	[production]
10:22	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti2015.codfw.wmnet	[production]
10:21	<cgoubert@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply	[production]
10:20	<cgoubert@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-api-ext: apply	[production]
10:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1149', diff saved to https://phabricator.wikimedia.org/P51563 and previous config saved to /var/cache/conftool/dbconfig/20230828-101949-ladsgroup.json	[production]
10:17	<fabfur>	enable puppet and start pybal on lvs4008 for reboot (T344587)	[production]
10:16	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs4008.ulsfo.wmnet	[production]
10:16	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2015.codfw.wmnet	[production]
10:15	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply	[production]
10:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1134 (T343718)', diff saved to https://phabricator.wikimedia.org/P51562 and previous config saved to /var/cache/conftool/dbconfig/20230828-101426-ladsgroup.json	[production]
10:14	<cgoubert@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-debug: apply	[production]
10:14	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1134.eqiad.wmnet with reason: Maintenance	[production]
10:14	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1134.eqiad.wmnet with reason: Maintenance	[production]