production SAL

8401-8450 of 10000 results (50ms)

2022-01-12 §
10:33	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/shellbox-media: apply on main	[production]
10:33	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/shellbox-media: apply on main	[production]
10:33	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/shellbox-constraints: apply on main	[production]
10:33	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/shellbox-constraints: apply on main	[production]
10:33	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/shellbox: sync on main	[production]
10:32	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/shellbox: apply on main	[production]
10:31	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool db1128', diff saved to https://phabricator.wikimedia.org/P18632 and previous config saved to /var/cache/conftool/dbconfig/20220112-103144-marostegui.json	[production]
10:29	<marostegui@cumin1001>	dbctl commit (dc=all): 'Pool db1128 in s1 with minimal weight T295965', diff saved to https://phabricator.wikimedia.org/P18631 and previous config saved to /var/cache/conftool/dbconfig/20220112-102938-marostegui.json	[production]
10:25	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P18630 and previous config saved to /var/cache/conftool/dbconfig/20220112-102523-marostegui.json	[production]
10:10	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1166 (T297191)', diff saved to https://phabricator.wikimedia.org/P18629 and previous config saved to /var/cache/conftool/dbconfig/20220112-101018-marostegui.json	[production]
10:08	<jelto@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM gitlab1001.wikimedia.org	[production]
10:06	<jelto@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM gitlab1001.wikimedia.org	[production]
10:03	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:02	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
10:02	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:00	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
09:57	<marostegui@deploy1002>	Synchronized wmf-config/ProductionServices.php: Revert: Promote pc1014 to master in pc1 (duration: 01m 07s)	[production]
09:54	<hnowlan>	Decommissioning cassandra instance restbase2009-b via nodetool	[production]
09:53	<jelto@cumin1001>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM gitlab-runner1001.eqiad.wmnet	[production]
09:51	<moritzm>	reverting kubetcd2006 back to "plain" storage	[production]
09:51	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubetcd2006.codfw.wmnet with reason: switch to plain disk storage	[production]
09:51	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on kubetcd2006.codfw.wmnet with reason: switch to plain disk storage	[production]
09:51	<jelto@cumin1001>	START - Cookbook sre.ganeti.reboot-vm for VM gitlab-runner1001.eqiad.wmnet	[production]
09:50	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
09:49	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
09:49	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
09:48	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
09:37	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc1011.eqiad.wmnet with OS bullseye	[production]
09:21	<moritzm>	reverting kubetcd2005 back to "plain" storage	[production]
09:20	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kubetcd2005.codfw.wmnet with reason: switch to plain disk storage	[production]
09:20	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 1:00:00 on kubetcd2005.codfw.wmnet with reason: switch to plain disk storage	[production]
09:13	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
09:12	<marostegui@cumin1001>	START - Cookbook sre.hosts.reimage for host pc1011.eqiad.wmnet with OS bullseye	[production]
09:11	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
09:11	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
09:10	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
09:09	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1166 (T297191)', diff saved to https://phabricator.wikimedia.org/P18628 and previous config saved to /var/cache/conftool/dbconfig/20220112-090959-marostegui.json	[production]
09:09	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1166.eqiad.wmnet with reason: Maintenance	[production]
09:09	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1166.eqiad.wmnet with reason: Maintenance	[production]
09:08	<marostegui@deploy1002>	Synchronized wmf-config/ProductionServices.php: Promote pc1014 to master in pc1 (duration: 01m 08s)	[production]
09:05	<marostegui>	Reset replication on pc1014	[production]
08:50	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on 6 hosts with reason: Maintenance	[production]
08:50	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 12:00:00 on 6 hosts with reason: Maintenance	[production]
08:50	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2105.codfw.wmnet with reason: Maintenance	[production]
08:50	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db2105.codfw.wmnet with reason: Maintenance	[production]
08:50	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1112 (T297191)', diff saved to https://phabricator.wikimedia.org/P18627 and previous config saved to /var/cache/conftool/dbconfig/20220112-085024-marostegui.json	[production]
08:40	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM miscweb1002.eqiad.wmnet	[production]
08:37	<jmm@cumin2002>	START - Cookbook sre.ganeti.reboot-vm for VM miscweb1002.eqiad.wmnet	[production]
08:35	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1112', diff saved to https://phabricator.wikimedia.org/P18626 and previous config saved to /var/cache/conftool/dbconfig/20220112-083520-marostegui.json	[production]
08:30	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM mwdebug1002.eqiad.wmnet	[production]