production SAL

8251-8300 of 10000 results (35ms)

2022-01-25 §
11:07	<godog>	temp disable alerting on prometheus200[56] - T296199	[production]
10:57	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T285149)', diff saved to https://phabricator.wikimedia.org/P19124 and previous config saved to /var/cache/conftool/dbconfig/20220125-105744-marostegui.json	[production]
10:56	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1146:3314 (T285149)', diff saved to https://phabricator.wikimedia.org/P19123 and previous config saved to /var/cache/conftool/dbconfig/20220125-105636-marostegui.json	[production]
10:56	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1146.eqiad.wmnet with reason: Maintenance	[production]
10:56	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1146.eqiad.wmnet with reason: Maintenance	[production]
10:56	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1143 (T285149)', diff saved to https://phabricator.wikimedia.org/P19122 and previous config saved to /var/cache/conftool/dbconfig/20220125-105628-marostegui.json	[production]
10:55	<marostegui@cumin1001>	START - Cookbook sre.hosts.reimage for host es2021.codfw.wmnet with OS bullseye	[production]
10:53	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.reimage for host es2027.codfw.wmnet with OS bullseye	[production]
10:52	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2027.codfw.wmnet with reason: reimage for upgrade - T299911	[production]
10:52	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es2027.codfw.wmnet with reason: reimage for upgrade - T299911	[production]
10:50	<hnowlan>	disabling puppet on all maps hosts to test cassandra removal	[production]
10:45	<hnowlan@puppetmaster1001>	conftool action : set/pooled=no; selector: name=restbase2011.eqiad.wmnet	[production]
10:43	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repool es2020', diff saved to https://phabricator.wikimedia.org/P19121 and previous config saved to /var/cache/conftool/dbconfig/20220125-104331-marostegui.json	[production]
10:41	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2020.codfw.wmnet with OS bullseye	[production]
10:41	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1143', diff saved to https://phabricator.wikimedia.org/P19120 and previous config saved to /var/cache/conftool/dbconfig/20220125-104124-marostegui.json	[production]
10:37	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2029.codfw.wmnet with OS bullseye	[production]
10:36	<hnowlan>	nodetool removenode for restbase2011-c	[production]
10:32	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:29	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depool es1022 T299123', diff saved to https://phabricator.wikimedia.org/P19119 and previous config saved to /var/cache/conftool/dbconfig/20220125-102912-marostegui.json	[production]
10:28	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
10:28	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:26	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1143', diff saved to https://phabricator.wikimedia.org/P19118 and previous config saved to /var/cache/conftool/dbconfig/20220125-102619-marostegui.json	[production]
10:24	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1101:3317 (T299827)', diff saved to https://phabricator.wikimedia.org/P19117 and previous config saved to /var/cache/conftool/dbconfig/20220125-102448-marostegui.json	[production]
10:24	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1101.eqiad.wmnet with reason: Maintenance	[production]
10:24	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1101.eqiad.wmnet with reason: Maintenance	[production]
10:24	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance	[production]
10:24	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance	[production]
10:24	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
10:24	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance	[production]
10:24	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1171.eqiad.wmnet with reason: Maintenance	[production]
10:24	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1098:3317 (T299827)', diff saved to https://phabricator.wikimedia.org/P19116 and previous config saved to /var/cache/conftool/dbconfig/20220125-102426-marostegui.json	[production]
10:18	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1013.eqiad.wmnet	[production]
10:13	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti1013.eqiad.wmnet	[production]
10:11	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1143 (T285149)', diff saved to https://phabricator.wikimedia.org/P19115 and previous config saved to /var/cache/conftool/dbconfig/20220125-101114-marostegui.json	[production]
10:09	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:09	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1098:3317', diff saved to https://phabricator.wikimedia.org/P19114 and previous config saved to /var/cache/conftool/dbconfig/20220125-100921-marostegui.json	[production]
10:09	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1143 (T285149)', diff saved to https://phabricator.wikimedia.org/P19113 and previous config saved to /var/cache/conftool/dbconfig/20220125-100907-marostegui.json	[production]
10:09	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1143.eqiad.wmnet with reason: Maintenance	[production]
10:09	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1143.eqiad.wmnet with reason: Maintenance	[production]
10:09	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3314 (T285149)', diff saved to https://phabricator.wikimedia.org/P19112 and previous config saved to /var/cache/conftool/dbconfig/20220125-100900-marostegui.json	[production]
10:08	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
10:08	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:06	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
10:04	<taavi@deploy1002>	Synchronized wmf-config/extension-list: Config: [[gerrit:755534\|Undeploy UserMerge (3) (T216089)]] (duration: 00m 48s)	[production]
10:03	<marostegui@cumin1001>	START - Cookbook sre.hosts.reimage for host es2020.codfw.wmnet with OS bullseye	[production]
10:02	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.reimage for host es2029.codfw.wmnet with OS bullseye	[production]
10:01	<taavi@deploy1002>	Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:755533\|Undeploy UserMerge (2) (T216089)]] (duration: 00m 49s)	[production]
10:01	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: sync on pinkunicorn	[production]
10:00	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply on pinkunicorn	[production]
10:00	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2029.codfw.wmnet with reason: reimage for upgrade - T299911	[production]