production SAL

3401-3450 of 10000 results (51ms)

2021-12-10 §
15:09	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P18111 and previous config saved to /var/cache/conftool/dbconfig/20211210-150906-marostegui.json	[production]
15:08	<jelto@deploy1002>	helmfile [codfw] Ran 'sync' command on namespace 'blubberoid' for release 'production' .	[production]
15:06	<jynus>	increase backup2005's allocated disk space	[production]
15:01	<jelto@deploy1002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
15:01	<jelto>	remove tiller from codfw Kubernetes cluster	[production]
15:01	<jelto@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
14:55	<moritzm>	drain primary/secondary instances off ganeti2017 T296622	[production]
14:54	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P18110 and previous config saved to /var/cache/conftool/dbconfig/20211210-145401-marostegui.json	[production]
14:50	<jelto@deploy1002>	helmfile [staging] Ran 'sync' command on namespace 'blubberoid' for release 'staging' .	[production]
14:49	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on restbase2026.codfw.wmnet with reason: New cassandra hosts awaiting syncing	[production]
14:49	<hnowlan@cumin1001>	START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on restbase2026.codfw.wmnet with reason: New cassandra hosts awaiting syncing	[production]
14:48	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on ganeti2008.codfw.wmnet with reason: Temporarily remove node from Ganeti for reimage	[production]
14:48	<jmm@cumin2002>	START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on ganeti2008.codfw.wmnet with reason: Temporarily remove node from Ganeti for reimage	[production]
14:48	<jelto@deploy1002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:48	<jelto@deploy1002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
14:48	<jelto>	remove tiller from staging-eqiad Kubernetes cluster	[production]
14:38	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1174 (T277354)', diff saved to https://phabricator.wikimedia.org/P18109 and previous config saved to /var/cache/conftool/dbconfig/20211210-143856-marostegui.json	[production]
14:36	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1174 (T277354)', diff saved to https://phabricator.wikimedia.org/P18108 and previous config saved to /var/cache/conftool/dbconfig/20211210-143636-marostegui.json	[production]
14:36	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
14:36	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 4:00:00 on db1174.eqiad.wmnet with reason: Maintenance	[production]
14:36	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158 (T277354)', diff saved to https://phabricator.wikimedia.org/P18107 and previous config saved to /var/cache/conftool/dbconfig/20211210-143628-marostegui.json	[production]
14:36	<hnowlan@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2026.codfw.wmnet with OS buster	[production]
14:33	<jelto@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:33	<jelto@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:33	<jelto@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:33	<jelto@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:29	<jelto>	remove tiller from staging-codfw Kubernetes cluster	[production]
14:28	<jelto@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:27	<jelto@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:21	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P18106 and previous config saved to /var/cache/conftool/dbconfig/20211210-142123-marostegui.json	[production]
14:17	<jelto@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:17	<jelto@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:06	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P18105 and previous config saved to /var/cache/conftool/dbconfig/20211210-140618-marostegui.json	[production]
14:01	<jynus>	increase backup2004's allocated disk space	[production]
13:51	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158 (T277354)', diff saved to https://phabricator.wikimedia.org/P18104 and previous config saved to /var/cache/conftool/dbconfig/20211210-135114-marostegui.json	[production]
13:49	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1158 (T277354)', diff saved to https://phabricator.wikimedia.org/P18103 and previous config saved to /var/cache/conftool/dbconfig/20211210-134953-marostegui.json	[production]
13:49	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db[1155,1158].eqiad.wmnet with reason: Maintenance	[production]
13:49	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 4:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db[1155,1158].eqiad.wmnet with reason: Maintenance	[production]
13:49	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1127 (T277354)', diff saved to https://phabricator.wikimedia.org/P18102 and previous config saved to /var/cache/conftool/dbconfig/20211210-134941-marostegui.json	[production]
13:34	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P18101 and previous config saved to /var/cache/conftool/dbconfig/20211210-133437-marostegui.json	[production]
13:19	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P18100 and previous config saved to /var/cache/conftool/dbconfig/20211210-131932-marostegui.json	[production]
13:17	<hnowlan@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2025.codfw.wmnet with OS buster	[production]
13:04	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1127 (T277354)', diff saved to https://phabricator.wikimedia.org/P18099 and previous config saved to /var/cache/conftool/dbconfig/20211210-130427-marostegui.json	[production]
13:00	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1127 (T277354)', diff saved to https://phabricator.wikimedia.org/P18098 and previous config saved to /var/cache/conftool/dbconfig/20211210-130051-marostegui.json	[production]
13:00	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1127.eqiad.wmnet with reason: Maintenance	[production]
13:00	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 4:00:00 on db1127.eqiad.wmnet with reason: Maintenance	[production]
12:56	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on restbase[2024-2025].codfw.wmnet with reason: New cassandra hosts awaiting syncing	[production]
12:56	<hnowlan@cumin1001>	START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on restbase[2024-2025].codfw.wmnet with reason: New cassandra hosts awaiting syncing	[production]
12:53	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host restbase2026.codfw.wmnet with OS buster	[production]
12:51	<hnowlan@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2024.codfw.wmnet with OS buster	[production]