production SAL

2022-11-07
16:26	<filippo@cumin1001>	START - Cookbook sre.dns.netbox	[production]
16:26	<filippo@cumin1001>	START - Cookbook sre.ganeti.makevm for new host dispatch-be2001.codfw.wmnet	[production]
16:26	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P38366 and previous config saved to /var/cache/conftool/dbconfig/20221107-162616-marostegui.json	[production]
16:23	<volans@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
16:21	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED	[production]
16:21	<volans@cumin1001>	START - Cookbook sre.dns.netbox	[production]
16:20	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P38365 and previous config saved to /var/cache/conftool/dbconfig/20221107-162033-ladsgroup.json	[production]
16:20	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P38364 and previous config saved to /var/cache/conftool/dbconfig/20221107-162023-ladsgroup.json	[production]
16:18	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2121 (T318605)', diff saved to https://phabricator.wikimedia.org/P38363 and previous config saved to /var/cache/conftool/dbconfig/20221107-161837-ladsgroup.json	[production]
16:18	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance	[production]
16:18	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance	[production]
16:18	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2120 (T318605)', diff saved to https://phabricator.wikimedia.org/P38362 and previous config saved to /var/cache/conftool/dbconfig/20221107-161816-ladsgroup.json	[production]
16:14	<pt1979@cumin2002>	START - Cookbook sre.hosts.provision for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED	[production]
16:13	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1201 (T321130)', diff saved to https://phabricator.wikimedia.org/P38361 and previous config saved to /var/cache/conftool/dbconfig/20221107-161327-marostegui.json	[production]
16:11	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1201 (T321130)', diff saved to https://phabricator.wikimedia.org/P38360 and previous config saved to /var/cache/conftool/dbconfig/20221107-161118-marostegui.json	[production]
16:11	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P38359 and previous config saved to /var/cache/conftool/dbconfig/20221107-161109-marostegui.json	[production]
16:11	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
16:10	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 5:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
16:10	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T321130)', diff saved to https://phabricator.wikimedia.org/P38358 and previous config saved to /var/cache/conftool/dbconfig/20221107-161050-marostegui.json	[production]
16:06	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
16:06	<ebernhardson@deploy1002>	Finished deploy [wikimedia/discovery/analytics@e51ff67]: import_cirrus_indexes: set executor cores to 1 (duration: 02m 19s)	[production]
16:05	<cgoubert@deploy1002>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
16:05	<cgoubert@deploy1002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
16:05	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1127 (T318605)', diff saved to https://phabricator.wikimedia.org/P38357 and previous config saved to /var/cache/conftool/dbconfig/20221107-160527-ladsgroup.json	[production]
16:05	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1198 (T318955)', diff saved to https://phabricator.wikimedia.org/P38356 and previous config saved to /var/cache/conftool/dbconfig/20221107-160516-ladsgroup.json	[production]
16:04	<cgoubert@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
16:03	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1010.eqiad.wmnet to cluster eqiad and group C	[production]
16:03	<ebernhardson@deploy1002>	Started deploy [wikimedia/discovery/analytics@e51ff67]: import_cirrus_indexes: set executor cores to 1	[production]
16:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2120', diff saved to https://phabricator.wikimedia.org/P38355 and previous config saved to /var/cache/conftool/dbconfig/20221107-160310-ladsgroup.json	[production]
16:02	<jmm@cumin2002>	START - Cookbook sre.ganeti.addnode for new host ganeti1010.eqiad.wmnet to cluster eqiad and group C	[production]
16:02	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED	[production]
16:02	<cgoubert@deploy1002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
16:02	<pt1979@cumin2002>	START - Cookbook sre.hosts.provision for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED	[production]
16:01	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1198 (T318955)', diff saved to https://phabricator.wikimedia.org/P38354 and previous config saved to /var/cache/conftool/dbconfig/20221107-160124-ladsgroup.json	[production]
16:01	<cgoubert@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
16:01	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1198.eqiad.wmnet with reason: Maintenance	[production]
16:01	<claime>	cleaning up stale mwdebug kubernetes config	[production]
16:01	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1198.eqiad.wmnet with reason: Maintenance	[production]
16:01	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1189 (T318955)', diff saved to https://phabricator.wikimedia.org/P38353 and previous config saved to /var/cache/conftool/dbconfig/20221107-160102-ladsgroup.json	[production]
16:00	<cgoubert@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
15:59	<cgoubert@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
15:56	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1199 (T321123)', diff saved to https://phabricator.wikimedia.org/P38352 and previous config saved to /var/cache/conftool/dbconfig/20221107-155603-marostegui.json	[production]
15:55	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P38351 and previous config saved to /var/cache/conftool/dbconfig/20221107-155544-marostegui.json	[production]
15:55	<elukey>	upgrade istioctl to 1.15.3 on apt1001 for {buster,bullseye}-wikimedia - T322193	[production]
15:54	<marostegui@cumin1001>	dbctl commit (dc=all): 'Depooling db1199 (T321123)', diff saved to https://phabricator.wikimedia.org/P38350 and previous config saved to /var/cache/conftool/dbconfig/20221107-155455-marostegui.json	[production]
15:54	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1199.eqiad.wmnet with reason: Maintenance	[production]
15:54	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db1199.eqiad.wmnet with reason: Maintenance	[production]
15:54	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1190 (T321123)', diff saved to https://phabricator.wikimedia.org/P38349 and previous config saved to /var/cache/conftool/dbconfig/20221107-155434-marostegui.json	[production]
15:51	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED	[production]
15:50	<pt1979@cumin2002>	START - Cookbook sre.hosts.provision for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED	[production]