2351-2400 of 10000 results (76ms)
2022-11-07 ยง
16:26 <filippo@cumin1001> START - Cookbook sre.dns.netbox [production]
16:26 <filippo@cumin1001> START - Cookbook sre.ganeti.makevm for new host dispatch-be2001.codfw.wmnet [production]
16:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P38366 and previous config saved to /var/cache/conftool/dbconfig/20221107-162616-marostegui.json [production]
16:23 <volans@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:21 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED [production]
16:21 <volans@cumin1001> START - Cookbook sre.dns.netbox [production]
16:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P38365 and previous config saved to /var/cache/conftool/dbconfig/20221107-162033-ladsgroup.json [production]
16:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P38364 and previous config saved to /var/cache/conftool/dbconfig/20221107-162023-ladsgroup.json [production]
16:18 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2121 (T318605)', diff saved to https://phabricator.wikimedia.org/P38363 and previous config saved to /var/cache/conftool/dbconfig/20221107-161837-ladsgroup.json [production]
16:18 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance [production]
16:18 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance [production]
16:18 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2120 (T318605)', diff saved to https://phabricator.wikimedia.org/P38362 and previous config saved to /var/cache/conftool/dbconfig/20221107-161816-ladsgroup.json [production]
16:14 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED [production]
16:13 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1201 (T321130)', diff saved to https://phabricator.wikimedia.org/P38361 and previous config saved to /var/cache/conftool/dbconfig/20221107-161327-marostegui.json [production]
16:11 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1201 (T321130)', diff saved to https://phabricator.wikimedia.org/P38360 and previous config saved to /var/cache/conftool/dbconfig/20221107-161118-marostegui.json [production]
16:11 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199', diff saved to https://phabricator.wikimedia.org/P38359 and previous config saved to /var/cache/conftool/dbconfig/20221107-161109-marostegui.json [production]
16:11 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
16:10 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 5:00:00 on db1201.eqiad.wmnet with reason: Maintenance [production]
16:10 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1187 (T321130)', diff saved to https://phabricator.wikimedia.org/P38358 and previous config saved to /var/cache/conftool/dbconfig/20221107-161050-marostegui.json [production]
16:06 <cgoubert@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
16:06 <ebernhardson@deploy1002> Finished deploy [wikimedia/discovery/analytics@e51ff67]: import_cirrus_indexes: set executor cores to 1 (duration: 02m 19s) [production]
16:05 <cgoubert@deploy1002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
16:05 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
16:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1127 (T318605)', diff saved to https://phabricator.wikimedia.org/P38357 and previous config saved to /var/cache/conftool/dbconfig/20221107-160527-ladsgroup.json [production]
16:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1198 (T318955)', diff saved to https://phabricator.wikimedia.org/P38356 and previous config saved to /var/cache/conftool/dbconfig/20221107-160516-ladsgroup.json [production]
16:04 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
16:03 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1010.eqiad.wmnet to cluster eqiad and group C [production]
16:03 <ebernhardson@deploy1002> Started deploy [wikimedia/discovery/analytics@e51ff67]: import_cirrus_indexes: set executor cores to 1 [production]
16:03 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2120', diff saved to https://phabricator.wikimedia.org/P38355 and previous config saved to /var/cache/conftool/dbconfig/20221107-160310-ladsgroup.json [production]
16:02 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti1010.eqiad.wmnet to cluster eqiad and group C [production]
16:02 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED [production]
16:02 <cgoubert@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
16:02 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED [production]
16:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1198 (T318955)', diff saved to https://phabricator.wikimedia.org/P38354 and previous config saved to /var/cache/conftool/dbconfig/20221107-160124-ladsgroup.json [production]
16:01 <cgoubert@deploy1002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
16:01 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1198.eqiad.wmnet with reason: Maintenance [production]
16:01 <claime> cleaning up stale mwdebug kubernetes config [production]
16:01 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db1198.eqiad.wmnet with reason: Maintenance [production]
16:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1189 (T318955)', diff saved to https://phabricator.wikimedia.org/P38353 and previous config saved to /var/cache/conftool/dbconfig/20221107-160102-ladsgroup.json [production]
16:00 <cgoubert@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:59 <cgoubert@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:56 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1199 (T321123)', diff saved to https://phabricator.wikimedia.org/P38352 and previous config saved to /var/cache/conftool/dbconfig/20221107-155603-marostegui.json [production]
15:55 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P38351 and previous config saved to /var/cache/conftool/dbconfig/20221107-155544-marostegui.json [production]
15:55 <elukey> upgrade istioctl to 1.15.3 on apt1001 for {buster,bullseye}-wikimedia - T322193 [production]
15:54 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1199 (T321123)', diff saved to https://phabricator.wikimedia.org/P38350 and previous config saved to /var/cache/conftool/dbconfig/20221107-155455-marostegui.json [production]
15:54 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1199.eqiad.wmnet with reason: Maintenance [production]
15:54 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 8:00:00 on db1199.eqiad.wmnet with reason: Maintenance [production]
15:54 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1190 (T321123)', diff saved to https://phabricator.wikimedia.org/P38349 and previous config saved to /var/cache/conftool/dbconfig/20221107-155434-marostegui.json [production]
15:51 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED [production]
15:50 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host dbprov2004.mgmt.codfw.wmnet with reboot policy FORCED [production]