2022-05-02 §
10:34 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host sretest1002.eqiad.wmnet [production]
10:30 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27341 and previous config saved to /var/cache/conftool/dbconfig/20220502-103023-ladsgroup.json [production]
10:24 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169 (T306560)', diff saved to https://phabricator.wikimedia.org/P27340 and previous config saved to /var/cache/conftool/dbconfig/20220502-102402-ladsgroup.json [production]
10:19 <klausman@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ores2002.codfw.wmnet with OS buster [production]
10:18 <klausman@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ores2002.codfw.wmnet with reason: host reimage [production]
10:15 <klausman@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ores2002.codfw.wmnet with reason: host reimage [production]
10:15 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1180 (T298563)', diff saved to https://phabricator.wikimedia.org/P27338 and previous config saved to /var/cache/conftool/dbconfig/20220502-101518-ladsgroup.json [production]
10:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P27337 and previous config saved to /var/cache/conftool/dbconfig/20220502-100857-ladsgroup.json [production]
10:06 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
10:05 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
10:05 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
10:04 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
09:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P27334 and previous config saved to /var/cache/conftool/dbconfig/20220502-095352-ladsgroup.json [production]
09:50 <klausman@cumin2002> START - Cookbook sre.hosts.reimage for host ores2002.codfw.wmnet with OS buster [production]
09:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1180 (T298563)', diff saved to https://phabricator.wikimedia.org/P27333 and previous config saved to /var/cache/conftool/dbconfig/20220502-094938-ladsgroup.json [production]
09:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1180.eqiad.wmnet with reason: Maintenance [production]
09:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 10:00:00 on db1180.eqiad.wmnet with reason: Maintenance [production]
09:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1165 (T298563)', diff saved to https://phabricator.wikimedia.org/P27332 and previous config saved to /var/cache/conftool/dbconfig/20220502-094930-ladsgroup.json [production]
09:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1169 (T306560)', diff saved to https://phabricator.wikimedia.org/P27331 and previous config saved to /var/cache/conftool/dbconfig/20220502-093847-ladsgroup.json [production]
09:36 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1169 (T306560)', diff saved to https://phabricator.wikimedia.org/P27330 and previous config saved to /var/cache/conftool/dbconfig/20220502-093628-ladsgroup.json [production]
09:36 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
09:36 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]
09:36 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
09:36 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
09:35 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]
09:35 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]
09:35 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3311 (T306560)', diff saved to https://phabricator.wikimedia.org/P27329 and previous config saved to /var/cache/conftool/dbconfig/20220502-093547-ladsgroup.json [production]
09:34 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P27328 and previous config saved to /var/cache/conftool/dbconfig/20220502-093425-ladsgroup.json [production]
09:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3311', diff saved to https://phabricator.wikimedia.org/P27327 and previous config saved to /var/cache/conftool/dbconfig/20220502-092042-ladsgroup.json [production]
09:19 <moritzm> installing ghostscript security updates on Stretch (newer distros not affected) [production]
09:19 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1165', diff saved to https://phabricator.wikimedia.org/P27326 and previous config saved to /var/cache/conftool/dbconfig/20220502-091920-ladsgroup.json [production]
09:06 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS bullseye [production]
09:05 <jynus@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1002.eqiad.wmnet with OS bullseye [production]
09:05 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3311', diff saved to https://phabricator.wikimedia.org/P27325 and previous config saved to /var/cache/conftool/dbconfig/20220502-090537-ladsgroup.json [production]
09:05 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS bullseye [production]
09:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1165 (T298563)', diff saved to https://phabricator.wikimedia.org/P27324 and previous config saved to /var/cache/conftool/dbconfig/20220502-090415-ladsgroup.json [production]
09:04 <jynus@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1002.eqiad.wmnet with OS bullseye [production]
09:03 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS bullseye [production]
09:01 <jynus@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1002.eqiad.wmnet with OS bullseye [production]
09:01 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS bullseye [production]
09:00 <jynus@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host backup1002.eqiad.wmnet with OS bullseye [production]
09:00 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS bullseye [production]
09:00 <jynus@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host backup1002.eqiad.wmnet with OS buster [production]
08:57 <jynus@cumin1001> START - Cookbook sre.hosts.reimage for host backup1002.eqiad.wmnet with OS buster [production]
08:50 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1105:3311 (T306560)', diff saved to https://phabricator.wikimedia.org/P27323 and previous config saved to /var/cache/conftool/dbconfig/20220502-085032-ladsgroup.json [production]
08:48 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1105:3311 (T306560)', diff saved to https://phabricator.wikimedia.org/P27322 and previous config saved to /var/cache/conftool/dbconfig/20220502-084812-ladsgroup.json [production]
08:48 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1105.eqiad.wmnet with reason: Maintenance [production]
08:48 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1105.eqiad.wmnet with reason: Maintenance [production]
08:47 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]
08:47 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1133.eqiad.wmnet with reason: Maintenance [production]