2022-11-03
ยง
|
10:49 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance |
[production] |
10:49 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1096.eqiad.wmnet with reason: Maintenance |
[production] |
10:49 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance |
[production] |
10:43 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1170:3317 (T321123)', diff saved to https://phabricator.wikimedia.org/P37900 and previous config saved to /var/cache/conftool/dbconfig/20221103-104313-marostegui.json |
[production] |
10:43 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
10:42 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
10:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T321123)', diff saved to https://phabricator.wikimedia.org/P37899 and previous config saved to /var/cache/conftool/dbconfig/20221103-104239-marostegui.json |
[production] |
10:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P37898 and previous config saved to /var/cache/conftool/dbconfig/20221103-102730-marostegui.json |
[production] |
10:19 |
<jmm@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ganeti1025.eqiad.wmnet'] |
[production] |
10:12 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['ganeti1025.eqiad.wmnet'] |
[production] |
10:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P37897 and previous config saved to /var/cache/conftool/dbconfig/20221103-101222-marostegui.json |
[production] |
09:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T321123)', diff saved to https://phabricator.wikimedia.org/P37896 and previous config saved to /var/cache/conftool/dbconfig/20221103-095715-marostegui.json |
[production] |
09:55 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1158 (T321123)', diff saved to https://phabricator.wikimedia.org/P37895 and previous config saved to /var/cache/conftool/dbconfig/20221103-095501-marostegui.json |
[production] |
09:54 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
09:54 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 16:00:00 on clouddb[1014,1018,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
09:54 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1158.eqiad.wmnet with reason: Maintenance |
[production] |
09:54 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1158.eqiad.wmnet with reason: Maintenance |
[production] |
09:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1136 (T321123)', diff saved to https://phabricator.wikimedia.org/P37894 and previous config saved to /var/cache/conftool/dbconfig/20221103-095409-marostegui.json |
[production] |
09:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1136', diff saved to https://phabricator.wikimedia.org/P37893 and previous config saved to /var/cache/conftool/dbconfig/20221103-093901-marostegui.json |
[production] |
09:36 |
<jmm@cumin2002> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ganeti1025.eqiad.wmnet'] |
[production] |
09:26 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.ores.roll-restart-workers (exit_code=0) for ORES codfw cluster: Roll restart of ORES's daemons. |
[production] |
09:23 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1136', diff saved to https://phabricator.wikimedia.org/P37892 and previous config saved to /var/cache/conftool/dbconfig/20221103-092353-marostegui.json |
[production] |
09:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1136 (T321123)', diff saved to https://phabricator.wikimedia.org/P37891 and previous config saved to /var/cache/conftool/dbconfig/20221103-090844-marostegui.json |
[production] |
09:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1136 (T321123)', diff saved to https://phabricator.wikimedia.org/P37890 and previous config saved to /var/cache/conftool/dbconfig/20221103-090631-marostegui.json |
[production] |
09:06 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1136.eqiad.wmnet with reason: Maintenance |
[production] |
09:06 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1136.eqiad.wmnet with reason: Maintenance |
[production] |
09:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1127 (T321123)', diff saved to https://phabricator.wikimedia.org/P37889 and previous config saved to /var/cache/conftool/dbconfig/20221103-090607-marostegui.json |
[production] |
09:05 |
<elukey@cumin1001> |
START - Cookbook sre.ores.roll-restart-workers for ORES codfw cluster: Roll restart of ORES's daemons. |
[production] |
09:02 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
09:02 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
08:56 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.ores.roll-restart-workers (exit_code=0) for ORES eqiad cluster: Roll restart of ORES's daemons. |
[production] |
08:53 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
08:53 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
08:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P37888 and previous config saved to /var/cache/conftool/dbconfig/20221103-085059-marostegui.json |
[production] |
08:44 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
08:43 |
<jmm@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
08:39 |
<moritzm> |
installing ruby-nokogiri security updates |
[production] |
08:37 |
<elukey@cumin1001> |
START - Cookbook sre.ores.roll-restart-workers for ORES eqiad cluster: Roll restart of ORES's daemons. |
[production] |
08:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1127', diff saved to https://phabricator.wikimedia.org/P37887 and previous config saved to /var/cache/conftool/dbconfig/20221103-083549-marostegui.json |
[production] |
08:20 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1127 (T321123)', diff saved to https://phabricator.wikimedia.org/P37886 and previous config saved to /var/cache/conftool/dbconfig/20221103-082040-marostegui.json |
[production] |
08:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db1127 (T321123)', diff saved to https://phabricator.wikimedia.org/P37885 and previous config saved to /var/cache/conftool/dbconfig/20221103-081827-marostegui.json |
[production] |
08:18 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1127.eqiad.wmnet with reason: Maintenance |
[production] |
08:18 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1127.eqiad.wmnet with reason: Maintenance |
[production] |
08:18 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1101:3317 (T321123)', diff saved to https://phabricator.wikimedia.org/P37884 and previous config saved to /var/cache/conftool/dbconfig/20221103-081805-marostegui.json |
[production] |
08:17 |
<moritzm> |
installing glibc security updates on buster |
[production] |
08:02 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1101:3317', diff saved to https://phabricator.wikimedia.org/P37883 and previous config saved to /var/cache/conftool/dbconfig/20221103-080257-marostegui.json |
[production] |
08:01 |
<moritzm> |
installing exim4 security updates |
[production] |
07:58 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
07:55 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on ganeti1025.eqiad.wmnet with reason: Remove from cluster for eventual reimage |
[production] |
07:55 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on ganeti1025.eqiad.wmnet with reason: Remove from cluster for eventual reimage |
[production] |