2024-01-22
ยง
|
10:18 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.reimage for host db2158.codfw.wmnet with OS bookworm |
[production] |
10:16 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2158', diff saved to https://phabricator.wikimedia.org/P55172 and previous config saved to /var/cache/conftool/dbconfig/20240122-101634-marostegui.json |
[production] |
10:13 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for gerrit[1003,2002].wikimedia.org |
[production] |
10:13 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for gerrit[1003,2002].wikimedia.org |
[production] |
10:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3315 (re)pooling @ 25%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55171 and previous config saved to /var/cache/conftool/dbconfig/20240122-100722-root.json |
[production] |
10:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3316 (re)pooling @ 25%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55170 and previous config saved to /var/cache/conftool/dbconfig/20240122-100707-root.json |
[production] |
10:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P55169 and previous config saved to /var/cache/conftool/dbconfig/20240122-100651-marostegui.json |
[production] |
10:04 |
<hashar> |
gerrit: running jgit gc on every repository to regenerate potentially faulty reachability bitmaps files preventing fetches on some repositories # T355173 |
[production] |
10:00 |
<jelto> |
start envoy on ticket-test.wikimedia.org to test alerting - T354479 |
[production] |
09:57 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host mc2049.codfw.wmnet |
[production] |
09:56 |
<jelto> |
stop envoy on ticket-test.wikimedia.org to test alerting - T354479 |
[production] |
09:52 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-host for host mc2049.codfw.wmnet |
[production] |
09:52 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host mc1049.eqiad.wmnet |
[production] |
09:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3315 (re)pooling @ 10%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55167 and previous config saved to /var/cache/conftool/dbconfig/20240122-095217-root.json |
[production] |
09:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3316 (re)pooling @ 10%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55166 and previous config saved to /var/cache/conftool/dbconfig/20240122-095202-root.json |
[production] |
09:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P55165 and previous config saved to /var/cache/conftool/dbconfig/20240122-095145-marostegui.json |
[production] |
09:51 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti1035.eqiad.wmnet to cluster eqiad and group A |
[production] |
09:49 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti1035.eqiad.wmnet to cluster eqiad and group A |
[production] |
09:47 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-host for host mc1049.eqiad.wmnet |
[production] |
09:38 |
<hashar> |
Restarted Gerrit with upgraded version 3.7.6 # T354885 |
[production] |
09:37 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3315 (re)pooling @ 5%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55164 and previous config saved to /var/cache/conftool/dbconfig/20240122-093712-root.json |
[production] |
09:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3316 (re)pooling @ 5%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55163 and previous config saved to /var/cache/conftool/dbconfig/20240122-093657-root.json |
[production] |
09:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T354336)', diff saved to https://phabricator.wikimedia.org/P55162 and previous config saved to /var/cache/conftool/dbconfig/20240122-093638-marostegui.json |
[production] |
09:26 |
<cgoubert@cumin1002> |
conftool action : set/pooled=no; selector: name=mw2394.codfw.wmnet |
[production] |
09:26 |
<cgoubert@cumin1002> |
conftool action : set/pooled=yes; selector: name=mw2444.codfw.wmnet |
[production] |
09:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3315 (re)pooling @ 1%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55161 and previous config saved to /var/cache/conftool/dbconfig/20240122-092207-root.json |
[production] |
09:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1213:3316 (re)pooling @ 1%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55160 and previous config saved to /var/cache/conftool/dbconfig/20240122-092152-root.json |
[production] |
09:19 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2156 (T354336)', diff saved to https://phabricator.wikimedia.org/P55159 and previous config saved to /var/cache/conftool/dbconfig/20240122-091916-marostegui.json |
[production] |
09:19 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
09:18 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 16:00:00 on db2186.codfw.wmnet with reason: Maintenance |
[production] |
09:18 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |
09:18 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1035.eqiad.wmnet to cluster eqiad and group A |
[production] |
09:18 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.addnode for new host ganeti1035.eqiad.wmnet to cluster eqiad and group A |
[production] |
09:18 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |
09:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T354336)', diff saved to https://phabricator.wikimedia.org/P55158 and previous config saved to /var/cache/conftool/dbconfig/20240122-091838-marostegui.json |
[production] |
09:17 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1213.eqiad.wmnet with OS bookworm |
[production] |
09:17 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on gerrit[1003,2002].wikimedia.org with reason: Gerrit update |
[production] |
09:17 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on gerrit[1003,2002].wikimedia.org with reason: Gerrit update |
[production] |
09:15 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1035.eqiad.wmnet |
[production] |
09:11 |
<hashar> |
Gerrit: reindexing all changes for 3.6 > 3.7 migration # T354885 |
[production] |
09:08 |
<hashar@deploy2002> |
Finished deploy [gerrit/gerrit@bdd1a8b]: Gerrit to version 3.7.6 (duration: 00m 10s) |
[production] |
09:08 |
<hashar@deploy2002> |
Started deploy [gerrit/gerrit@bdd1a8b]: Gerrit to version 3.7.6 |
[production] |
09:06 |
<hashar> |
Upgrading Gerrit # T354885 |
[production] |
09:05 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti1035.eqiad.wmnet |
[production] |
09:05 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2169:3317 (re)pooling @ 100%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55157 and previous config saved to /var/cache/conftool/dbconfig/20240122-090504-root.json |
[production] |
09:03 |
<cgoubert@cumin1002> |
conftool action : set/pooled=no; selector: name=mw2444.codfw.wmnet |
[production] |
09:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P55156 and previous config saved to /var/cache/conftool/dbconfig/20240122-090332-marostegui.json |
[production] |
09:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2169:3316 (re)pooling @ 100%: Upgrade to 10.6.16 and bookworm', diff saved to https://phabricator.wikimedia.org/P55155 and previous config saved to /var/cache/conftool/dbconfig/20240122-090218-root.json |
[production] |
09:01 |
<cgoubert@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2394.codfw.wmnet |
[production] |
09:01 |
<cgoubert@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for mw2394.codfw.wmnet |
[production] |