2024-05-05
§
|
11:09 |
<brennen@deploy1002> |
Finished deploy [phabricator/deployment@dd53761]: test deploy phab1004 for T364271 (duration: 00m 32s) |
[production] |
11:08 |
<brennen@deploy1002> |
Started deploy [phabricator/deployment@dd53761]: test deploy phab1004 for T364271 |
[production] |
11:08 |
<brennen@deploy1002> |
Finished deploy [phabricator/deployment@dd53761]: test deploy phab2002 for T364271 (duration: 00m 32s) |
[production] |
11:07 |
<brennen@deploy1002> |
Started deploy [phabricator/deployment@dd53761]: test deploy phab2002 for T364271 |
[production] |
11:04 |
<taavi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab.wmfusercontent.org with reason: brennen is deploying things |
[production] |
11:03 |
<taavi@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on phab.wmfusercontent.org with reason: brennen is deploying things |
[production] |
11:03 |
<taavi@cumin1002> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1:00:00 on phabricator.wikimedia.org with reason: brennen is deploying things |
[production] |
11:03 |
<taavi@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on phabricator.wikimedia.org with reason: brennen is deploying things |
[production] |
11:03 |
<taavi@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on phab1004.eqiad.wmnet with reason: brennen is deploying things |
[production] |
11:03 |
<taavi@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on phab1004.eqiad.wmnet with reason: brennen is deploying things |
[production] |
08:42 |
<taavi> |
taavi@gerrit1003 ~ $ sudo systemctl restart apache2 |
[production] |
2024-05-03
§
|
21:38 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6 days, 0:00:00 on wdqs2023.codfw.wmnet with reason: T362920 |
[production] |
21:38 |
<ryankemper@cumin2002> |
START - Cookbook sre.hosts.downtime for 6 days, 0:00:00 on wdqs2023.codfw.wmnet with reason: T362920 |
[production] |
21:27 |
<ryankemper> |
T362920 [wdqs] Depooled `wdqs2023` in preparation to switch it to a graph split host |
[production] |
19:02 |
<sukhe> |
cleaning up stale confd template files for magru related reimaging |
[production] |
18:44 |
<brett@cumin2002> |
conftool action : set/pooled=yes; selector: name=ncredir7002.magru.wmnet,service=nginx |
[production] |
18:43 |
<brett@cumin2002> |
conftool action : set/pooled=yes; selector: name=ncredir7001.magru.wmnet,service=nginx |
[production] |
18:38 |
<brett@cumin2002> |
conftool action : set/pooled=no; selector: name=ncredir7001.magru.wmnet,service=nginx |
[production] |
18:38 |
<brett@cumin2002> |
conftool action : set/pooled=no; selector: name=ncredir7002.magru.wmnet,service=nginx |
[production] |
18:29 |
<brett@cumin2002> |
conftool action : set/pooled=yes; selector: name=ncredir7002.magru.wmnet,service=nginx |
[production] |
18:29 |
<brett@cumin2002> |
conftool action : set/weight=1; selector: name=ncredir7002.magru.wmnet,service=nginx |
[production] |
18:29 |
<brett@cumin2002> |
conftool action : set/pooled=yes; selector: name=ncredir7001.magru.wmnet,service=nginx |
[production] |
18:28 |
<brett@cumin2002> |
conftool action : set/weight=1; selector: name=ncredir7001.magru.wmnet,service=nginx |
[production] |
17:45 |
<dcausse> |
repooling wdqs1012 |
[production] |
17:27 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2200.codfw.wmnet with reason: Maintenance |
[production] |
17:27 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2200.codfw.wmnet with reason: Maintenance |
[production] |
17:14 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host ncredir7002.magru.wmnet |
[production] |
17:14 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir7002.magru.wmnet with OS bookworm |
[production] |
17:13 |
<denisse> |
Run `sudo mdadm --add /dev/md1 /dev/sdg` on `centrallog1002` - T363660 |
[production] |
17:01 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
17:00 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
17:00 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195 (T361627)', diff saved to https://phabricator.wikimedia.org/P61862 and previous config saved to /var/cache/conftool/dbconfig/20240503-170054-marostegui.json |
[production] |
16:47 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir7002.magru.wmnet with reason: host reimage |
[production] |
16:45 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61860 and previous config saved to /var/cache/conftool/dbconfig/20240503-164546-marostegui.json |
[production] |
16:44 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir7002.magru.wmnet with reason: host reimage |
[production] |
16:30 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195', diff saved to https://phabricator.wikimedia.org/P61859 and previous config saved to /var/cache/conftool/dbconfig/20240503-163039-marostegui.json |
[production] |
16:18 |
<brett@cumin2002> |
START - Cookbook sre.hosts.reimage for host ncredir7002.magru.wmnet with OS bookworm |
[production] |
16:15 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2195 (T361627)', diff saved to https://phabricator.wikimedia.org/P61858 and previous config saved to /var/cache/conftool/dbconfig/20240503-161531-marostegui.json |
[production] |
15:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db2195 (T361627)', diff saved to https://phabricator.wikimedia.org/P61857 and previous config saved to /var/cache/conftool/dbconfig/20240503-155432-marostegui.json |
[production] |
15:54 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2195.codfw.wmnet with reason: Maintenance |
[production] |