2023-03-08
ยง
|
20:54 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2163.codfw.wmnet with reason: Maintenance |
[production] |
20:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2162 (T329260)', diff saved to https://phabricator.wikimedia.org/P45582 and previous config saved to /var/cache/conftool/dbconfig/20230308-205414-marostegui.json |
[production] |
20:51 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host acmechief2001.codfw.wmnet with OS bullseye |
[production] |
20:41 |
<mutante> |
deploy2002 - systemctl restart keyholder-proxy.service to fix T331568 - after this SSH_AUTH_SOCK=/run/keyholder/proxy.sock ssh -i /etc/keyholder.d/deploy_jenkins -l deploy-jenkins releases1002.eqiad.wmnet works |
[production] |
20:39 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage |
[production] |
20:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P45581 and previous config saved to /var/cache/conftool/dbconfig/20230308-203907-marostegui.json |
[production] |
20:36 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage |
[production] |
20:24 |
<brett@cumin2002> |
START - Cookbook sre.ganeti.reimage for host acmechief2001.codfw.wmnet with OS bullseye |
[production] |
20:24 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P45580 and previous config saved to /var/cache/conftool/dbconfig/20230308-202401-marostegui.json |
[production] |
20:18 |
<urandom> |
power cycle restbase2022 (unresponsive; cannot SSH) |
[production] |
20:08 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2162 (T329260)', diff saved to https://phabricator.wikimedia.org/P45579 and previous config saved to /var/cache/conftool/dbconfig/20230308-200855-marostegui.json |
[production] |
20:01 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host acmechief-test1001.eqiad.wmnet with OS bullseye |
[production] |
19:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db2162 (T329260)', diff saved to https://phabricator.wikimedia.org/P45578 and previous config saved to /var/cache/conftool/dbconfig/20230308-194646-marostegui.json |
[production] |
19:46 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2162.codfw.wmnet with reason: Maintenance |
[production] |
19:46 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2162.codfw.wmnet with reason: Maintenance |
[production] |
19:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2161 (T329260)', diff saved to https://phabricator.wikimedia.org/P45577 and previous config saved to /var/cache/conftool/dbconfig/20230308-194625-marostegui.json |
[production] |
19:44 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief-test1001.eqiad.wmnet with reason: host reimage |
[production] |
19:41 |
<brett@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on acmechief-test1001.eqiad.wmnet with reason: host reimage |
[production] |
19:31 |
<brett@cumin2002> |
START - Cookbook sre.ganeti.reimage for host acmechief-test1001.eqiad.wmnet with OS bullseye |
[production] |
19:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2161', diff saved to https://phabricator.wikimedia.org/P45576 and previous config saved to /var/cache/conftool/dbconfig/20230308-193118-marostegui.json |
[production] |
19:16 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2161', diff saved to https://phabricator.wikimedia.org/P45575 and previous config saved to /var/cache/conftool/dbconfig/20230308-191612-marostegui.json |
[production] |
19:16 |
<jhuneidi@deploy2002> |
Synchronized php: group1 wikis to 1.40.0-wmf.26 refs T330204 (duration: 06m 16s) |
[production] |
19:14 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:14 |
<cmooney@cumin1001> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add reverse entries for new links from CRs to cloudsw1-b1-codfw. - cmooney@cumin1001" |
[production] |
19:13 |
<cmooney@cumin1001> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add reverse entries for new links from CRs to cloudsw1-b1-codfw. - cmooney@cumin1001" |
[production] |
19:09 |
<jhuneidi@deploy2002> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.40.0-wmf.26 refs T330204 |
[production] |
19:09 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for acmechief-test2001.codfw.wmnet |
[production] |
19:09 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.remove-downtime for acmechief-test2001.codfw.wmnet |
[production] |
19:08 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2161 (T329260)', diff saved to https://phabricator.wikimedia.org/P45574 and previous config saved to /var/cache/conftool/dbconfig/20230308-190106-marostegui.json |
[production] |
18:43 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2176 (T328817)', diff saved to https://phabricator.wikimedia.org/P45573 and previous config saved to /var/cache/conftool/dbconfig/20230308-184328-marostegui.json |
[production] |
18:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db2161 (T329260)', diff saved to https://phabricator.wikimedia.org/P45572 and previous config saved to /var/cache/conftool/dbconfig/20230308-184204-marostegui.json |
[production] |
18:41 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2161.codfw.wmnet with reason: Maintenance |
[production] |
18:41 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2161.codfw.wmnet with reason: Maintenance |
[production] |
18:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2154 (T329260)', diff saved to https://phabricator.wikimedia.org/P45571 and previous config saved to /var/cache/conftool/dbconfig/20230308-184143-marostegui.json |
[production] |
18:36 |
<hnowlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
18:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1109 (T318605)', diff saved to https://phabricator.wikimedia.org/P45570 and previous config saved to /var/cache/conftool/dbconfig/20230308-183020-ladsgroup.json |
[production] |
18:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P45569 and previous config saved to /var/cache/conftool/dbconfig/20230308-182822-marostegui.json |
[production] |
18:28 |
<inflatador> |
bking@cumin2002 repool elastic1060-1066 to finish off T322082 |
[production] |
18:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2179 (T329203)', diff saved to https://phabricator.wikimedia.org/P45568 and previous config saved to /var/cache/conftool/dbconfig/20230308-182726-marostegui.json |
[production] |
18:27 |
<inflatador> |
bking@cumin2002 unban elastic1060-1066 to finish off T322082 |
[production] |
18:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2154', diff saved to https://phabricator.wikimedia.org/P45567 and previous config saved to /var/cache/conftool/dbconfig/20230308-182637-marostegui.json |
[production] |
18:26 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
18:20 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
18:19 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "update locatoin of elastic1064-65 - bking@cumin2002 - T322082" |
[production] |
18:18 |
<bking@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "update locatoin of elastic1064-65 - bking@cumin2002 - T322082" |
[production] |
18:16 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
18:16 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host acmechief-test2001.codfw.wmnet with OS bullseye |
[production] |
18:15 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1109', diff saved to https://phabricator.wikimedia.org/P45566 and previous config saved to /var/cache/conftool/dbconfig/20230308-181514-ladsgroup.json |
[production] |
18:14 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |