2023-03-09
|
18:24 |
<sukhe@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: authdns[1001,2001].wikimedia.org decommissioned, removing all IPs except the asset tag one - sukhe@cumin2002" |
[production] |
18:22 |
<sukhe> |
running puppet-agent on A:dns-auth to remove deprecated authdns[12]001 |
[production] |
18:22 |
<sukhe@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
18:21 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:15 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.decommission for hosts authdns[1001,2001].wikimedia.org |
[production] |
18:11 |
<bd808@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:10 |
<bd808@deploy2002> |
helmfile [codfw] START helmfile.d/services/developer-portal: apply |
[production] |
18:10 |
<bd808@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:10 |
<cmooney@cumin1001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
18:09 |
<bd808@deploy2002> |
helmfile [eqiad] START helmfile.d/services/developer-portal: apply |
[production] |
18:09 |
<bd808@deploy2002> |
helmfile [staging] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:09 |
<bd808@deploy2002> |
helmfile [staging] START helmfile.d/services/developer-portal: apply |
[production] |
18:08 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:08 |
<cmooney@cumin1001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
18:01 |
<cmooney@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
18:00 |
<sukhe> |
cr*-codfw [ns0]: set routing-options static route 208.80.154.238/32 next-hop 208.80.153.77: T330670 |
[production] |
17:53 |
<sukhe> |
cr*-codfw [ns1]: set routing-options static route 208.80.153.231/32 next-hop 208.80.153.77: T330670 |
[production] |
17:50 |
<zabe@deploy2002> |
Finished scap: Backport for [[gerrit:896030|Revert "TransformHandler: Load stashed page bundle based on ETag." (T331629)]] (duration: 11m 57s) |
[production] |
17:47 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
17:47 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2179 (T329260)', diff saved to https://phabricator.wikimedia.org/P45725 and previous config saved to /var/cache/conftool/dbconfig/20230309-174723-marostegui.json |
[production] |
17:47 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
17:42 |
<sukhe> |
[ns1] set routing-options static route 208.80.153.231/32 next-hop 208.80.154.10: T330670 |
[production] |
17:39 |
<zabe@deploy2002> |
zabe and ssastry: Backport for [[gerrit:896030|Revert "TransformHandler: Load stashed page bundle based on ETag." (T331629)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet |
[production] |
17:38 |
<zabe@deploy2002> |
Started scap: Backport for [[gerrit:896030|Revert "TransformHandler: Load stashed page bundle based on ETag." (T331629)]] |
[production] |
17:37 |
<sukhe> |
cr2-eqiad: set routing-options static route 208.80.154.238/32 next-hop 208.80.154.10: T330670 |
[production] |
17:37 |
<sukhe> |
cr1-eqiad: set routing-options static route 208.80.154.238/32 next-hop 208.80.154.10: T330670 |
[production] |
17:36 |
<sukhe> |
cr1-eqiad: set routing-options static route 208.80.154.238/32 next-hop 208.80.154.10 |
[production] |
17:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2179', diff saved to https://phabricator.wikimedia.org/P45724 and previous config saved to /var/cache/conftool/dbconfig/20230309-173217-marostegui.json |
[production] |
17:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2179', diff saved to https://phabricator.wikimedia.org/P45723 and previous config saved to /var/cache/conftool/dbconfig/20230309-171711-marostegui.json |
[production] |
17:13 |
<topranks> |
Add EBGP peering from cr1-codfw to cloudsw1-b1-codfw (prod links) T327919 |
[production] |
17:02 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2179 (T329260)', diff saved to https://phabricator.wikimedia.org/P45722 and previous config saved to /var/cache/conftool/dbconfig/20230309-170205-marostegui.json |
[production] |
16:55 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
16:52 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depooling db2179 (T329260)', diff saved to https://phabricator.wikimedia.org/P45721 and previous config saved to /var/cache/conftool/dbconfig/20230309-165210-marostegui.json |
[production] |
16:52 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2179.codfw.wmnet with reason: Maintenance |
[production] |
16:51 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2179.codfw.wmnet with reason: Maintenance |
[production] |
16:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2172 (T329260)', diff saved to https://phabricator.wikimedia.org/P45720 and previous config saved to /var/cache/conftool/dbconfig/20230309-165149-marostegui.json |
[production] |
16:51 |
<topranks> |
Add EBGP peering from cr1-codfw to cloudsw1-b1-codfw (cloud vrf) T327919 |
[production] |
16:50 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
16:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P45719 and previous config saved to /var/cache/conftool/dbconfig/20230309-163643-marostegui.json |
[production] |
16:27 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2165.codfw.wmnet with reason: Maintenance |
[production] |
16:26 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 12:00:00 on db2165.codfw.wmnet with reason: Maintenance |
[production] |
16:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2163 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P45718 and previous config saved to /var/cache/conftool/dbconfig/20230309-162608-root.json |
[production] |
16:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2172', diff saved to https://phabricator.wikimedia.org/P45717 and previous config saved to /var/cache/conftool/dbconfig/20230309-162137-marostegui.json |
[production] |
16:18 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host acmechief1001.eqiad.wmnet with OS bullseye |
[production] |
16:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2163 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P45716 and previous config saved to /var/cache/conftool/dbconfig/20230309-161103-root.json |
[production] |
16:09 |
<zabe@deploy2002> |
Finished scap: T308932 (duration: 07m 19s) |
[production] |
16:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2172 (T329260)', diff saved to https://phabricator.wikimedia.org/P45715 and previous config saved to /var/cache/conftool/dbconfig/20230309-160630-marostegui.json |
[production] |
16:04 |
<brett@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief1001.eqiad.wmnet with reason: host reimage |
[production] |
16:02 |
<marostegui> |
Restart mailman service T331626 |
[production] |
16:02 |
<zabe@deploy2002> |
Started scap: T308932 |
[production] |