2024-02-22
ยง
|
17:54 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1484.eqiad.wmnet with reason: host reimage |
[production] |
17:54 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1485.eqiad.wmnet with reason: host reimage |
[production] |
17:54 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1468.eqiad.wmnet with reason: host reimage |
[production] |
17:52 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1467.eqiad.wmnet with reason: host reimage |
[production] |
17:52 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1458.eqiad.wmnet with reason: host reimage |
[production] |
17:51 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw2384.codfw.wmnet with OS bullseye |
[production] |
17:45 |
<cdanis@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
17:44 |
<cdanis@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
17:44 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2152 (T357189)', diff saved to https://phabricator.wikimedia.org/P57751 and previous config saved to /var/cache/conftool/dbconfig/20240222-174449-arnaudb.json |
[production] |
17:44 |
<cdanis@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
17:43 |
<cdanis@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
17:43 |
<cdanis@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
17:43 |
<cdanis@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply |
[production] |
17:43 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db2152 (T357189)', diff saved to https://phabricator.wikimedia.org/P57750 and previous config saved to /var/cache/conftool/dbconfig/20240222-174328-arnaudb.json |
[production] |
17:43 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2152.codfw.wmnet with reason: Maintenance |
[production] |
17:43 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
17:43 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2152.codfw.wmnet with reason: Maintenance |
[production] |
17:42 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2100.codfw.wmnet with reason: Maintenance |
[production] |
17:42 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2100.codfw.wmnet with reason: Maintenance |
[production] |
17:42 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2098.codfw.wmnet with reason: Maintenance |
[production] |
17:42 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2098.codfw.wmnet with reason: Maintenance |
[production] |
17:42 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
17:42 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1494.eqiad.wmnet with OS bullseye |
[production] |
17:41 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1009.eqiad.wmnet with reason: Maintenance |
[production] |
17:41 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1226 (T357189)', diff saved to https://phabricator.wikimedia.org/P57749 and previous config saved to /var/cache/conftool/dbconfig/20240222-174138-arnaudb.json |
[production] |
17:41 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1485.eqiad.wmnet with OS bullseye |
[production] |
17:41 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1484.eqiad.wmnet with OS bullseye |
[production] |
17:41 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1483.eqiad.wmnet with OS bullseye |
[production] |
17:41 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1468.eqiad.wmnet with OS bullseye |
[production] |
17:40 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
17:39 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1467.eqiad.wmnet with OS bullseye |
[production] |
17:39 |
<hnowlan@cumin1002> |
START - Cookbook sre.hosts.reimage for host mw1458.eqiad.wmnet with OS bullseye |
[production] |
17:39 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
17:36 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply |
[production] |
17:35 |
<cmooney@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host testvm2002.codfw.wmnet with OS bullseye |
[production] |
17:26 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P57748 and previous config saved to /var/cache/conftool/dbconfig/20240222-172632-arnaudb.json |
[production] |
17:11 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1226', diff saved to https://phabricator.wikimedia.org/P57747 and previous config saved to /var/cache/conftool/dbconfig/20240222-171125-arnaudb.json |
[production] |
17:05 |
<topranks> |
disabling IPv6 RAs for private1-a-codfw vlan on codfw core routers T355544 |
[production] |
16:58 |
<cmooney@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Remove legacy codfw vc switches from synced hiera data after netbox status change - cmooney@cumin1002 - T355544" |
[production] |
16:57 |
<cmooney@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Remove legacy codfw vc switches from synced hiera data after netbox status change - cmooney@cumin1002 - T355544" |
[production] |
16:56 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1226 (T357189)', diff saved to https://phabricator.wikimedia.org/P57746 and previous config saved to /var/cache/conftool/dbconfig/20240222-165619-arnaudb.json |
[production] |
16:56 |
<topranks> |
disabling link from asw-a-codfw vc to ssw1-a1-codfw and ssw1-a8-codfw T355544 |
[production] |
16:54 |
<dancy@deploy2002> |
Finished scap: testing T357402 again (duration: 08m 58s) |
[production] |
16:54 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db1226 (T357189)', diff saved to https://phabricator.wikimedia.org/P57745 and previous config saved to /var/cache/conftool/dbconfig/20240222-165401-arnaudb.json |
[production] |
16:53 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1226.eqiad.wmnet with reason: Maintenance |
[production] |
16:53 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1226.eqiad.wmnet with reason: Maintenance |
[production] |
16:53 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
16:53 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
16:53 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1214 (T357189)', diff saved to https://phabricator.wikimedia.org/P57744 and previous config saved to /var/cache/conftool/dbconfig/20240222-165312-arnaudb.json |
[production] |
16:45 |
<dancy@deploy2002> |
Started scap: testing T357402 again |
[production] |