2022-08-18
22:33 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
22:32 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
22:32 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
22:32 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
22:31 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
22:31 |
<dancy@deploy1002> |
rebuilt and synchronized wikiversions files: group2 wikis to 1.39.0-wmf.23 refs T314186 |
[production] |
22:25 |
<dancy> |
Rolling the train back to group1 due to T315620 |
[production] |
22:25 |
<xcollazo@deploy1002> |
Finished deploy [airflow-dags/platform_eng@ff0a0e2]: (no justification provided) (duration: 00m 19s) |
[production] |
22:24 |
<xcollazo@deploy1002> |
Started deploy [airflow-dags/platform_eng@ff0a0e2]: (no justification provided) |
[production] |
22:16 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kubernetes2024.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
22:09 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.provision for host kubernetes2024.mgmt.codfw.wmnet with reboot policy FORCED |
[production] |
22:05 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
22:02 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
22:02 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
21:50 |
<pt1979@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
21:48 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host kubernetes2024 |
[production] |
21:47 |
<pt1979@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host kubernetes2024 |
[production] |
21:20 |
<brennen> |
end of UTC late backport and config window |
[production] |
21:20 |
<brennen@deploy1002> |
Finished scap: [[gerrit:824433|Set initial-zoom via JavaScript to avoid font-scaling issue in iPad (T311795)]] (duration: 10m 16s) |
[production] |
21:15 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:14 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
21:14 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
21:14 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on relforge[1003-1004].eqiad.wmnet with reason: elastic 7 upgrade |
[production] |
21:14 |
<bking@cumin2002> |
START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on relforge[1003-1004].eqiad.wmnet with reason: elastic 7 upgrade |
[production] |
21:13 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
21:09 |
<brennen@deploy1002> |
Started scap: [[gerrit:824433|Set initial-zoom via JavaScript to avoid font-scaling issue in iPad (T311795)]] |
[production] |
21:03 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-stretch2002.codfw.wmnet with OS bullseye |
[production] |
20:40 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
20:40 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:40 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host kafka-stretch2002.codfw.wmnet with OS bullseye |
[production] |
20:39 |
<brennen@deploy1002> |
Finished scap: [[gerrit:816239|Allow admin to grant/revoke "transwiki" group on zh(wikt|wb|wq|ws) (T313657)]] (duration: 07m 09s) |
[production] |
20:39 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
20:37 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-stretch2002.codfw.wmnet with OS bullseye |
[production] |
20:32 |
<brennen@deploy1002> |
Started scap: [[gerrit:816239|Allow admin to grant/revoke "transwiki" group on zh(wikt|wb|wq|ws) (T313657)]] |
[production] |
20:29 |
<brennen@deploy1002> |
Finished scap: [[gerrit:824395|Deploy partial action blocks to cswiki (T315525)]] (duration: 19m 16s) |
[production] |
20:20 |
<robh@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host dumpsdata1006.eqiad.wmnet with OS bullseye |
[production] |
20:19 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:18 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
20:18 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
20:17 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
20:10 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host kafka-stretch2002.codfw.wmnet with OS bullseye |
[production] |
20:09 |
<brennen@deploy1002> |
Started scap: [[gerrit:824395|Deploy partial action blocks to cswiki (T315525)]] |
[production] |
20:00 |
<robh@cumin1001> |
START - Cookbook sre.hosts.reimage for host dumpsdata1006.eqiad.wmnet with OS bullseye |
[production] |
19:57 |
<ottomata> |
re-enable puppet on an-master* |
[production] |
19:47 |
<ottomata> |
temporarily disable puppet on an-master100* while applying change in test cluster - T312858 |
[production] |
19:34 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dumpsdata1007.eqiad.wmnet with OS bullseye |
[production] |
19:19 |
<robh@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dumpsdata1007.eqiad.wmnet with reason: host reimage |
[production] |
19:16 |
<robh@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dumpsdata1007.eqiad.wmnet with reason: host reimage |
[production] |
19:10 |
<cmooney@cumin1001> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
19:00 |
<robh@cumin1001> |
START - Cookbook sre.hosts.reimage for host dumpsdata1007.eqiad.wmnet with OS bullseye |
[production] |