2023-05-31
ยง
|
17:10 |
<brett> |
Maglev LVS scheduler rollout in codfw finished - T263797 |
[production] |
17:10 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.cdn.run-puppet-restart-varnish (exit_code=0) rolling custom on P{cp[2037,2039,2041].codfw.wmnet} and A:cp |
[production] |
16:59 |
<brett@deploy1002> |
Locking from deployment [ALL REPOSITORIES]: LVS maintenance in codfw, blocking deploys T322937 |
[production] |
16:37 |
<mfossati@deploy1002> |
Finished deploy [airflow-dags/platform_eng@5379d83]: (no justification provided) (duration: 00m 34s) |
[production] |
16:37 |
<mfossati@deploy1002> |
Started deploy [airflow-dags/platform_eng@5379d83]: (no justification provided) |
[production] |
16:22 |
<elukey> |
`systemctl reset-failed session-c6111.scope session-c7230.scope` on stat1005 to clear old alerts |
[production] |
16:20 |
<vgutierrez@cumin1001> |
START - Cookbook sre.cdn.run-puppet-restart-varnish rolling custom on P{cp[2037,2039,2041].codfw.wmnet} and A:cp |
[production] |
16:13 |
<vgutierrez> |
repool cp2035 - T337247 T323557 |
[production] |
16:12 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp2035.codfw.wmnet |
[production] |
16:12 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for cp2035.codfw.wmnet |
[production] |
16:10 |
<otto@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
16:10 |
<otto@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
16:08 |
<otto@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
16:08 |
<otto@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
16:04 |
<otto@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
16:04 |
<otto@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
15:51 |
<Emperor> |
swift delete virtual machines from "swift" WMCS project |
[production] |
15:51 |
<brett@deploy1002> |
Locking from deployment [ALL REPOSITORIES]: LVS maintenance in codfw, blocking deploys T322937 |
[production] |
15:50 |
<brett@deploy1002> |
Unlocked for deployment [ALL REPOSITORIES]: LVS maintenance in eqiad, blocking deploys T322937 (duration: 02m 24s) |
[production] |
15:48 |
<brett@deploy1002> |
Locking from deployment [ALL REPOSITORIES]: LVS maintenance in eqiad, blocking deploys T322937 |
[production] |
15:47 |
<Emperor> |
delete virtual machines from "swift" WMCS project |
[production] |
15:45 |
<vgutierrez> |
cp2035 depooled as puppet is unable to run due to ipmi issues - T337247 |
[production] |
15:42 |
<brett> |
Maglev LVS scheduler rollout began IN PROGRESS, not finished - T263797 |
[production] |
15:42 |
<brett> |
Maglev LVS scheduler rollout finished in codfw - T263797 |
[production] |
15:40 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on cp2035.codfw.wmnet with reason: ipmi/mgmt console issues |
[production] |
15:40 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on cp2035.codfw.wmnet with reason: ipmi/mgmt console issues |
[production] |
15:39 |
<vgutierrez@cumin1001> |
END (FAIL) - Cookbook sre.cdn.run-puppet-restart-varnish (exit_code=1) rolling custom on A:cp-text_codfw |
[production] |
14:55 |
<klausman@deploy1002> |
helmfile [staging] DONE helmfile.d/services/api-gateway: apply |
[production] |
14:55 |
<klausman@deploy1002> |
helmfile [staging] START helmfile.d/services/api-gateway: apply |
[production] |
14:54 |
<klausman@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply |
[production] |
14:54 |
<klausman@deploy1002> |
helmfile [eqiad] START helmfile.d/services/api-gateway: apply |
[production] |
14:50 |
<klausman@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/api-gateway: apply |
[production] |
14:50 |
<klausman@deploy1002> |
helmfile [codfw] START helmfile.d/services/api-gateway: apply |
[production] |
14:44 |
<bking@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:43 |
<bking@deploy1002> |
helmfile [eqiad] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:41 |
<klausman@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
14:27 |
<bking@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:27 |
<bking@deploy1002> |
helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:25 |
<bking@deploy1002> |
helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:25 |
<bking@deploy1002> |
helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:25 |
<bking@deploy1002> |
helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply |
[production] |
14:14 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:924941|NewImpact: Cache empty user impact on account creation (T337320)]], [[gerrit:924940|Personalized praise: Fix first-ever notifications (T322452)]] (duration: 07m 26s) |
[production] |
14:08 |
<urbanecm@deploy1002> |
urbanecm: Backport for [[gerrit:924941|NewImpact: Cache empty user impact on account creation (T337320)]], [[gerrit:924940|Personalized praise: Fix first-ever notifications (T322452)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
14:07 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:924941|NewImpact: Cache empty user impact on account creation (T337320)]], [[gerrit:924940|Personalized praise: Fix first-ever notifications (T322452)]] |
[production] |
14:02 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:924939|Personalized praise: Fix first-ever notifications (T322452)]], [[gerrit:924576|DeleteAction: Replace remaining OOUI fields (T337809)]] (duration: 11m 11s) |
[production] |
14:02 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on P{lvs1019*,lvs2009*} and A:lvs (T329049) |
[production] |
13:58 |
<otto@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
13:57 |
<otto@deploy1002> |
helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
13:56 |
<jayme@deploy1002> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
13:56 |
<jayme@deploy1002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |