2023-07-17
§
|
09:42 |
<btullis@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host kafka-stretch1001.eqiad.wmnet |
[production] |
09:42 |
<btullis@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host kafka-stretch1001.eqiad.wmnet |
[production] |
09:39 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ms-be1047.eqiad.wmnet |
[production] |
09:38 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be1046.eqiad.wmnet |
[production] |
09:38 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ms-be2047.codfw.wmnet |
[production] |
09:35 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2046.codfw.wmnet |
[production] |
09:30 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ms-be1046.eqiad.wmnet |
[production] |
09:29 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be1045.eqiad.wmnet |
[production] |
09:27 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ms-be2046.codfw.wmnet |
[production] |
09:26 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2045.codfw.wmnet |
[production] |
09:22 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ms-be1045.eqiad.wmnet |
[production] |
09:19 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be1044.eqiad.wmnet |
[production] |
09:18 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
09:18 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ms-be2045.codfw.wmnet |
[production] |
09:18 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. |
[production] |
09:17 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
09:17 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |
09:17 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ms-be2044.codfw.wmnet |
[production] |
09:02 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ms-be2044.codfw.wmnet |
[production] |
09:01 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host ms-be1044.eqiad.wmnet |
[production] |
08:51 |
<fabfur> |
enable puppet on A:cp-eqsin to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/938002 (T340983) |
[production] |
08:37 |
<fabfur> |
enable puppet on cp5024 and cp5032 to deploy 938002 |
[production] |
08:30 |
<fabfur> |
disable puppet on all cp* hosts in eqsin to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/938002 (T340983) |
[production] |
04:33 |
<hashar@deploy1002> |
Finished deploy [gerrit/gerrit@1153a16]: wm-checks-api: check undefined real_author (2) - T328484 (duration: 00m 08s) |
[production] |
04:33 |
<hashar@deploy1002> |
Started deploy [gerrit/gerrit@1153a16]: wm-checks-api: check undefined real_author (2) - T328484 |
[production] |
04:08 |
<hashar@deploy1002> |
Finished deploy [gerrit/gerrit@cad3002]: wm-checks-api: check undefined real_author - T328484 (duration: 00m 08s) |
[production] |
04:08 |
<hashar@deploy1002> |
Started deploy [gerrit/gerrit@cad3002]: wm-checks-api: check undefined real_author - T328484 |
[production] |
2023-07-14
§
|
19:57 |
<jforrester@deploy1002> |
helmfile [staging] DONE helmfile.d/services/wikifunctions: apply |
[production] |
19:56 |
<jforrester@deploy1002> |
helmfile [staging] START helmfile.d/services/wikifunctions: apply |
[production] |
19:55 |
<jforrester@deploy1002> |
helmfile [staging] DONE helmfile.d/services/wikifunctions: apply |
[production] |
19:55 |
<jforrester@deploy1002> |
helmfile [staging] START helmfile.d/services/wikifunctions: apply |
[production] |
19:39 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host an-worker1153.eqiad.wmnet with OS bullseye |
[production] |
19:19 |
<xcollazo@deploy1002> |
Finished deploy [airflow-dags/analytics@37d3ad6]: Run page_content_change_to_wikitext_raw DAG serially. T335860 (duration: 00m 14s) |
[production] |
19:19 |
<xcollazo@deploy1002> |
Started deploy [airflow-dags/analytics@37d3ad6]: Run page_content_change_to_wikitext_raw DAG serially. T335860 |
[production] |
18:42 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host an-worker1153.eqiad.wmnet with OS bullseye |
[production] |
16:05 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
16:04 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
14:25 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . |
[production] |
14:22 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . |
[production] |
10:43 |
<jelto@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
10:42 |
<jelto@deploy1002> |
helmfile [eqiad] START helmfile.d/services/miscweb: apply |
[production] |
10:41 |
<jelto@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
10:40 |
<jelto@deploy1002> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
10:38 |
<jelto@deploy1002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
10:38 |
<jelto@deploy1002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
09:02 |
<klausman@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=ores2003.codfw.wmnet |
[production] |
09:02 |
<klausman> |
Setting ores2003 to pooled=inactive wheile we attempt repairs/decide on decom |
[production] |
08:51 |
<oblivian@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mw-web: apply |
[production] |