2023-12-20
13:35 |
<aqu@deploy2002> |
Finished deploy [airflow-dags/analytics_product@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] (duration: 00m 09s) |
[production] |
13:35 |
<aqu@deploy2002> |
Started deploy [airflow-dags/analytics_product@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] |
[production] |
13:34 |
<aqu@deploy2002> |
Finished deploy [airflow-dags/analytics@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] (duration: 00m 05s) |
[production] |
13:34 |
<aqu@deploy2002> |
Started deploy [airflow-dags/analytics@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] |
[production] |
13:34 |
<aqu@deploy2002> |
Finished deploy [airflow-dags/analytics_test@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] (duration: 00m 11s) |
[production] |
13:34 |
<aqu@deploy2002> |
Started deploy [airflow-dags/analytics_test@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] |
[production] |
13:32 |
<aqu@deploy2002> |
Finished deploy [airflow-dags/analytics_test@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] (duration: 00m 01s) |
[production] |
13:32 |
<aqu@deploy2002> |
Started deploy [airflow-dags/analytics_test@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] |
[production] |
13:31 |
<aqu@deploy2002> |
Finished deploy [airflow-dags/analytics@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] (duration: 00m 01s) |
[production] |
13:31 |
<aqu@deploy2002> |
Started deploy [airflow-dags/analytics@d5ac513]: Make sure airflow-dags is up-to-date before activating metrics [airflow-dags@d5ac5131] |
[production] |
12:12 |
<aikochou@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
11:30 |
<kostajh> |
T353703 Manual run: /usr/local/bin/foreachwikiindblist /srv/mediawiki/dblists/mediamoderation.dblist extensions/MediaModeration/maintenance/updateMetrics.php --verbose |
[production] |
11:22 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_grid_node for tools-sgeexec-10-14, tools-sgeexec-10-15, tools-sgeweblight-10-18, tools-sgeweblight-10-24 |
[tools] |
10:22 |
<cgoubert@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on mw2448.codfw.wmnet with reason: hw failure |
[production] |
10:22 |
<cgoubert@cumin2002> |
START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on mw2448.codfw.wmnet with reason: hw failure |
[production] |
10:01 |
<taavi> |
rebooting tools-sgeweblight-10-18, -24, -25, to get rid of a large number of jobs in deleting status |
[tools] |
09:43 |
<fabfur@cumin1001> |
END (PASS) - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns (exit_code=0) rolling restart_daemons on A:wikidough and A:wikidough |
[production] |
09:39 |
<fabfur@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for doh5002.wikimedia.org |
[production] |
09:39 |
<fabfur@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for doh5002.wikimedia.org |
[production] |
09:10 |
<fabfur@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for doh2001.wikimedia.org |
[production] |
09:10 |
<fabfur@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for doh2001.wikimedia.org |
[production] |
08:47 |
<fabfur@cumin1001> |
START - Cookbook sre.dns.roll-restart-reboot-wikimedia-dns rolling restart_daemons on A:wikidough and A:wikidough |
[production] |
06:31 |
<ryankemper> |
T351671 Pooled `wdqs10[17-21]*`; data xfers completed and test queries are passing on `wdqs1018`. Will decom related hosts tomorrow (2023-12-20) |
[production] |
02:47 |
<rzl@deploy2002> |
helmfile [eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
02:45 |
<rzl@deploy2002> |
helmfile [eqiad] START helmfile.d/admin 'apply'. |
[production] |
02:44 |
<rzl@deploy2002> |
helmfile [codfw] DONE helmfile.d/admin 'apply'. |
[production] |
02:43 |
<rzl@deploy2002> |
helmfile [codfw] START helmfile.d/admin 'apply'. |
[production] |
02:43 |
<rzl@deploy2002> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
02:41 |
<rzl@deploy2002> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
02:39 |
<rzl@deploy2002> |
helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
02:37 |
<rzl@deploy2002> |
helmfile [staging-codfw] START helmfile.d/admin 'apply'. |
[production] |
02:08 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
02:08 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
00:34 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
00:34 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
00:27 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
00:27 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
00:25 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
00:23 |
<thcipriani> |
updating docker-pkg files on contint primary for https://gerrit.wikimedia.org/r/983892 |
[releng] |
00:03 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 22:00:00 on wdqs[1017-1021].eqiad.wmnet with reason: bringing new wdqs hosts online T351671 |
[production] |
00:02 |
<ryankemper@cumin1001> |
START - Cookbook sre.hosts.downtime for 22:00:00 on wdqs[1017-1021].eqiad.wmnet with reason: bringing new wdqs hosts online T351671 |
[production] |