2021-01-28
|
16:49 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host deploy1002.eqiad.wmnet |
[production] |
16:49 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
16:49 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
16:49 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host deploy2002.codfw.wmnet |
[production] |
16:48 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'staging' . |
[production] |
16:48 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'cxserver' for release 'production' . |
[production] |
16:45 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=99) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:44 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:44 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:44 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:41 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:41 |
<arturo> |
running homer on cr*-eqiad* again for reverting latest changes (T271476) |
[production] |
16:39 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host deploy2002.codfw.wmnet |
[production] |
16:28 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
16:28 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'staging' . |
[production] |
16:28 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'apertium' for release 'plain' . |
[production] |
16:26 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
16:25 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'plain' . |
[production] |
16:25 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'apertium' for release 'staging' . |
[production] |
16:24 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:24 |
<akosiaris> |
stop scraping apertium from prometheus, it doesn't have a prometheus endpoint. |
[production] |
16:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'production' . |
[production] |
16:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'plain' . |
[production] |
16:23 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'apertium' for release 'staging' . |
[production] |
16:19 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:17 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:06 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
16:03 |
<arturo> |
running homer on cr*-eqiad* for T271476 |
[production] |
15:55 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
15:54 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.stop-cluster (exit_code=0) for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 |
[production] |
15:52 |
<cdanis> |
draining traffic from Zayo OGYX/120003 codfw<>eqiad in preparation for decommission 🥃 T272675 |
[production] |
15:49 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.stop-cluster for Hadoop test cluster: Stop the Hadoop cluster before maintenance. - elukey@cumin1001 |
[production] |
15:49 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@d0a6933]: align threshold path references across days (duration: 01m 15s) |
[production] |
15:49 |
<marostegui> |
Power off clouddb1019 for memory replacement T272125 |
[production] |
15:48 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@d0a6933]: align threshold path references across days |
[production] |
15:25 |
<otto@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Migrate NavigationTiming schemas to Event Platform on all wikis - T271208 (duration: 01m 11s) |
[production] |
15:06 |
<jayme@deploy1001> |
helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
15:05 |
<jayme@deploy1001> |
helmfile [staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
14:26 |
<jayme@deploy1001> |
helmfile [staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
14:14 |
<jayme@deploy1001> |
helmfile [staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
14:14 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1148 after kernel upgrade and enablement of report_host', diff saved to https://phabricator.wikimedia.org/P14039 and previous config saved to /var/cache/conftool/dbconfig/20210128-141425-marostegui.json |
[production] |
13:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1148 for kernel upgrade and enablement of report_host', diff saved to https://phabricator.wikimedia.org/P14038 and previous config saved to /var/cache/conftool/dbconfig/20210128-135730-marostegui.json |
[production] |
13:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3314 (re)pooling @ 100%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14037 and previous config saved to /var/cache/conftool/dbconfig/20210128-135612-root.json |
[production] |
13:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3312 (re)pooling @ 100%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14036 and previous config saved to /var/cache/conftool/dbconfig/20210128-135602-root.json |
[production] |
13:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3314 (re)pooling @ 75%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14035 and previous config saved to /var/cache/conftool/dbconfig/20210128-134109-root.json |
[production] |
13:40 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3312 (re)pooling @ 75%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14034 and previous config saved to /var/cache/conftool/dbconfig/20210128-134057-root.json |
[production] |
13:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3314 (re)pooling @ 50%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14033 and previous config saved to /var/cache/conftool/dbconfig/20210128-132605-root.json |
[production] |
13:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3312 (re)pooling @ 50%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14032 and previous config saved to /var/cache/conftool/dbconfig/20210128-132553-root.json |
[production] |
13:17 |
<godog> |
swift codfw-prod decrease HDD weight for ms-be20[16-27] - T272837 |
[production] |
13:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3314 (re)pooling @ 25%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14031 and previous config saved to /var/cache/conftool/dbconfig/20210128-131101-root.json |
[production] |