2020-05-07
§
|
07:44 |
<godog> |
further decrease weight for ms-be10[678] - T252008 |
[production] |
07:43 |
<elukey> |
kill application_1583418280867_333560 after a chat with David, the job is consuming ~2TB of RAM |
[analytics] |
07:32 |
<elukey> |
re-run mediawiki history load |
[analytics] |
07:18 |
<elukey> |
execute yarn application -movetoqueue application_1583418280867_332862 -queue root.nice |
[analytics] |
07:06 |
<elukey> |
restart mediawiki-history-load via hue |
[analytics] |
06:41 |
<elukey> |
restart oozie on an-coord1001 |
[analytics] |
05:49 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
05:46 |
<elukey> |
re-run mediarequest-hourly-wf-2020-5-6-19 |
[analytics] |
05:45 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:35 |
<elukey> |
re-run two failed hours for webrequest load text (07/05T05) and upload (06/05T23) |
[analytics] |
05:33 |
<elukey> |
restart hadoop yarn nodemanager on analytics1071 |
[analytics] |
05:33 |
<elukey> |
restart hadoop yarn nodemanager on analytics1071 |
[production] |
05:22 |
<marostegui> |
Reimage db2078 |
[production] |
05:04 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set s3 and s7 as read-only=off for maintenance T251158', diff saved to https://phabricator.wikimedia.org/P11167 and previous config saved to /var/cache/conftool/dbconfig/20200507-050419-marostegui.json |
[production] |
05:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set s3 and s7 as read-only for maintenance T251158', diff saved to https://phabricator.wikimedia.org/P11166 and previous config saved to /var/cache/conftool/dbconfig/20200507-050046-marostegui.json |
[production] |
02:56 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: Revert group1 wikis to 1.35.0-wmf.30 for T252079 |
[production] |
02:55 |
<brennen> |
reverting group1 to 1.35.0-wmf.30 for T252079 |
[production] |
00:12 |
<ryankemper@cumin1001> |
END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) |
[production] |
2020-05-06
§
|
23:59 |
<catrope@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Disable GrowthExperiments guidance on testwiki (duration: 01m 07s) |
[production] |
23:18 |
<catrope@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Enable password-reset-update on Wikipedias (T245791) (duration: 01m 07s) |
[production] |
22:22 |
<brennen@deploy1001> |
Synchronized php-1.35.0-wmf.31/includes/revisionlist/RevisionItem.php: [[gerrit:594803|RevisionItem: Fix providing timestamp in getRevisionLink ]] (duration: 01m 09s) |
[production] |
21:45 |
<andrewbogott> |
updating puppet compiler facts |
[production] |
21:20 |
<bd808> |
Kubectl delete node tools-k8s-worker-[16-20] (T248702) |
[tools] |
21:07 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
21:05 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
21:04 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
20:53 |
<James_F> |
Created integration-cumin-01 instance in WMCS based on stretch for final part of T236576 |
[releng] |
20:35 |
<ejegg> |
updated Fundraising CiviCRM from b15b2cfbb5 to cfb6101e39 |
[production] |
20:20 |
<hashar> |
Running jjb for all Jenkins jobs to drop ansicolor definition (now globally enabled) https://gerrit.wikimedia.org/r/#/c/integration/config/+/594716/ # T233688 |
[releng] |
19:11 |
<bstorm_> |
updated toollabs-webservice to 0.69 for toolsbeta |
[toolsbeta] |
19:08 |
<brennen@deploy1001> |
Synchronized php: group1 wikis to 1.35.0-wmf.31 (duration: 01m 08s) |
[production] |
19:07 |
<brennen@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.35.0-wmf.31 |
[production] |
19:03 |
<brennen> |
CORRECTION: 1.35.0-wmf.31 train unblocked (T249963), rolling forward to group1 |
[production] |
19:03 |
<brennen> |
1.35.0-wmf.31 train unblocked (T249963), rolling forward to group0 |
[production] |
18:58 |
<twentyafterfour@deploy1001> |
Synchronized php-1.35.0-wmf.31/includes/specials/pagers/DeletedContribsPager.php: deploy https://gerrit.wikimedia.org/r/#/c/mediawiki/core/+/594778/ fixes UBN T252052 (duration: 01m 09s) |
[production] |
18:54 |
<volans> |
upgraded spicerack to spicerack_0.0.34-1_amd64.deb on cumin[12]001 |
[production] |
18:45 |
<volans> |
uploaded spicerack_0.0.34-1_amd64.deb to apt.wikimedia.org stretch-wikimedia |
[production] |
18:44 |
<volans@deploy1001> |
Finished deploy [homer/deploy@8224f0a]: Release v0.2.2 (duration: 00m 18s) |
[production] |
18:43 |
<volans@deploy1001> |
Started deploy [homer/deploy@8224f0a]: Release v0.2.2 |
[production] |
18:28 |
<twentyafterfour@deploy1001> |
Synchronized php-1.35.0-wmf.31/includes/specials/pagers/DeletedContribsPager.php: sync https://gerrit.wikimedia.org/r/#/c/mediawiki/core/+/594768/ fixes T252043 (duration: 01m 08s) |
[production] |
18:24 |
<bd808> |
Updated "profile::toolforge::k8s::worker_nodes" list in "tools-k8s-haproxy" prefix puppet (T248702) |
[tools] |
18:14 |
<bd808> |
Shutdown tools-k8s-worker-[16-20] instances (T248702) |
[tools] |
18:04 |
<bd808> |
Draining tools-k8s-worker-[16-20] in preparation for decomm (T248702) |
[tools] |
17:56 |
<bd808> |
Cordoned tools-k8s-worker-[16-20] in preparation for decomm (T248702) |
[tools] |
17:34 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:31 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:12 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:06 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:05 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
17:01 |
<James_F> |
Deleting integration-agent-puppet-docker-1001 from WMC for final stage of T250502. |
[releng] |