2020-06-02
ยง
|
16:55 |
<cstone> |
SmashPig revision changed from 44690f761c to b9de3c7aac |
[production] |
15:57 |
<ejegg> |
updated payments-wiki from d11efeb1cf to 1942a537ef |
[production] |
15:50 |
<cdanis> |
thumbor1003 and thumbor1004 blipped, no obvious explanation, logs gathered at P11365 P11366 P11367 |
[production] |
15:49 |
<XioNoX> |
push frack fw rules - T254260 |
[production] |
15:48 |
<mutante> |
contint1001 - rm -rf /mnt/docker (T224591) |
[production] |
15:45 |
<mutante> |
contint1001 - restarting docker afer changed data-root path (T224591) |
[production] |
15:37 |
<cdanis@cumin1001> |
conftool action : set/pooled=no; selector: name=wtp1032.* |
[production] |
15:35 |
<cdanis> |
power cycling wtp1032 which is bootlooping? https://phabricator.wikimedia.org/P11364 |
[production] |
15:31 |
<hnowlan@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . |
[production] |
15:24 |
<rzl@cumin1001> |
conftool action : set/pooled=yes; selector: name=thumbor100[34].* |
[production] |
15:23 |
<XioNoX> |
repool codfw - T254216 |
[production] |
15:19 |
<XioNoX> |
rollback ospf changes - T254216 |
[production] |
15:09 |
<hnowlan@deploy1001> |
Finished deploy [cpjobqueue/deploy@8a53ff1]: (no justification provided) (duration: 02m 33s) |
[production] |
15:07 |
<XioNoX> |
reboot cr1-codfw:fpc5 - T254216 |
[production] |
15:06 |
<hnowlan@deploy1001> |
Started deploy [cpjobqueue/deploy@8a53ff1]: (no justification provided) |
[production] |
15:05 |
<hnowlan> |
shifting all high traffic cpjobqueue rules to k8s |
[production] |
14:57 |
<hnowlan@deploy1001> |
helmfile [CODFW] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . |
[production] |
14:56 |
<XioNoX> |
depref ulsfo-codfw link - T254216 |
[production] |
14:51 |
<hnowlan@deploy1001> |
helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . |
[production] |
14:50 |
<jynus@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
14:49 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:49 |
<XioNoX> |
prefer eqsin-ulsfo tunnel - T254216 |
[production] |
14:47 |
<cdanis@cumin1001> |
conftool action : set/pooled=no; selector: name=thumbor100[34].* |
[production] |
14:38 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . |
[production] |
14:31 |
<XioNoX> |
depool codfw - T254216 |
[production] |
14:09 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . |
[production] |
13:45 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
13:42 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:42 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
13:38 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:37 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
13:28 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:18 |
<dzahn@cumin1001> |
END (ERROR) - Cookbook sre.hosts.decommission (exit_code=97) |
[production] |
13:18 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:18 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
13:18 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:05 |
<cdanis@deploy1001> |
Synchronized php-1.35.0-wmf.31/includes/specials/pagers/ContribsPager.php: revert contribs limit to 5000 T234450 (duration: 00m 57s) |
[production] |
13:04 |
<cdanis@deploy1001> |
Synchronized php-1.35.0-wmf.32/includes/specials/pagers/ContribsPager.php: revert contribs limit to 5000 T234450 (duration: 00m 57s) |
[production] |
13:03 |
<cdanis@deploy1001> |
Synchronized php-1.35.0-wmf.34/includes/specials/pagers/ContribsPager.php: revert contribs limit to 5000 T234450 (duration: 00m 58s) |
[production] |
12:59 |
<marostegui@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
12:59 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:56 |
<cdanis@deploy1001> |
Synchronized wmf-config/PoolCounterSettings.php: 5debc3223 limit per-user Special:Contributions concurrency to 2 T234450 (duration: 00m 58s) |
[production] |
12:50 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'Pool db2140 into s4 T252985', diff saved to https://phabricator.wikimedia.org/P11363 and previous config saved to /var/cache/conftool/dbconfig/20200602-125012-kormat.json |
[production] |
12:39 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . |
[production] |
12:31 |
<dzahn@cumin1001> |
conftool action : set/pooled=inactive; selector: name=mw217[3-9].codfw.wmnet |
[production] |
12:30 |
<kormat@cumin1001> |
dbctl commit (dc=all): 'Repool db2110, copy to db2140 complete T252985', diff saved to https://phabricator.wikimedia.org/P11362 and previous config saved to /var/cache/conftool/dbconfig/20200602-123020-kormat.json |
[production] |
12:28 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw217[3-9].codfw.wmnet |
[production] |
11:10 |
<kart_> |
Finished EU Mid-day SWAT. |
[production] |
11:08 |
<mutante> |
contint1001 - common issue after reinstalls again - a2dismod mpm_event ; systemctl restart apache2 ; puppet agent -tv ( T196968) https://gerrit.wikimedia.org/r/c/operations/puppet/+/451206 |
[production] |
11:07 |
<kartik@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit|601174|Create URL campaign for African languages for COVID-19 translation project (T253305)]] (duration: 01m 00s) |
[production] |