1951-2000 of 10000 results (30ms)
2020-06-02 ยง
15:50 <cdanis> thumbor1003 and thumbor1004 blipped, no obvious explanation, logs gathered at P11365 P11366 P11367 [production]
15:49 <XioNoX> push frack fw rules - T254260 [production]
15:48 <mutante> contint1001 - rm -rf /mnt/docker (T224591) [production]
15:45 <mutante> contint1001 - restarting docker afer changed data-root path (T224591) [production]
15:37 <cdanis@cumin1001> conftool action : set/pooled=no; selector: name=wtp1032.* [production]
15:35 <cdanis> power cycling wtp1032 which is bootlooping? https://phabricator.wikimedia.org/P11364 [production]
15:31 <hnowlan@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
15:24 <rzl@cumin1001> conftool action : set/pooled=yes; selector: name=thumbor100[34].* [production]
15:23 <XioNoX> repool codfw - T254216 [production]
15:19 <XioNoX> rollback ospf changes - T254216 [production]
15:09 <hnowlan@deploy1001> Finished deploy [cpjobqueue/deploy@8a53ff1]: (no justification provided) (duration: 02m 33s) [production]
15:07 <XioNoX> reboot cr1-codfw:fpc5 - T254216 [production]
15:06 <hnowlan@deploy1001> Started deploy [cpjobqueue/deploy@8a53ff1]: (no justification provided) [production]
15:05 <hnowlan> shifting all high traffic cpjobqueue rules to k8s [production]
14:57 <hnowlan@deploy1001> helmfile [CODFW] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
14:56 <XioNoX> depref ulsfo-codfw link - T254216 [production]
14:51 <hnowlan@deploy1001> helmfile [EQIAD] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'production' . [production]
14:50 <jynus@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
14:49 <jynus@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:49 <XioNoX> prefer eqsin-ulsfo tunnel - T254216 [production]
14:47 <cdanis@cumin1001> conftool action : set/pooled=no; selector: name=thumbor100[34].* [production]
14:38 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
14:31 <XioNoX> depool codfw - T254216 [production]
14:09 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
13:45 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
13:42 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:42 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
13:38 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:37 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
13:28 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:18 <dzahn@cumin1001> END (ERROR) - Cookbook sre.hosts.decommission (exit_code=97) [production]
13:18 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:18 <dzahn@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) [production]
13:18 <dzahn@cumin1001> START - Cookbook sre.hosts.decommission [production]
13:05 <cdanis@deploy1001> Synchronized php-1.35.0-wmf.31/includes/specials/pagers/ContribsPager.php: revert contribs limit to 5000 T234450 (duration: 00m 57s) [production]
13:04 <cdanis@deploy1001> Synchronized php-1.35.0-wmf.32/includes/specials/pagers/ContribsPager.php: revert contribs limit to 5000 T234450 (duration: 00m 57s) [production]
13:03 <cdanis@deploy1001> Synchronized php-1.35.0-wmf.34/includes/specials/pagers/ContribsPager.php: revert contribs limit to 5000 T234450 (duration: 00m 58s) [production]
12:59 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
12:59 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:56 <cdanis@deploy1001> Synchronized wmf-config/PoolCounterSettings.php: 5debc3223 limit per-user Special:Contributions concurrency to 2 T234450 (duration: 00m 58s) [production]
12:50 <kormat@cumin1001> dbctl commit (dc=all): 'Pool db2140 into s4 T252985', diff saved to https://phabricator.wikimedia.org/P11363 and previous config saved to /var/cache/conftool/dbconfig/20200602-125012-kormat.json [production]
12:39 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
12:31 <dzahn@cumin1001> conftool action : set/pooled=inactive; selector: name=mw217[3-9].codfw.wmnet [production]
12:30 <kormat@cumin1001> dbctl commit (dc=all): 'Repool db2110, copy to db2140 complete T252985', diff saved to https://phabricator.wikimedia.org/P11362 and previous config saved to /var/cache/conftool/dbconfig/20200602-123020-kormat.json [production]
12:28 <dzahn@cumin1001> conftool action : set/pooled=no; selector: name=mw217[3-9].codfw.wmnet [production]
11:10 <kart_> Finished EU Mid-day SWAT. [production]
11:08 <mutante> contint1001 - common issue after reinstalls again - a2dismod mpm_event ; systemctl restart apache2 ; puppet agent -tv ( T196968) https://gerrit.wikimedia.org/r/c/operations/puppet/+/451206 [production]
11:07 <kartik@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit|601174|Create URL campaign for African languages for COVID-19 translation project (T253305)]] (duration: 01m 00s) [production]
11:01 <hnowlan@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'changeprop-jobqueue' for release 'staging' . [production]
10:48 <mutante> LDAP - added uid=lulu to group nda (T254121) [production]