351-400 of 10000 results (55ms)
2019-08-30 §
12:12 <marostegui> Start replication s2 on labsdb1009 and labsdb1010 [production]
11:57 <marostegui> Start replication s2 on labsdb1011 [production]
11:48 <marostegui> Start s2 replication on labsdb1012 [production]
11:33 <jynus> switching db1125:s2 (eqiad sanitarium) to replicate from codfw T231638 [production]
11:31 <marostegui> Temporary stop s2 replication on labsdb1009-labsdb1012 [production]
10:23 <jynus> reseting db1074 from iLo [production]
10:10 <jynus@deploy1001> Synchronized wmf-config/db-eqiad.php: Mirror dbctl depool of db1074 (duration: 00m 55s) [production]
09:57 <jynus@cumin1001> dbctl commit (dc=all): 'Depool db1074 after crash', diff saved to https://phabricator.wikimedia.org/P9013 and previous config saved to /var/cache/conftool/dbconfig/20190830-095747-jynus.json [production]
09:24 <ema> cp1075: depool ats-be due to low but constant 504 rate after 8.0.5-1wm4 upgrade [production]
09:20 <ema@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp1075.eqiad.wmnet,service=ats-be [production]
09:13 <ema> cp1075: upgrade ATS to 8.0.5-1wm4 [production]
08:50 <ema> repool ats-be on cp1075 and verify if T231504 is fixed [production]
08:49 <ema@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp1075.eqiad.wmnet,service=ats-be [production]
08:03 <marostegui@cumin1001> dbctl commit (dc=all): 'Fully repool db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9011 and previous config saved to /var/cache/conftool/dbconfig/20190830-080334-marostegui.json [production]
07:42 <marostegui> Upgrade db2055 db2071 db2072 db2092 [production]
07:10 <marostegui@cumin1001> dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9010 and previous config saved to /var/cache/conftool/dbconfig/20190830-071043-marostegui.json [production]
06:39 <marostegui@cumin1001> dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9009 and previous config saved to /var/cache/conftool/dbconfig/20190830-063949-marostegui.json [production]
06:25 <marostegui@cumin1001> dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9008 and previous config saved to /var/cache/conftool/dbconfig/20190830-062517-marostegui.json [production]
06:15 <marostegui@cumin1001> dbctl commit (dc=all): 'Slowly repool db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9007 and previous config saved to /var/cache/conftool/dbconfig/20190830-061546-marostegui.json [production]
06:07 <marostegui> Upgrade db1076 [production]
06:07 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1076 for upgrade - T230785', diff saved to https://phabricator.wikimedia.org/P9006 and previous config saved to /var/cache/conftool/dbconfig/20190830-060702-marostegui.json [production]
05:25 <marostegui> Stop MySQL on db2060 - T231625 [production]
05:23 <marostegui> Remove db2060 from tendril and zarcillo - T231625 [production]
05:15 <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Remove db2060 from config T231625 (duration: 00m 53s) [production]
05:14 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Remove db2060 from config T231625 (duration: 00m 53s) [production]
05:10 <marostegui> Restart wikibugs [production]
2019-08-29 §
23:23 <ejegg> updated payments-wiki from 1d5d7503b0 to 51d9ed79b6 [production]
23:15 <krinkle@deploy1001> Synchronized wmf-config/CommonSettings.php: 4cdfebe (duration: 00m 54s) [production]
21:36 <ejegg> re-enabled fundraising python jobs [production]
20:18 <ejegg> updated fundraising python tools from c0f4e7a379 to b42bda6bf3 [production]
20:14 <foks> removing two files for legal compliance [production]
20:14 <ejegg> disabled fundraising python jobs [production]
19:56 <ebernhardson> cloudelastic-chi run frwiki_content/_forcemerge?only_expunge_deletes=true to try and fix 5gb segments with 96% deleted documents [production]
18:59 <ebernhardson> restart elasticsearch on cloudelastic1003 (T231517) [production]
18:50 <ebernhardson> restart elasticsearch on cloudelastic1002 (T231517) [production]
18:41 <ebernhardson> set index.merge.scheduler.max_thread_count to null to accept default values on cloudelastic-chi (T231517) [production]
18:36 <krinkle@deploy1001> Synchronized php-1.34.0-wmf.20/extensions/AbuseFilter/includes/AbuseFilterVariableHolder.php: T231542 f37f0bd50cf (duration: 00m 53s) [production]
18:33 <krinkle@deploy1001> Synchronized php-1.34.0-wmf.20/extensions/CentralAuth/modules/ext.centralauth.ForeignApi.js: e7cd3cd313a4642 (duration: 00m 55s) [production]
18:23 <ebernhardson> restart elasticsearch on cloudelastic1001 (T231517) [production]
18:22 <urbanecm@deploy1001> Synchronized wmf-config/CommonSettings.php: SWAT: Fix "Assign all rights assigned to suppress group to oversight group" (T230601) (duration: 00m 54s) [production]
18:07 <ebernhardson> increase index.refresh_interval to 5m for all indices on cloudelastic-chi [production]
17:22 <crusnov@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) [production]
17:19 <crusnov@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) [production]
17:15 <dcausse> restarted elasticsearch on cloudelastic1004 (T231517) [production]
17:10 <crusnov@cumin1001> START - Cookbook sre.ganeti.makevm [production]
17:09 <crusnov@cumin1001> START - Cookbook sre.ganeti.makevm [production]
17:09 <crusnov@cumin1001> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) [production]
16:59 <crusnov@cumin1001> START - Cookbook sre.ganeti.makevm [production]
16:49 <crusnov@cumin1001> START - Cookbook sre.ganeti.makevm [production]
16:49 <crusnov@cumin1001> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) [production]