2019-08-30
|
12:12 |
<marostegui> |
Start s2 replication on labsdb1009 and labsdb1010 |
[production] |
11:57 |
<marostegui> |
Start s2 replication on labsdb1011 |
[production] |
11:48 |
<marostegui> |
Start s2 replication on labsdb1012 |
[production] |
11:33 |
<jynus> |
switching db1125:s2 (eqiad sanitarium) to replicate from codfw T231638 |
[production] |
11:31 |
<marostegui> |
Temporarily stop s2 replication on labsdb1009-labsdb1012 |
[production] |
10:23 |
<jynus> |
resetting db1074 from iLO |
[production] |
10:10 |
<jynus@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Mirror dbctl depool of db1074 (duration: 00m 55s) |
[production] |
09:57 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Depool db1074 after crash', diff saved to https://phabricator.wikimedia.org/P9013 and previous config saved to /var/cache/conftool/dbconfig/20190830-095747-jynus.json |
[production] |
09:24 |
<ema> |
cp1075: depool ats-be due to low but constant 504 rate after 8.0.5-1wm4 upgrade |
[production] |
09:20 |
<ema@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp1075.eqiad.wmnet,service=ats-be |
[production] |
09:13 |
<ema> |
cp1075: upgrade ATS to 8.0.5-1wm4 |
[production] |
08:50 |
<ema> |
repool ats-be on cp1075 and verify if T231504 is fixed |
[production] |
08:49 |
<ema@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp1075.eqiad.wmnet,service=ats-be |
[production] |
08:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9011 and previous config saved to /var/cache/conftool/dbconfig/20190830-080334-marostegui.json |
[production] |
07:42 |
<marostegui> |
Upgrade db2055 db2071 db2072 db2092 |
[production] |
07:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9010 and previous config saved to /var/cache/conftool/dbconfig/20190830-071043-marostegui.json |
[production] |
06:39 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9009 and previous config saved to /var/cache/conftool/dbconfig/20190830-063949-marostegui.json |
[production] |
06:25 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'More traffic to db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9008 and previous config saved to /var/cache/conftool/dbconfig/20190830-062517-marostegui.json |
[production] |
06:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1076 after upgrade', diff saved to https://phabricator.wikimedia.org/P9007 and previous config saved to /var/cache/conftool/dbconfig/20190830-061546-marostegui.json |
[production] |
06:07 |
<marostegui> |
Upgrade db1076 |
[production] |
06:07 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1076 for upgrade - T230785', diff saved to https://phabricator.wikimedia.org/P9006 and previous config saved to /var/cache/conftool/dbconfig/20190830-060702-marostegui.json |
[production] |
05:25 |
<marostegui> |
Stop MySQL on db2060 - T231625 |
[production] |
05:23 |
<marostegui> |
Remove db2060 from tendril and zarcillo - T231625 |
[production] |
05:15 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-codfw.php: Remove db2060 from config T231625 (duration: 00m 53s) |
[production] |
05:14 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Remove db2060 from config T231625 (duration: 00m 53s) |
[production] |
05:10 |
<marostegui> |
Restart wikibugs |
[production] |
2019-08-29
|
23:23 |
<ejegg> |
updated payments-wiki from 1d5d7503b0 to 51d9ed79b6 |
[production] |
23:15 |
<krinkle@deploy1001> |
Synchronized wmf-config/CommonSettings.php: 4cdfebe (duration: 00m 54s) |
[production] |
21:36 |
<ejegg> |
re-enabled fundraising python jobs |
[production] |
20:18 |
<ejegg> |
updated fundraising python tools from c0f4e7a379 to b42bda6bf3 |
[production] |
20:14 |
<foks> |
removing two files for legal compliance |
[production] |
20:14 |
<ejegg> |
disabled fundraising python jobs |
[production] |
19:56 |
<ebernhardson> |
cloudelastic-chi: run frwiki_content/_forcemerge?only_expunge_deletes=true to try to fix 5 GB segments with 96% deleted documents |
[production] |
18:59 |
<ebernhardson> |
restart elasticsearch on cloudelastic1003 (T231517) |
[production] |
18:50 |
<ebernhardson> |
restart elasticsearch on cloudelastic1002 (T231517) |
[production] |
18:41 |
<ebernhardson> |
set index.merge.scheduler.max_thread_count to null to accept default values on cloudelastic-chi (T231517) |
[production] |
18:36 |
<krinkle@deploy1001> |
Synchronized php-1.34.0-wmf.20/extensions/AbuseFilter/includes/AbuseFilterVariableHolder.php: T231542 f37f0bd50cf (duration: 00m 53s) |
[production] |
18:33 |
<krinkle@deploy1001> |
Synchronized php-1.34.0-wmf.20/extensions/CentralAuth/modules/ext.centralauth.ForeignApi.js: e7cd3cd313a4642 (duration: 00m 55s) |
[production] |
18:23 |
<ebernhardson> |
restart elasticsearch on cloudelastic1001 (T231517) |
[production] |
18:22 |
<urbanecm@deploy1001> |
Synchronized wmf-config/CommonSettings.php: SWAT: Fix "Assign all rights assigned to suppress group to oversight group" (T230601) (duration: 00m 54s) |
[production] |
18:07 |
<ebernhardson> |
increase index.refresh_interval to 5m for all indices on cloudelastic-chi |
[production] |
17:22 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
17:19 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
17:15 |
<dcausse> |
restarted elasticsearch on cloudelastic1004 (T231517) |
[production] |
17:10 |
<crusnov@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
17:09 |
<crusnov@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
17:09 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
16:59 |
<crusnov@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
16:49 |
<crusnov@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
16:49 |
<crusnov@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |