2018-04-16
ยง
|
14:05 |
<hashar> |
restarted Jenkins for plugin upgrade T192261 |
[production] |
14:03 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Restore main traffic original weight for db1114 (duration: 00m 58s) |
[production] |
13:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Slowly repool db1114 (duration: 00m 58s) |
[production] |
13:32 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Depool es1017 (duration: 00m 58s) |
[production] |
13:31 |
<marostegui> |
Stop MySQL on db1114 to reboot with another kernel - T191996 |
[production] |
13:30 |
<godog> |
roll-restart swift-proxy in codfw and eqiad - T188062 |
[production] |
13:28 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1114 (duration: 00m 54s) |
[production] |
13:18 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1087 (duration: 00m 59s) |
[production] |
12:12 |
<vgutierrez@neodymium> |
conftool action : set/pooled=no; selector: name=hydrogen.wikimedia.org,service=pdns_recursor |
[production] |
12:11 |
<vgutierrez> |
Depool and reimage hydrogen as stretch - T187090 |
[production] |
11:50 |
<vgutierrez@neodymium> |
conftool action : set/pooled=yes; selector: name=acamar.wikimedia.org,service=pdns_recursor |
[production] |
11:11 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Restore db1114 original weight (duration: 00m 59s) |
[production] |
10:50 |
<moritzm> |
reimaging mw1299 (job runner) to stretch |
[production] |
10:23 |
<ariel@tin> |
Finished deploy [dumps/dumps@4706d30]: show full stacktrace for dump job errors (duration: 00m 04s) |
[production] |
10:23 |
<ariel@tin> |
Started deploy [dumps/dumps@4706d30]: show full stacktrace for dump job errors |
[production] |
10:18 |
<godog> |
upload prometheus-memcached-exporter to stretch-wikimedia - T189056 |
[production] |
10:17 |
<jdrewniak@tin> |
Synchronized portals: Wikimedia Portals Update: [[gerrit:426886|Bumping portals to master (T128546)]] (duration: 00m 58s) |
[production] |
10:16 |
<jdrewniak@tin> |
Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:426886|Bumping portals to master (T128546)]] (duration: 00m 59s) |
[production] |
09:54 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Give more API traffic to db1114 (duration: 00m 58s) |
[production] |
09:50 |
<vgutierrez@neodymium> |
conftool action : set/pooled=no; selector: name=acamar.wikimedia.org,service=pdns_recursor |
[production] |
09:49 |
<vgutierrez> |
Depool and reimage acamar as stretch - T187090 |
[production] |
09:43 |
<gehel> |
rolling restart of wdqs100[35] and wdqs200[123] for kernel upgrade completed |
[production] |
09:40 |
<jynus> |
restarting dbstore2001:s8 to increase the number of purge threads |
[production] |
09:23 |
<vgutierrez@neodymium> |
conftool action : set/pooled=yes; selector: name=achernar.wikimedia.org,service=pdns_recursor |
[production] |
09:07 |
<gehel> |
starting rolling restart of wdqs100[35] and wdqs200[123] for kernel upgrade |
[production] |
09:05 |
<moritzm> |
pooled mw1276-mw1278 (API app server canaries running stretch) |
[production] |
08:49 |
<gehel> |
first manual run of populate_admin() for maps[12]001 - T190605 |
[production] |
08:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Restore db1114 original main traffic weight (duration: 00m 58s) |
[production] |
08:41 |
<moritzm> |
pooled mw1261-mw1264 (app server canaries running stretch) |
[production] |
08:29 |
<joal@tin> |
Finished deploy [analytics/refinery@27416a9]: Regular weekly deploy - Mostly bugfixes from previous week huge deploy (duration: 05m 27s) |
[production] |
08:25 |
<_joe_> |
depooling mw1223 for investigation too |
[production] |
08:23 |
<joal@tin> |
Started deploy [analytics/refinery@27416a9]: Regular weekly deploy - Mostly bugfixes from previous week huge deploy |
[production] |
08:17 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Slowly repool db1114 in API (duration: 00m 58s) |
[production] |
08:04 |
<elukey> |
restart hhvm on mw[1228,1234,1281-1287,1289,1290,1312-1314,1317,1339,1343,1345,1346,1348] - more than 50% cpu usage, prevention scheme for current high load |
[production] |
08:00 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Slowly repool db1114 (duration: 00m 58s) |
[production] |
07:49 |
<marostegui> |
Stop MySQL and reboot db1114 - T191996 |
[production] |
07:46 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1114 (duration: 00m 59s) |
[production] |
07:40 |
<vgutierrez@neodymium> |
conftool action : set/pooled=no; selector: name=achernar.wikimedia.org,service=pdns_recursor |
[production] |
07:39 |
<vgutierrez> |
Depool and reimage achernar.wikimedia.org - T187090 |
[production] |
07:27 |
<moritzm> |
installing perl security updates on Debian systems |
[production] |
06:45 |
<TimStarling> |
depooled mw1230 |
[production] |
06:38 |
<_joe_> |
repooling mw1230 |
[production] |
06:20 |
<marostegui> |
Drop table flow_subscription from x1 - T149936 |
[production] |
05:59 |
<elukey> |
restart hhvm on mw[1221,1233,1280,1347] - high load |
[production] |
05:55 |
<elukey> |
repool mw1341 after investigation |
[production] |
05:48 |
<elukey> |
restart hhvm on mw1225, 1315, 1316, 1340, 1341, 1342, 1347 - high load |
[production] |
05:42 |
<marostegui> |
Reload haproxy on dbproxy1010 |
[production] |
05:36 |
<elukey> |
restart hhvm on mw1226,27,32,88 - high load |
[production] |
05:35 |
<_joe_> |
depooling mw1341 to further debug the API issue |
[production] |
05:33 |
<marostegui> |
Deploy schema change on db1087 with replication (this will generate lag in labs) - T187089 T185128 T153182 |
[production] |