2017-10-30
|
10:31 |
<marostegui> |
Stop MySQL on db2039 to copy its data to db2087.s6 - T178359 |
[production] |
10:27 |
<mobrovac@tin> |
Finished deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch (duration: 04m 41s) |
[production] |
10:22 |
<mobrovac@tin> |
Started deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch |
[production] |
10:03 |
<mobrovac@tin> |
Finished deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch (duration: 12m 12s) |
[production] |
09:57 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Restore db1097 original weight - T161088 (duration: 00m 52s) |
[production] |
09:50 |
<mobrovac@tin> |
Started deploy [restbase/deploy@2b5889b]: Double-process all summaries and include the parsoid no-op switch |
[production] |
09:22 |
<marostegui> |
Stop replication in sync on db2039 and db2046 to reimport tables |
[production] |
09:08 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Increase db1097 api traffic - T161088 (duration: 00m 50s) |
[production] |
09:08 |
<marostegui> |
Drop wb_entity_per_page page from s3 and s5 - T177601 |
[production] |
08:51 |
<ema> |
cp4022: restart varnish-be for mbox lag |
[production] |
08:42 |
<elukey> |
raised priority of refreshlink and htmlcacheupdate job execution on jobrunners (https://gerrit.wikimedia.org/r/#/c/386636/) - T173710 |
[production] |
08:39 |
<marostegui> |
Stop replication on db2039 to reimport and compress tables |
[production] |
08:38 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1097 with low weight - T161088 (duration: 00m 50s) |
[production] |
08:33 |
<moritzm> |
installing wget security updates |
[production] |
08:32 |
<gehel> |
rolling restart of wdqs for config reload - T175919 |
[production] |
08:31 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Depool db2039 (duration: 00m 50s) |
[production] |
08:28 |
<ariel@tin> |
Finished deploy [dumps/dumps@c204c72]: fix args initialization issue in getconfigvals script (duration: 00m 02s) |
[production] |
08:28 |
<ariel@tin> |
Started deploy [dumps/dumps@c204c72]: fix args initialization issue in getconfigvals script |
[production] |
08:23 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Repool db2040 - T178359 (duration: 00m 50s) |
[production] |
08:11 |
<marostegui> |
Stop MySQL on db2086 to transfer s7 to db2087 - T178359 |
[production] |
06:51 |
<marostegui> |
Stop MySQL on db1097 to clone db1103 - T161088 |
[production] |
06:47 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1097 - T161088 (duration: 00m 50s) |
[production] |
06:41 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Move db1103 from s3 to s4 - T161088 (duration: 00m 49s) |
[production] |
06:40 |
<marostegui> |
Stop MySQL on db1103 to reclone it - T161088 |
[production] |
06:24 |
<marostegui> |
Optimize dewiki.pagelinks and dewiki.templatelinks on db1063 (s5 master) - T174509 |
[production] |
06:12 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Depool db2040 - T178359 (duration: 00m 51s) |
[production] |
06:08 |
<marostegui> |
Stop MySQL on db2040 to populate s7 on db2086 - T178359 |
[production] |
05:58 |
<marostegui> |
Deploy alter table on db1075 (s3 primary master) - T174509 |
[production] |
03:11 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Mon Oct 30 03:11:37 UTC 2017 (duration 7m 11s) |
[production] |
03:04 |
<l10nupdate@tin> |
scap sync-l10n completed (1.31.0-wmf.5) (duration: 11m 38s) |
[production] |
02:35 |
<l10nupdate@tin> |
scap sync-l10n completed (1.31.0-wmf.4) (duration: 11m 18s) |
[production] |
2017-10-29
|
23:49 |
<ema> |
powercycle cp4024 |
[production] |
22:31 |
<ariel@tin> |
Finished deploy [dumps/dumps@2aa2275]: fix keep setting to work with overrides (duration: 00m 02s) |
[production] |
22:31 |
<ariel@tin> |
Started deploy [dumps/dumps@2aa2275]: fix keep setting to work with overrides |
[production] |
17:55 |
<ariel@tin> |
Finished deploy [dumps/dumps@d8978ce]: add overrides section processing to config file (duration: 00m 04s) |
[production] |
17:55 |
<ariel@tin> |
Started deploy [dumps/dumps@d8978ce]: add overrides section processing to config file |
[production] |
17:23 |
<ariel@tin> |
Finished deploy [dumps/dumps@d426cf7]: batch 7z jobs, multistream job fixup (duration: 00m 02s) |
[production] |
17:23 |
<ariel@tin> |
Started deploy [dumps/dumps@d426cf7]: batch 7z jobs, multistream job fixup |
[production] |
12:54 |
<ema> |
cp4026: restart varnish-be for mbox lag |
[production] |
2017-10-28
|
21:03 |
<bblack> |
cp1067 (current target cache): disabling the relatively-new VCL that sets do_stream=false if !CL on applayer fetches... |
[production] |
19:39 |
<hoo@tin> |
Synchronized wmf-config/CommonSettings.php: Halve the Flow -> Parsoid timeout (100s -> 50s) (T179156) (duration: 00m 51s) |
[production] |
19:39 |
<bblack> |
backend restart on cp1065 |
[production] |
18:39 |
<bblack> |
restarting varnish backend on cp1053 to move the lag/503 issues to another box and buy more time to debug |
[production] |
18:28 |
<bblack> |
cp4025 - restart backend for mailbox lag (upload@ulsfo, unrelated to text-cluster issues) |
[production] |
18:21 |
<bblack> |
cp1053 - manual VCL change, backends appservers+api_appservers, reduce connect/firstbyte/betweenbytes timeouts from 5/180/60 to 3/20/10 |
[production] |
16:51 |
<elukey> |
restart varnish backend on cp1055 - mailbox lag + T179156 |
[production] |
12:14 |
<elukey@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=mw1313.eqiad.wmnet |
[production] |
12:10 |
<elukey> |
manually killed (SIGTERM) hhvm on mw1313 - high load, hhvm-dump-debug not responsive |
[production] |
12:01 |
<elukey@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=mw1313.eqiad.wmnet |
[production] |
11:53 |
<elukey> |
restart hhvm on mw1285 - hhvm-dump-debug in /tmp/hhvm.17700.bt |
[production] |