2016-04-28
§
|
08:12 |
<elukey> |
restarting kafka on kafka{1012,1014,1022,1020,2001,2002} for Java upgrades. Will probably trigger some EventLogging alarms due to a bug (T133779) |
[production] |
07:51 |
<twentyafterfour> |
applied a hotfix to phabricator repository import job so that autoclose will not apply to unmerged refs/changes |
[production] |
07:50 |
<twentyafterfour> |
reduced the number of phabricator worker processes to hopefully stop exhausting mysql connections. |
[production] |
05:37 |
<mutante> |
lvs1012 - puppet fail, tries to upgrade tcpdump package and cannot be authenticated |
[production] |
05:34 |
<mutante> |
mw1146 - hhvm restart |
[production] |
05:27 |
<mutante> |
krypton remove RT packages, remnants from testing |
[production] |
03:04 |
<catrope@tin> |
Synchronized php-1.27.0-wmf.22/extensions/Echo: Fix T133817 (originally scheduled for SWAT) (duration: 00m 34s) |
[production] |
03:03 |
<catrope@tin> |
Synchronized php-1.27.0-wmf.21/extensions/Echo: Fix T133817 (originally scheduled for SWAT) (duration: 00m 39s) |
[production] |
02:41 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.22) (duration: 09m 24s) |
[production] |
02:24 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.21) (duration: 10m 38s) |
[production] |
02:12 |
<twentyafterfour> |
manually edited crontab on iridium and killed multiple instances of public_task_dump.py (the cronjob was defined as * 2 * * * instead of 0 2 * * *) |
[production] |
00:48 |
<twentyafterfour> |
Phabricator's back online, everything seems to have gone smoothly. |
[production] |
00:29 |
<twentyafterfour> |
Preparing to take phabricator offline for maintenance. |
[production] |
2016-04-27
§
|
22:18 |
<mattflaschen@tin> |
Synchronized wmf-config/db-labs.php: Beta Cluster change (duration: 00m 29s) |
[production] |
22:04 |
<bblack> |
banned req.url ~ "^/w/load.php.*choiceData" on cache_text |
[production] |
22:00 |
<bblack> |
banned req.url ~ "^/load.php.*choiceData" on cache_text |
[production] |
21:22 |
<cwd> |
updated civicrm from 15a0086eef78f16110eba358a28ef78b51a385e1 to 777a91b8f9f6003a3eebdb8f2c73e45cc2bfb4a4 |
[production] |
21:03 |
<bblack> |
rebooting cp1065 |
[production] |
21:01 |
<ebernhardson@tin> |
Synchronized wmf-config/InitialiseSettings.php: Restore codfw to elasticsearch config T133784 (duration: 00m 31s) |
[production] |
21:00 |
<ebernhardson@tin> |
Synchronized wmf-config/CirrusSearch-production.php: Restore codfw to elasticsearch config T133784 (duration: 00m 37s) |
[production] |
20:48 |
<thcipriani> |
restarting jenkins after plugin downgrade |
[production] |
20:41 |
<hashar> |
1.27.0-wmf.22 to group1 has been completed without incident. Deployment is open ! |
[production] |
20:41 |
<ebernhardson> |
Enabled cirrussearch writes to codfw only on mw1165 w/ live hack |
[production] |
20:32 |
<gehel> |
switching wdqs1002 to maintenance and reimporting data (T133566) |
[production] |
20:28 |
<cscott> |
updated OCG to version e39e06570083877d5498da577758cf8d162c1af4 |
[production] |
20:20 |
<yurik> |
deployed kartotherian & tilerator services |
[production] |
20:09 |
<gehel> |
adding back wdqs1001 to varnish configuration after reinstall (T133566) |
[production] |
19:24 |
<Pchelolo> |
update restbase to e9fbdfe |
[production] |
19:18 |
<Pchelolo> |
update restbase to e9fbdfe: canary on restbase1007 |
[production] |
19:11 |
<Pchelolo> |
update restbase to e9fbdfe: staging |
[production] |
19:09 |
<hashar@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: group1 wikis to 1.27.0-wmf.22 |
[production] |
19:00 |
<dcausse> |
restarting elastic on elastic2007.codfw.wmnet (master) |
[production] |
18:56 |
<mutante> |
creating VM ununpentium on ganeti/eqiad (T123713) |
[production] |
18:55 |
<ebernhardson@tin> |
Synchronized wmf-config/InitialiseSettings.php: Drop codfw from elasticsearch config T133784 (duration: 00m 36s) |
[production] |
18:55 |
<ebernhardson@tin> |
Synchronized wmf-config/CirrusSearch-production.php: Drop codfw from elasticsearch config T133784 (duration: 00m 25s) |
[production] |
18:02 |
<jynus> |
generating new triggers for eventlogging_sync schema T108856 |
[production] |
16:58 |
<gehel> |
increase throttling limit and concurrency on recoveries for elasticsearch codfw cluster (T133784) |
[production] |
16:05 |
<gehel> |
increasing curl pool size for jobrunners (T133755) |
[production] |
15:46 |
<elukey> |
restarted kafka1013 for java upgrades |
[production] |
15:30 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: Enable NewUserMessage on hiwikiquote [[gerrit:285639]] (duration: 00m 31s) |
[production] |
15:09 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: Add Subject namespace to hiwikibooks [[gerrit:285008]] (duration: 02m 41s) |
[production] |
14:52 |
<moritzm> |
uploaded pcre 8.31-2ubuntu2.3+wm1 to carbon for trusty-wikimedia (rebuild of latest trusty update with our patch to enable JIT) |
[production] |
14:44 |
<elukey> |
repooled kafka1001 after upgrades, will do the same procedure to kafka1002 |
[production] |
14:40 |
<_joe_> |
upgraded conftool on palladium |
[production] |
14:40 |
<elukey> |
restarted kafka on kafka1001 |
[production] |
14:38 |
<_joe_> |
upgrading conftool on all cp servers |
[production] |
14:33 |
<elukey> |
kafka1001.eqiad.wmnet depooled from eventbus for kafka upgrades (via confctl) |
[production] |
14:14 |
<chasemp> |
restart phd on iridium as it keeps complaining it lost procs (seems ok now) |
[production] |
13:53 |
<elukey> |
restarted kafka on kafka1018.eqiad.wmnet for Java upgrades |
[production] |
13:24 |
<gehel> |
hard restart of codfw elasticsearch cluster |
[production] |