2018-04-10
ยง
|
23:04 |
<Krinkle> |
Seemingly from 22:53 - 23:03 global traffic dropped by 30-60%, presumably due to issues in eqiad where 10 Gbits dropped to 3 Gbits sharper than ever before. |
[production] |
22:49 |
<joal@tin> |
Finished deploy [analytics/refinery@33448cd]: Deploying fixes after todays deploy errors (duration: 04m 46s) |
[production] |
22:45 |
<joal@tin> |
Started deploy [analytics/refinery@33448cd]: Deploying fixes after todays deploy errors |
[production] |
21:18 |
<sbisson@tin> |
Finished deploy [kartotherian/deploy@8f3a903]: Rollback kartotherian to v0.0.35 (duration: 06m 27s) |
[production] |
21:12 |
<sbisson@tin> |
Started deploy [kartotherian/deploy@8f3a903]: Rollback kartotherian to v0.0.35 |
[production] |
20:41 |
<sbisson@tin> |
Finished deploy [kartotherian/deploy@bdf70ed]: Deploying kartotherian pre-i18n everywhere (downgrade snapshot) (duration: 03m 45s) |
[production] |
20:37 |
<sbisson@tin> |
Started deploy [kartotherian/deploy@bdf70ed]: Deploying kartotherian pre-i18n everywhere (downgrade snapshot) |
[production] |
20:30 |
<mutante> |
deploy1001 - reinstalled with stretch - re-adding to puppet (T175288) |
[production] |
20:30 |
<mutante> |
deploy1001 - reinstalled with jessie - re-adding to puppet (T175288) |
[production] |
20:13 |
<urandom> |
increasing sample change-prop sample rate to 20% (from 10) in dev environment -- T186751 |
[production] |
20:06 |
<thcipriani@tin> |
rebuilt and synchronized wikiversions files: testwiki back to 1.31.0-wmf.28 |
[production] |
20:02 |
<sbisson@tin> |
Finished deploy [kartotherian/deploy@6e4d666]: Deploying kartotherian pre-i18n everywhere (duration: 04m 34s) |
[production] |
19:58 |
<sbisson@tin> |
Started deploy [kartotherian/deploy@6e4d666]: Deploying kartotherian pre-i18n everywhere |
[production] |
19:57 |
<sbisson@tin> |
Finished deploy [tilerator/deploy@3326c14]: Deploying tilerator pre-i18n everywhere (duration: 00m 48s) |
[production] |
19:56 |
<sbisson@tin> |
Started deploy [tilerator/deploy@3326c14]: Deploying tilerator pre-i18n everywhere |
[production] |
19:48 |
<sbisson@tin> |
Finished deploy [tilerator/deploy@3326c14]: Deploying tilerator pre-i18n to maps-test* (duration: 00m 27s) |
[production] |
19:48 |
<sbisson@tin> |
Started deploy [tilerator/deploy@3326c14]: Deploying tilerator pre-i18n to maps-test* |
[production] |
19:16 |
<thcipriani@tin> |
Finished scap: testwiki to php-1.31.0-wmf.29 and rebuild l10n cache (duration: 66m 28s) |
[production] |
18:10 |
<thcipriani@tin> |
Started scap: testwiki to php-1.31.0-wmf.29 and rebuild l10n cache |
[production] |
18:07 |
<Krinkle> |
Stopping coal on graphite1001 to manually repopulate for T191239 |
[production] |
18:04 |
<otto@tin> |
Finished deploy [analytics/refinery@b8ea97f]: refinery 0.0.60 - take 3 (duration: 04m 54s) |
[production] |
17:59 |
<otto@tin> |
Started deploy [analytics/refinery@b8ea97f]: refinery 0.0.60 - take 3 |
[production] |
17:58 |
<otto@tin> |
Finished deploy [analytics/refinery@b8ea97f]: refinery 0.0.60 - take 2 (duration: 01m 50s) |
[production] |
17:56 |
<otto@tin> |
Started deploy [analytics/refinery@b8ea97f]: refinery 0.0.60 - take 2 |
[production] |
17:56 |
<otto@tin> |
Started deploy [analytics/refinery@b8ea97f]: refinery 0.0.60 - take 2^ |
[production] |
17:49 |
<joal@tin> |
Finished deploy [analytics/refinery@b8ea97f]: Analytics weekly deploy - Move to spark 2 (duration: 03m 55s) |
[production] |
17:48 |
<joal@tin> |
(no justification provided) |
[production] |
17:47 |
<joal@tin> |
(no justification provided) |
[production] |
17:45 |
<joal@tin> |
Started deploy [analytics/refinery@b8ea97f]: Analytics weekly deploy - Move to spark 2 |
[production] |
17:43 |
<chasemp> |
add static route to neutron poc instance range for codfw 172.16.128.0/21 |
[production] |
17:22 |
<papaul> |
shutting down cp2022 for main board replacement |
[production] |
17:20 |
<awight@tin> |
Finished deploy [ores/deploy@d35a1e6]: Test deploy virtualenv on ores1001, with logging and forced failure (duration: 02m 44s) |
[production] |
17:17 |
<awight@tin> |
Started deploy [ores/deploy@d35a1e6]: Test deploy virtualenv on ores1001, with logging and forced failure |
[production] |
17:07 |
<awight@tin> |
Finished deploy [ores/deploy@1e18fa6]: Test deploy virtualenv on ores1001, with logging (duration: 02m 28s) |
[production] |
17:05 |
<awight@tin> |
Started deploy [ores/deploy@1e18fa6]: Test deploy virtualenv on ores1001, with logging |
[production] |
16:57 |
<thcipriani> |
starting branch cut of 1.31.0-wmf.29 |
[production] |
16:45 |
<andrew@tin> |
Synchronized wmf-config/CommonSettings.php: disable new accounts on labtestwikitech (duration: 01m 00s) |
[production] |
16:26 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Change db2045 IP as it is being moved to another rack - T191193 (duration: 00m 59s) |
[production] |
16:25 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Change db2045 IP as it is being moved to another rack - T191193 (duration: 00m 59s) |
[production] |
16:21 |
<marostegui> |
Reload haproxy on dbproxy1010 to depool labsdb1011 |
[production] |
16:11 |
<marostegui> |
Stop MySQL on db2045 (s8 codfw master) to move it to another rack, this will break replication on codfw - T191193 |
[production] |
16:07 |
<bstorm_> |
labsdb1010 now has the latest views available, including the comment table |
[production] |
16:05 |
<marostegui> |
Reload haproxy on dbproxy1010 to repool labsdb1010 |
[production] |
15:42 |
<ottomata> |
disable puppet on analytics1003 and stop camus crons in preperation for spark 2 upgrade |
[production] |
15:32 |
<marostegui> |
Reload haproxy on dbproxy1010 to depool labsdb1010 |
[production] |
15:26 |
<vgutierrez> |
Reimage lvs5003 as stretch |
[production] |
15:22 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Change db2040 IP as it is being moved to another rack - T191193 (duration: 00m 59s) |
[production] |
15:21 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Change db2040 IP as it is being moved to another rack - T191193 (duration: 00m 59s) |
[production] |
15:08 |
<volans> |
restarting Icinga on einsteinium, command file not working |
[production] |
15:06 |
<bd808> |
Wiki replicas: ran `sudo maintain-views --table page_assessments --database arwiki` on all 3 servers for T191455 |
[production] |