2016-05-23
ยง
|
19:01 |
<awight> |
update SmashPig from 5cadcf3abcfcda4552b068c783337d82b743e2e5 to f0bf4385afac65a27f99c5f657c3d0931c991fa8 |
[production] |
18:50 |
<awight> |
Rollback SmashPig from aa1614afa845358669208c2f6c4cd62e83a98f4c to 5cadcf3abcfcda4552b068c783337d82b743e2e5 |
[production] |
18:49 |
<krenair@tin> |
Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/290290 (duration: 00m 38s) |
[production] |
18:39 |
<awight> |
Disable Adyen SmashPig runner and just stop spamming the server admin log. |
[production] |
18:39 |
<awight> |
Reenable Adyen SmashPig job runner |
[production] |
18:38 |
<awight> |
Paused Adyen SmashPig job runner |
[production] |
18:34 |
<awight> |
update SmashPig from 5cadcf3abcfcda4552b068c783337d82b743e2e5 to aa1614afa845358669208c2f6c4cd62e83a98f4c |
[production] |
18:25 |
<andrewbogott> |
temporarily turning off pdns and recursor on holmium (https://phabricator.wikimedia.org/T106303) |
[production] |
18:10 |
<gehel> |
removing maintenance from wdqs1001 |
[production] |
17:35 |
<gehel> |
putting wdqs1001 in maintenance to fix deployment issues |
[production] |
17:28 |
<elukey> |
re-run from Hue webrequest-load-wf-(text|upload)-2016-5-23-13. The failures were likely caused by my global Yarn restart on the cluster. |
[analytics] |
17:20 |
<elukey> |
oozie bundles re-enabled |
[analytics] |
17:13 |
<andrewbogott> |
rebooting labvirt1003 |
[production] |
17:10 |
<elukey> |
restarting Yarn Resource manager (master node) on analytics1001 to apply a new Spark configuration. The service will automatically failover to analytics1002 |
[production] |
16:45 |
<bd808> |
Stashbot back online. Will continue to monitor for a while to see if ES cluster is happier. |
[production] |
16:44 |
<bd808> |
Bot back online |
[tools.stashbot] |
16:12 |
<thcipriani@tin> |
Synchronized php-1.28.0-wmf.2/extensions/Wikidata/extensions/Wikibase/client/includes/Hooks/DataUpdateHookHandlers.php: [[gerrit:290256|Update Wikidata - fix file deletion issue on commons]] (duration: 00m 29s) |
[production] |
16:10 |
<bd808> |
Bot died due to https://github.com/bd808/tools-stashbot/issues/9 |
[tools.stashbot] |
16:02 |
<elukey> |
restarting yarn on analytics10* hosts to pick up the new Spark shuffler process |
[production] |
16:01 |
<volans> |
testing thread_pool_max_threads=2000 on db1072 (s1) [instead of db1076 (s2)] T133333 |
[production] |
15:49 |
<thcipriani> |
beta code update not running, disconnect-reconnect dance resulted in: [05/23/16 15:48:39] [SSH] Authentication failed. |
[releng] |
15:47 |
<volans> |
testing thread_pool_max_threads=2000 on db1076 (s2) T133333 |
[production] |
15:42 |
<thcipriani@tin> |
Synchronized wmf-config: SWAT: [[gerrit:290259|Final Commons configuration for $wgUploadDialog]] (duration: 00m 28s) |
[production] |
15:32 |
<thcipriani@tin> |
Synchronized wmf-config: SWAT: revert [[gerrit:289109|Final Commons configuration for $wgUploadDialog]] (duration: 00m 28s) |
[production] |
15:29 |
<thcipriani@tin> |
Synchronized wmf-config: SWAT: [[gerrit:289109|Final Commons configuration for $wgUploadDialog]] (duration: 00m 30s) |
[production] |
15:21 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:288101|Set interwiki sorting order for West Frisian Wikibooks]] (duration: 00m 25s) |
[production] |
15:13 |
<jynus> |
performing schema change on s3 T130692 |
[production] |
15:11 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:286168|Creation of page mover userright on enwiki]] (duration: 00m 30s) |
[production] |
15:06 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:290000|Adjust groups permissions on fa.wikipedia]] (duration: 00m 41s) |
[production] |
14:57 |
<elukey> |
suspended all the oozie bundles as prep step for https://gerrit.wikimedia.org/r/#/c/290252 (yes I know super paranoid mode on) |
[analytics] |
14:44 |
<godog> |
reboot restbase2005 in single user mode for T113714 |
[production] |
14:43 |
<filippo@palladium> |
conftool action : set/pooled=no; selector: restbase2005.codfw.wmnet |
[production] |
14:32 |
<jzerebecki> |
offlined integration-slave-trusty-1004 because it can't connect to mysql T135997 |
[releng] |
13:32 |
<hashar> |
Upgrading Jenkins git plugins and restarting Jenkins |
[releng] |
13:28 |
<chasemp> |
'apt-get install hhvm -y --force-yes' across trusty hosts to handle hhvm downgrade |
[tools] |
12:57 |
<moritzm> |
rolling restart of cassandra on maps-test cluster for openjdk security update |
[production] |
12:50 |
<Lokal_Profil> |
Deployed latest from Git, 50915bf (T55688) |
[tools.heritage] |
12:44 |
<mobrovac> |
restbase restarting to apply https://gerrit.wikimedia.org/r/#/c/289092/ |
[production] |
12:33 |
<jynus> |
stopping, backing up and reimage db1016 T135973 (it will also affect db2010 lag) |
[production] |
12:15 |
<filippo@palladium> |
conftool action : set/pooled=yes; selector: restbase2009.codfw.wmnet |
[production] |
12:15 |
<filippo@palladium> |
conftool action : set/pooled=yes; selector: restbase2008.codfw.wmnet |
[production] |
12:15 |
<filippo@palladium> |
conftool action : set/pooled=yes; selector: restbase2007.codfw.wmnet |
[production] |
11:42 |
<mobrovac> |
restbase deploying 75a94ee to restbase2009 |
[production] |
11:34 |
<valhallasw`cloud> |
temporarily offline for mass edit by Danny_B |
[tools.wikibugs] |
11:24 |
<godog> |
run puppet and roll-restart cassandra-metrics-collector on restbase codfw/eqiad |
[production] |
11:20 |
<godog> |
deploy new version of cassandra-metrics-collector T135385 |
[production] |
11:01 |
<hashar> |
Upgrading hhvm on Trusty slaves. Bring him hhvm compiled against libicu52 instead of libicu48 |
[releng] |
10:57 |
<moritzm> |
restarting hhvm on app servers in codfw for librsvg update |
[production] |
10:32 |
<_joe_> |
running updateCollations.php --force on ptwiki, T58041 |
[production] |
10:24 |
<moritzm> |
rolling restart of restbase1* for openjdk-8 update |
[production] |