2016-10-19
§
|
10:27 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=no; selector: scb1003.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=scb', 'service=mobileapps']) |
[production] |
10:05 |
<_joe_> |
converting owner of files for l10nupdate usermod on tin |
[production] |
10:02 |
<_joe_> |
ran usermod -u 10002 l10nupdate on tin |
[production] |
09:50 |
<marostegui@mira> |
Synchronized wmf-config/db-eqiad.php: Repool db1064 after finishing the ALTER table - T147305 (duration: 01m 08s) |
[production] |
09:22 |
<moritzm> |
installing rsyslog bugfix updates |
[production] |
09:20 |
<marostegui> |
Deploying schema change on db1069 S4 instance commonswiki revision table - T147305 |
[production] |
08:15 |
<marostegui> |
Stopping db2062.codfw.wmnet to use it to clone another server - T146261 |
[production] |
08:14 |
<moritzm> |
installing tor security update on radium |
[production] |
08:12 |
<moritzm> |
installing quagga security updates |
[production] |
07:31 |
<_joe_> |
disabled profiling on mw1189, hhvm keeps crashing |
[production] |
06:50 |
<_joe_> |
installing jemalloc with memory profiling enabled on mw1189 |
[production] |
2016-10-18
§
|
23:04 |
<Dereckson> |
This full scap pulled three changes of the EU SWAT: [[gerrit:316069] TimedMediaHandler, [[gerrit:316585]] MobileFrontend, [[gerrit:315901]] ULS |
[production] |
23:03 |
<demon@mira> |
Finished scap: bringing full cluster back into sync (duration: 25m 13s) |
[production] |
22:38 |
<demon@mira> |
Started scap: bringing full cluster back into sync |
[production] |
22:28 |
<demon@mira> |
Synchronized README: Bringing co-masters back in sync (duration: 13m 10s) |
[production] |
21:37 |
<mutante> |
added Dpatrick to WMF LDAP group |
[production] |
18:32 |
<dereckson@mira> |
Synchronized wmf-config/LabsServices.php: Elastic@deployment-prep: Remove deployment-elastic08 from the cluster (no-op in prod, labs only) (duration: 00m 47s) |
[production] |
18:30 |
<dereckson@mira> |
Synchronized wmf-config/CirrusSearch-labs.php: Elastic@deployment-prep: force the number of replicas to 1 max (no-op in prod, labs only) (duration: 01m 18s) |
[production] |
17:55 |
<dcausse> |
warming up elastic@codfw from wasat.codfw.wmnet |
[production] |
17:34 |
<jynus> |
stopping mysql, cloning db1064->db1053; upgrading |
[production] |
17:01 |
<bblack> |
upgrading nginx on cache_maps - T144523 |
[production] |
16:57 |
<ejegg> |
updated payments-wiki from b4ad60e739b9dbb97f08a3623db961a74682422a to 27b464fd4383647fc2e7f0a613f290d6edccd22f |
[production] |
15:47 |
<godog> |
eqiad-prod: ms-be1022 to weight 3000 T136631 |
[production] |
15:16 |
<andrewbogott> |
upgrading puppetmaster on labtestcontrol2001 to trusty/3.8.5 |
[production] |
15:06 |
<bblack> |
upgrading nginx on all remaining cache_misc (eqiad, esams) - T144523 |
[production] |
14:54 |
<bblack> |
upgrading nginx on all cache_misc @ codfw - T144523 |
[production] |
14:54 |
<chasemp> |
rsync tools from labstore1001 to labstore1004 |
[production] |
14:43 |
<bblack> |
upgrading nginx on all cache_misc @ ulsfo - T144523 |
[production] |
14:40 |
<marostegui> |
Shutting down es2015 for hardware maintenance - T147769 |
[production] |
14:21 |
<bblack> |
upgrading nginx on cp4001 (cache_misc ulsfo) as prod canary |
[production] |
14:18 |
<bblack> |
uploading nginx-1.11.4+wmf3 to carbon jessie-wikimedia - T144523 |
[production] |
13:58 |
<jynus> |
restarting and upgrading db2049 and es2019 to test new config |
[production] |
13:53 |
<jynus> |
applying new init.d script on all mariadb 10 servers |
[production] |
12:52 |
<elukey> |
mw1169 back in service after reimage (MW Jobrunner) |
[production] |
11:55 |
<elukey> |
removed /etc/mysql/conf.d/research-client.cnf from stat1002 (root:root perms, not supposed to be there but only on stat1003) |
[production] |
11:37 |
<elukey> |
reimaging mw1169 to Debian Jessie (MW Jobrunner) |
[production] |
10:40 |
<elukey> |
mw1168.eqiad.wmnet back in service after reimage (MW Jobrunner) |
[production] |
09:28 |
<elukey> |
reimaging mw1168 to Debian Jessie (MW Jobrunner) |
[production] |
09:25 |
<elukey> |
varnishkafka restarting in upload/misc/maps with new settings (https://gerrit.wikimedia.org/r/316306) |
[production] |
09:18 |
<gehel> |
upgrade nodejs to 4.6.0 on maps2* servers |
[production] |
08:56 |
<moritzm> |
reimaging tin to jessie |
[production] |
08:53 |
<marostegui> |
Deploying ALTER table on S4 commonswiki (db1064 — last host) - T147305 |
[production] |
08:42 |
<jynus> |
clone db1052 -> db1053, will perform maintenance (db restarts, reboots on both) at the same time |
[production] |
07:57 |
<marostegui@mira> |
Synchronized wmf-config/db-eqiad.php: Depool db1064 as it needs an ALTER table and pool db1068 temporarily to serve vslow and dump service - T147305 (duration: 02m 53s) |
[production] |
03:19 |
<mutante> |
restarted grrrit-wm |
[production] |
03:18 |
<mutante> |
gerrit has logs now in /var/log/gerrit/ |
[production] |
03:15 |
<mutante> |
restarting gerrit for logging config change |
[production] |
02:37 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Tue Oct 18 02:37:01 UTC 2016 (duration 5m 49s) |
[production] |
02:31 |
<mwdeploy@tin> |
scap sync-l10n completed (1.28.0-wmf.22) (duration: 10m 20s) |
[production] |
00:48 |
<bblack> |
restarting API hhvms with >40% mem usage via salt every 10 minutes in a loop from here forward. screen session on neodymium, named api-hhvm-restarts |
[production] |