2016-11-24
§
|
13:04 |
<akosiaris@puppetmaster1001> |
conftool action : set/weight=20; selector: thumbor1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=thumbor', 'service=thumbor']) |
[production] |
12:54 |
<gilles> |
restarting thumbor on thumbor1001 |
[production] |
12:49 |
<akosiaris> |
lower thumbor1001 load by 50% to easy debugging |
[production] |
12:48 |
<gilles> |
restarting thumbor on thumbor1001 |
[production] |
12:48 |
<akosiaris@puppetmaster1001> |
conftool action : set/weight=5; selector: thumbor1001.eqiad.wmnet (tags: ['dc=eqiad', 'cluster=thumbor', 'service=thumbor']) |
[production] |
12:36 |
<elukey> |
launched preferred-replica-election to re-add kafka1022 among the Topic partition leader brokers of the Analytics Kafka cluster (all metrics looks good) |
[production] |
11:41 |
<hoo> |
Killed the Wikidata JSON dump creation on snapshot1007: Wont succeed before Monday, due to T151356 |
[production] |
10:13 |
<_joe_> |
running commonswiki htmlCacheUpdate jobs on terbium to catch up with the backlog, monitoring caches for vhtcpd queue overflows T151196 |
[production] |
09:38 |
<marostegui> |
Stopping replication db1052 (depooled) for maintenance - T150960 |
[production] |
08:59 |
<marostegui> |
Deploy alter table S5 - dewiki.revision on db1092 (depooled) - T148967 |
[production] |
08:15 |
<_joe_> |
uploaded calico-cni 1.5.1 to jessie-wikimedia |
[production] |
07:32 |
<marostegui> |
Stopping MySQL db2070 for maintenance - https://phabricator.wikimedia.org/T149553 |
[production] |
02:35 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Thu Nov 24 02:35:10 UTC 2016 (duration 5m 15s) |
[production] |
02:29 |
<l10nupdate@tin> |
scap sync-l10n completed (1.29.0-wmf.3) (duration: 10m 39s) |
[production] |
00:28 |
<reedy@tin> |
Synchronized php-1.29.0-wmf.3/extensions/CentralAuth/maintenance/populateLocalAndGlobalIds.php: Some perf related improvements (duration: 00m 45s) |
[production] |
00:12 |
<demon@tin> |
Synchronized docroot/foundation/: rm more junk (duration: 00m 45s) |
[production] |
2016-11-23
§
|
23:11 |
<godog> |
cleanup older labs instances metrics from 'instances' hierarchy on graphite1001 |
[production] |
22:57 |
<mutante> |
phab2001 - installing vim upgrade |
[production] |
22:52 |
<godog> |
cleanup older labs instances metrics from 'instances' hierarchy on graphite2001 |
[production] |
21:59 |
<mutante> |
gerrit restarting for config change 323179 |
[production] |
21:07 |
<demon@tin> |
Finished scap: pruning old deployment branches (duration: 19m 14s) |
[production] |
20:48 |
<demon@tin> |
Started scap: pruning old deployment branches |
[production] |
20:42 |
<XenoRyet> |
Updated payments-wiki from f8ca94201a3f69ee8176f4984f1caa410ac90c49 to d7ed14407aa7be9a790778cae644c2b320bb7aa4 |
[production] |
19:24 |
<godog> |
swift eqiad-prod: ms-be1027 to weight 2000 T136631 |
[production] |
18:56 |
<marostegui> |
Shutting down db2034 for maintenance - T149553 |
[production] |
18:04 |
<volans@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=mw2092.codfw.wmnet |
[production] |
17:58 |
<demon@tin> |
Synchronized php-1.29.0-wmf.3/extensions/CentralAuth/maintenance/populateLocalAndGlobalIds.php: (no message) (duration: 00m 53s) |
[production] |
17:36 |
<marostegui> |
Stopping MySQL on db2070 for maintenance - https://phabricator.wikimedia.org/T149553 |
[production] |
16:24 |
<marostegui> |
Setting offline disk [32:4] on db1053 - looks like it is causing repl issues |
[production] |
16:01 |
<marostegui> |
Stopping replication db2070 for maintenance - T149553 |
[production] |
15:50 |
<dcausse> |
elastic@eqiad: ruwiki reindex done (T148344) |
[production] |
14:37 |
<dcausse> |
elastic@eqiad: reindexing ruwiki from terbium, logs in ~dcausse/bm25_reindex/cirrus_log (T148344) |
[production] |
14:33 |
<jynus> |
rebooting, upgrading db1092 while it is depooled for maintenance |
[production] |
14:31 |
<marostegui> |
Stopping replication db1095 (not pooled) - maintenance - T150960 |
[production] |
11:48 |
<_joe_> |
uploaded calico/kube-policy-controller:0.5.0 to the docker registry |
[production] |
10:24 |
<marostegui> |
Stopping replication on the following m3 hosts for maintenance - db1048, dbstore1002 (m3 instance), db2012 - T151384 |
[production] |
10:23 |
<jynus> |
stopping replication to dbstore1001 to change its masters |
[production] |
07:46 |
<marostegui> |
Stopping MySQL db2070 for maintenance - T149553 |
[production] |
07:29 |
<marostegui> |
Stopping replication on db1095 (depooled) for maintenance - T150960 |
[production] |
07:14 |
<marostegui> |
Stopping replication on db1052 (depooled) for maintenance - T150960 |
[production] |
03:20 |
<papaul> |
prometheus200[3-4] signing puppet certs, salt-key, initial run |
[production] |
02:33 |
<l10nupdate@tin> |
scap sync-l10n completed (1.29.0-wmf.3) (duration: 14m 13s) |
[production] |
02:04 |
<mutante> |
depooled mw2092 because it had I/O errors, dev sda |
[production] |
02:03 |
<dzahn@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=mw2092.codfw.wmnet |
[production] |
01:47 |
<Krenair> |
mw2092 seems broken |
[production] |
01:44 |
<krenair@tin> |
Synchronized php-1.29.0-wmf.3/extensions/VisualEditor/modules/ve-mw: https://gerrit.wikimedia.org/r/323080 and https://gerrit.wikimedia.org/r/323103 (duration: 00m 49s) |
[production] |
01:13 |
<bd808> |
Updated striker to c546f4c (T151409) |
[production] |
00:04 |
<maxsem@tin> |
Synchronized wmf-config/CommonSettings-labs.php: https://gerrit.wikimedia.org/r/#/c/323078/ (duration: 00m 49s) |
[production] |