2017-08-01
ยง
|
17:53 |
<paladox> |
upgrading gerrit on gerrit-new.wmflabs.org to 2.14.2-2449-gc8fa38496c (bazel build release) (master branch). Testing wikimedia branding for prod and the move change feature |
[git] |
17:24 |
<ottomata> |
beginning druid upgrade to 0.9.2 http://druid.io/docs/0.9.2/operations/rolling-updates.html |
[analytics] |
17:23 |
<twentyafterfour> |
MediaWiki train for 1.30.0-wmf.12 - finished `scap prep` & `scap patch` refs T168053 |
[production] |
17:10 |
<ottomata> |
pausing all druid oozie coordinators |
[analytics] |
16:41 |
<ejegg> |
updated CiviCRM from 23f2bbf73557a7a88e783f68459112cf4bba1c79 to 5c741b1f42da80a30a93d26338eb89912f72f1eb |
[production] |
16:25 |
<twentyafterfour> |
MediaWiki Train: Creating new branch wmf/1.30.0-wmf.12 from master. See T170631 for deployment blockers. |
[production] |
16:17 |
<dcausse> |
restarting elastic on relforge100x servers to pick up new version of the plugins |
[production] |
15:54 |
<bblack> |
varnish backend restart on cp1072 (mailbox lag) |
[production] |
15:45 |
<hashar> |
Image snapshot-ci-jessie-1501601670 in wmflabs-eqiad is ready && purging old instances T161861 |
[releng] |
15:44 |
<hashar> |
Debug: Executing '/usr/bin/npm install -g npm@3.8.3' - T161861 |
[releng] |
15:43 |
<bblack> |
rebooting lvs1002 |
[production] |
15:42 |
<marostegui> |
db1069: Migrate trwiktionary.page from TokuDB to InnoDB |
[production] |
15:40 |
<bblack> |
stopping pybal on lvs1002 for impending reboot |
[production] |
15:39 |
<bblack> |
stopping pybal on 1002 for impending reboot |
[production] |
15:34 |
<hashar> |
Refreshing nodepool Jessie image to bump npm from 2.x to 3.8.x T161861 |
[releng] |
15:33 |
<marostegui> |
Stop s3 on db1069 - replication stuck |
[production] |
15:17 |
<marostegui> |
Stop MySQL on db1055 for maintenance - https://phabricator.wikimedia.org/T148507 |
[production] |
14:56 |
<andrewbogott> |
rebooting labvirt1016 |
[production] |
14:46 |
<marostegui> |
Deploy InnoDB compression on s3 - db2074 for the following tables (revision, pagelinks and templatelinks) - T170662 |
[production] |
14:45 |
<ema> |
lvs1001-1003 (eqiad primaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
14:28 |
<ema> |
lvs1004-1006 (eqiad secondaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
14:27 |
<bblack> |
restart varnish backend on cp1049 (mailbox lag) |
[production] |
14:12 |
<bblack> |
restart varnish backend on cp1074 (mailbox lag) |
[production] |
13:53 |
<ema@neodymium> |
conftool action : set/pooled=yes; selector: name=achernar.wikimedia.org,service=pdns_recursor |
[production] |
13:26 |
<ema@neodymium> |
conftool action : set/pooled=no; selector: name=achernar.wikimedia.org,service=pdns_recursor |
[production] |
13:10 |
<hashar@tin> |
Synchronized wmf-config/InitialiseSettings.php: Enable OOjs UI EditPage on all wikis except Commons (duration: 00m 44s) |
[production] |
13:06 |
<marostegui> |
Compress s2 on db1102 - T172169 |
[production] |
12:51 |
<zhuyifei1999_> |
Deployed ba54a61 on quarry-main-01 T164390 |
[quarry] |
12:49 |
<elukey> |
restart hive daemons on analytics1003 to pick up new jvm settings (bigger Xmx, JMX ports) |
[production] |
12:49 |
<elukey> |
restart hive daemons on analytics1003 to pick up new jvm settings (bigger Xmx, JMX ports) |
[analytics] |
12:06 |
<dcausse> |
100% cpu spike on elastic1023 caused percentiles to jump for a short period of time (T169498) |
[production] |
12:04 |
<elukey> |
stop eventlogging_sync on analytics-slaves && rename all CookieBlock* tables (log db) to CookieBlock*_backup - T171883 |
[production] |
11:52 |
<marostegui> |
Stop MySQL on db2057 to copy its data to db2074 - T170662 |
[production] |
11:51 |
<paladox> |
cherry picking https://gerrit.wikimedia.org/r/#/c/369001/ to test for any errors. |
[phabricator] |
11:36 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Depool db2057 - T170662 (duration: 00m 43s) |
[production] |
11:10 |
<marostegui@tin> |
Synchronized wmf-config/db-codfw.php: Repool db2065 - T170662 (duration: 00m 43s) |
[production] |
10:31 |
<hashar> |
Enabling Zuul/CI again and reenabling puppet on contint1001 |
[production] |
10:24 |
<hashar> |
contint1001 stopped puppet agent to prevent Zuul server to come back up |
[production] |
10:12 |
<hashar> |
Stopped Zuul / CI for mass mediawiki extension changes |
[releng] |
10:12 |
<hashar> |
Stopped Zuul / CI for mass mediawiki extension changes |
[production] |
10:05 |
<elukey> |
suspended again webrequest-load-bundle as prep step to restart the hive daemons |
[analytics] |
08:55 |
<ema> |
lvs2001-2003 (codfw primaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
08:32 |
<ema> |
lvs2004-2006 (codfw secondaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
08:03 |
<ema> |
lvs3*: upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
07:58 |
<elukey> |
suspended webrequest-load-bundle as prep step to restart the hive daemons |
[analytics] |
07:40 |
<ema> |
lvs4001, lvs4002 (ulsfo primaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
07:35 |
<ema> |
lvs4003, lvs4004 (ulsfo secondaries): upgrade to pybal 1.13.11 - one-packet-scheduling, instrumentation fixes. T104442, T103882 |
[production] |
07:33 |
<ema> |
pybal 1.13.11 uploaded to apt.w.o T103882 |
[production] |
07:03 |
<elukey> |
restarted mobile_apps-session_metrics-coord-global-30days failed job via Hue |
[analytics] |
06:20 |
<marostegui> |
Stop MySQL on db2065 to copy its data to db2073 - T170662 |
[production] |