2018-04-11
§
|
18:11 |
<mutante> |
deploy1001 is back on stretch once again - it has been removed from scap hosts though (T175288 T185275) |
[production] |
17:40 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:425522|Deploy page previews for anons on dewiki]] T191966 (duration: 00m 54s) |
[production] |
17:30 |
<sbisson@tin> |
Finished deploy [kartotherian/deploy@4cd5a19]: Deploying kartotherian v0.0.38 everywhere (duration: 02m 27s) |
[production] |
17:29 |
<Krinkle> |
actually re-enabled puppet on graphite2001 |
[production] |
17:28 |
<sbisson@tin> |
Started deploy [kartotherian/deploy@4cd5a19]: Deploying kartotherian v0.0.38 everywhere |
[production] |
17:24 |
<thcipriani@tin> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:423496|Enable RemexHtml on wikis with <50 issues in high priority linter cats]] T190731 (duration: 00m 59s) |
[production] |
16:53 |
<sbisson@tin> |
Finished deploy [kartotherian/deploy@4cd5a19]: Deploying kartotherian v0.0.38 to maps-test* (duration: 01m 16s) |
[production] |
16:51 |
<sbisson@tin> |
Started deploy [kartotherian/deploy@4cd5a19]: Deploying kartotherian v0.0.38 to maps-test* |
[production] |
16:44 |
<elukey> |
restart hadoop hdfs namenodes on analytics100[12] to pick up HDFS Trash settings - T189051 |
[production] |
16:35 |
<robh> |
cp2018 returned to service |
[production] |
16:33 |
<foks> |
See T191887 |
[production] |
16:24 |
<robh> |
cp2011 returned to service |
[production] |
16:23 |
<marostegui> |
Reload haproxy on dbproxy1011 to depool labsdb1009 |
[production] |
16:14 |
<elukey> |
reboot notebook1001 for kernel updates |
[production] |
16:11 |
<urandom> |
restarting cassandra, dev environment (testing default GC settings) -- T186751 |
[production] |
15:58 |
<Krinkle> |
Re-enabled puppet and coal on graphite2001 |
[production] |
15:43 |
<robh> |
cp2008 repooled after memory swap |
[production] |
15:20 |
<Krinkle> |
disabling coal service on graphite2001 and disabling puppet – T191239 |
[production] |
15:19 |
<jynus> |
fixing grant issue on db1114 |
[production] |
15:14 |
<ema> |
restart pybal on lvs1003 for logstash-{json,syslog} UDP monitoring config changes https://gerrit.wikimedia.org/r/#/c/425253/ |
[production] |
15:08 |
<ema> |
restart pybal on lvs1006 for logstash-{json,syslog} UDP monitoring config changes https://gerrit.wikimedia.org/r/#/c/425253/ |
[production] |
15:06 |
<robh> |
shutting down cp2008, cp2011, and cp2018 for onsite work |
[production] |
15:01 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Depool es1012 (duration: 01m 00s) |
[production] |
15:01 |
<marlier> |
Stopping coal on graphite2001.codfw.wmnet for data replay |
[production] |
14:54 |
<jynus@tin> |
Synchronized wmf-config/db-codfw.php: Repool es2013 (duration: 01m 00s) |
[production] |
14:54 |
<gehel> |
starting rolling restart of elasticsearch cirrus / eqiad for jvm upgrade |
[production] |
14:39 |
<moritzm> |
rolling restart of restbase in eqiad to pick up openssl update |
[production] |
14:38 |
<Krinkle> |
Turned regular coal back on (T191239) |
[production] |
14:37 |
<ppchelko@tin> |
Finished deploy [cpjobqueue/deploy@a090a3c]: Fix the low priority jobs topic names (duration: 00m 38s) |
[production] |
14:36 |
<ppchelko@tin> |
Started deploy [cpjobqueue/deploy@a090a3c]: Fix the low priority jobs topic names |
[production] |
14:15 |
<jynus> |
start reimage of es2013 |
[production] |
14:14 |
<marostegui> |
Deploy schema change on db1099:3318 - T187089 T185128 T153182 |
[production] |
14:13 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1099:3318 for alter table (duration: 01m 00s) |
[production] |
14:12 |
<jynus@tin> |
Synchronized wmf-config/db-codfw.php: Depool es2013 (duration: 01m 00s) |
[production] |
13:44 |
<ppchelko@tin> |
Finished deploy [cpjobqueue/deploy@3ba6580]: Enable second bulk of low-traffic jobs T190327 take 2 (duration: 00m 49s) |
[production] |
13:44 |
<ppchelko@tin> |
Started deploy [cpjobqueue/deploy@3ba6580]: Enable second bulk of low-traffic jobs T190327 take 2 |
[production] |
13:41 |
<ppchelko@tin> |
Finished deploy [cpjobqueue/deploy@2b59313]: Enable second bulk of low-traffic jobs T190327 (duration: 08m 27s) |
[production] |
13:37 |
<moritzm> |
rolling restart of restbase in codfw to pick up openssl update |
[production] |
13:33 |
<mobrovac@tin> |
Synchronized wmf-config/jobqueue.php: Switch a bulk of low-traffic jobs to EventBus for testwikis, file 2/2 - T190327 (duration: 01m 00s) |
[production] |
13:32 |
<ppchelko@tin> |
Started deploy [cpjobqueue/deploy@2b59313]: Enable second bulk of low-traffic jobs T190327 |
[production] |
13:32 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1072 (duration: 01m 07s) |
[production] |
13:31 |
<ppchelko@tin> |
Started deploy [cpjobqueue/deploy@2b59313]: Enable second bulk of low-traffic jobs T190327 |
[production] |
13:27 |
<marostegui> |
Drop prefstats table on s3 sanitarium master (db1072) this might cause lag on labs - T154490 |
[production] |
13:26 |
<moritzm> |
installing java security updates on kafka/main cluster |
[production] |
13:25 |
<marostegui@tin> |
Synchronized wmf-config/db-eqiad.php: Depool db1072 (duration: 01m 00s) |
[production] |
13:13 |
<marostegui> |
Drop prefstats table on s1 codfw master - db2048 (this might generate lag on codfw) - T154490 |
[production] |
13:12 |
<elukey> |
restart kafka brokers on kafka1012->23 for openjdk-7 upgrades |
[production] |
13:09 |
<marostegui> |
Drop prefstats table on s3 codfw master - db2043 (this might generate lag on codfw) - T154490 |
[production] |
13:01 |
<vgutierrez> |
Reimage lvs4007 as stretch |
[production] |
13:00 |
<jynus@tin> |
Synchronized wmf-config/db-codfw.php: Repool es2012 (duration: 01m 00s) |
[production] |