2020-02-06
§
|
11:11 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:21 |
<akosiaris> |
undo "switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348". no effect observed |
[production] |
10:20 |
<akosiaris> |
undo "switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348" |
[production] |
10:19 |
<vgutierrez> |
Enabling HTTP keepalive between ats-tls and varnish-frontend on cp4031 - T244464 |
[production] |
10:00 |
<vgutierrez> |
depool and reimage cp3065 as buster - T242093 |
[production] |
09:59 |
<vgutierrez> |
upload trafficserver 8.0.5-1wm14 to apt.wm.o (buster) - T242093 |
[production] |
09:08 |
<dcausse@deploy1001> |
Finished deploy [wdqs/wdqs@4306c64]: deploying wdqs 0.3.14-SNAPSHOT and gui 5a1af3b (duration: 11m 41s) |
[production] |
08:56 |
<dcausse@deploy1001> |
Started deploy [wdqs/wdqs@4306c64]: deploying wdqs 0.3.14-SNAPSHOT and gui 5a1af3b |
[production] |
08:45 |
<dcausse@deploy1001> |
Finished deploy [wdqs/wdqs@4306c64]: deploying wdqs 0.3.14-SNAPSHOT and gui 5a1af3b to wdqs1010.eqiad.wmnet (duration: 00m 29s) |
[production] |
08:44 |
<dcausse@deploy1001> |
Started deploy [wdqs/wdqs@4306c64]: deploying wdqs 0.3.14-SNAPSHOT and gui 5a1af3b to wdqs1010.eqiad.wmnet |
[production] |
08:23 |
<marostegui> |
Reboot dbproxy1012 and dbproxy1014 for upgrade |
[production] |
08:18 |
<dcausse> |
restarting blazegraph on wdqs1006: T242453 |
[production] |
08:17 |
<akosiaris> |
switchover selectively eventgate-analytics.discovery.wmnet to codfw for mw1331 and mw1348 to |
[production] |
06:59 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1101:3317 - T239453', diff saved to https://phabricator.wikimedia.org/P10319 and previous config saved to /var/cache/conftool/dbconfig/20200206-065906-marostegui.json |
[production] |
06:52 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1098:3317 - T239453', diff saved to https://phabricator.wikimedia.org/P10318 and previous config saved to /var/cache/conftool/dbconfig/20200206-065238-marostegui.json |
[production] |
06:46 |
<elukey> |
run puppet on all ores[12]* nodes |
[production] |
02:49 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
02:42 |
<mutante> |
ganeti - Creating new VM named install2003.codfw.wmnet in codfw with row=A vcpu=1 memory=1 gigabytes disk=20 gigabytes link=private (T244390) |
[production] |
02:39 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
02:30 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
02:21 |
<mutante> |
ganeti - Creating new VM named install1003.eqiad.wmnet in eqiad with row=C vcpu=1 memory=1 gigabytes disk=20 gigabytes link=private (T244390) |
[production] |
02:20 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
2020-02-05
§
|
23:30 |
<ebernhardson> |
delete search indices duplicated on multiple clusters for: hywwiki, chrwiktionary, gcrwiki, mnwwiki, noboard_chapterswikimedia nqowiki nrmwiki outreachwiki and srnwiki |
[production] |
23:08 |
<mholloway-shell@deploy1001> |
Finished deploy [mobileapps/deploy@a51f927]: Update mobileapps to a7928fa (duration: 10m 48s) |
[production] |
22:57 |
<mholloway-shell@deploy1001> |
Started deploy [mobileapps/deploy@a51f927]: Update mobileapps to a7928fa |
[production] |
22:07 |
<mutante> |
Gerrit - added ppchelko to 'wmf-deployment' Gerrit group (he is already in deployment admin group) (T244389) |
[production] |
21:37 |
<arlolra@deploy1001> |
Finished deploy [parsoid/deploy@01d9d3d]: Updating Parsoid to 74730a3 (duration: 03m 07s) |
[production] |
21:33 |
<arlolra@deploy1001> |
Started deploy [parsoid/deploy@01d9d3d]: Updating Parsoid to 74730a3 |
[production] |
21:31 |
<mutante> |
killing and restarting wikibugs, it was reporting each update twice |
[production] |
20:51 |
<joal@deploy1001> |
Finished deploy [analytics/refinery@a47f0d5] (thin): Analytics regular weekly deploy (duration: 00m 07s) |
[production] |
20:51 |
<joal@deploy1001> |
Started deploy [analytics/refinery@a47f0d5] (thin): Analytics regular weekly deploy |
[production] |
20:51 |
<joal@deploy1001> |
Finished deploy [analytics/refinery@a47f0d5]: Analytics regular weekly deploy (duration: 13m 28s) |
[production] |
20:50 |
<mutante> |
ores1004 - systemctl start celery-ores-worker |
[production] |
20:45 |
<twentyafterfour@deploy1001> |
Synchronized php: group1 wikis to 1.35.0-wmf.18 refs T233866 (duration: 01m 07s) |
[production] |
20:44 |
<twentyafterfour@deploy1001> |
rebuilt and synchronized wikiversions files: group1 wikis to 1.35.0-wmf.18 refs T233866 |
[production] |
20:37 |
<joal@deploy1001> |
Started deploy [analytics/refinery@a47f0d5]: Analytics regular weekly deploy |
[production] |
20:34 |
<dzahn@cumin1001> |
conftool action : set/weight=25; selector: name=mw1269.eqiad.wmnet |
[production] |
20:25 |
<dzahn@cumin1001> |
conftool action : set/weight=25; selector: name=mw1267.eqiad.wmnet |
[production] |
20:25 |
<mutante> |
mw1267 restarting php7.2-fpm |
[production] |
20:21 |
<joal@deploy1001> |
Finished deploy [analytics/hdfs-tools/deploy@714e2d0]: Deploy bug fix version (duration: 00m 08s) |
[production] |
20:21 |
<joal@deploy1001> |
Started deploy [analytics/hdfs-tools/deploy@714e2d0]: Deploy bug fix version |
[production] |
20:09 |
<twentyafterfour> |
Preparing to deploy wmf/1.35.0-wmf.18 to group1 wikis refs T233866 |
[production] |
20:09 |
<moritzm> |
installing git security updates for jessie |
[production] |
20:00 |
<moritzm> |
installing unzip security updates |
[production] |
19:44 |
<mutante> |
LDAP - added spramduya to wmf group (T243802) |
[production] |
19:38 |
<jforrester@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Clean up VisualEditor settings (duration: 01m 07s) |
[production] |
19:38 |
<ebernhardson> |
restart mjolnir-kafka-bulk-daemon across eqiad, daemons appear stuck and not reading new messages |
[production] |
19:19 |
<jforrester@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: T238029 Enable InukaPageView logging on production Wikipedias (duration: 01m 07s) |
[production] |
19:15 |
<jforrester@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Sync back revert of 975b4bbb9 (duration: 01m 06s) |
[production] |
19:10 |
<jforrester@deploy1001> |
scap failed: average error rate on 4/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/db09a36be5ed3e81155041f7d46ad040 for details) |
[production] |