2019-09-17
ยง
|
13:14 |
<jbond42> |
currently running octocatalog-diff for all hosts from elnath |
[production] |
13:02 |
<marostegui> |
Start replication on db1130 db1104 db1085 db1086 after PDU maintenance is completed - T227539 |
[production] |
13:01 |
<cmjohnson1> |
The PDU swap in rack B3 eqiad is finished. |
[production] |
12:30 |
<mobrovac> |
bootstrap restbase2010-c - T224553 |
[production] |
11:32 |
<Urbanecm> |
EU SWAT is done |
[production] |
11:31 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.ipmi-password-reset (exit_code=0) |
[production] |
11:31 |
<dzahn@cumin1001> |
Updating IPMI password on 8 hosts - dzahn@cumin1001 |
[production] |
11:31 |
<urbanecm@deploy1001> |
Synchronized wmf-config/VariantSettings.php: SWAT: 290e207: Add channels for the Translate and TranslationsNotification extension (T221119, T144780, T143073) (duration: 00m 56s) |
[production] |
11:30 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
11:30 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.ipmi-password-reset (exit_code=99) |
[production] |
11:30 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
11:29 |
<dzahn@cumin1001> |
END (ERROR) - Cookbook sre.hosts.ipmi-password-reset (exit_code=97) |
[production] |
11:29 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.ipmi-password-reset |
[production] |
11:27 |
<awight@deploy1001> |
Synchronized php-1.34.0-wmf.22/extensions/FileImporter: SWAT: [[gerrit:537345|Use https rather than protcol-relative remote API URLs (T228851)]] (duration: 00m 58s) |
[production] |
11:24 |
<cmjohnson1> |
commencing pdu swap rack b3 eqiad T227539 |
[production] |
11:22 |
<awight@deploy1001> |
Synchronized wmf-config/VariantSettings.php: SWAT: [[gerrit:536732|Update ORES filter threshold configuration for new huwiki model (T230031)]] (duration: 00m 55s) |
[production] |
11:17 |
<awight@deploy1001> |
Synchronized wmf-config/VariantSettings.php: SWAT: [[gerrit:537092|Enable EditorJourney for euwiki (T232061)]] (duration: 00m 56s) |
[production] |
11:13 |
<Urbanecm> |
Run mwscript emptyUserGroup.php --wiki=aawiki 'inactive' (T150538) |
[production] |
10:58 |
<mobrovac> |
bootstrap restbase2010-b - T224553 |
[production] |
10:44 |
<vgutierrez> |
replacing nginx with ATS in cp1076 (upload cluster) - T231433 |
[production] |
09:48 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool and stop replication on db1130 db1104 db1085 db1086 (lag will appear on s6 on labsdb) for PDU maintenance - T227539', diff saved to https://phabricator.wikimedia.org/P9116 and previous config saved to /var/cache/conftool/dbconfig/20190917-094827-marostegui.json |
[production] |
09:46 |
<marostegui> |
Depool and stop replication on db1130 db1104 db1085 db1086 (lag will appear on s6 on labsdb) for PDU maintenance - T227539 |
[production] |
09:30 |
<hashar> |
Restarting CI jenkins |
[production] |
09:29 |
<marostegui> |
Downtime db1073 db1130 db1104 db1085 db1086 for the PDU maintenance T227539 |
[production] |
09:18 |
<jynus@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:16 |
<mobrovac> |
bootstrap restbase2010-a - T224553 |
[production] |
09:15 |
<jynus@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:05 |
<jiji@deploy1001> |
Synchronized wmf-config/CommonSettings.php: Push PHP7 traffic to 100% of users who accept cookies - T219150 (duration: 00m 57s) |
[production] |
08:37 |
<vgutierrez> |
upgrading ATS to 8.0.5-1wm8 on cp3034 - T231849 T232724 |
[production] |
07:58 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Pool db1074 with just 50 to keep its warmness level just in case T231638', diff saved to https://phabricator.wikimedia.org/P9115 and previous config saved to /var/cache/conftool/dbconfig/20190917-075807-marostegui.json |
[production] |
07:48 |
<effie> |
Enable puppet on mw* |
[production] |
07:42 |
<elukey> |
reboot analytics-tool1004 (host running superset) for kernel updates |
[production] |
07:41 |
<marostegui> |
Stop mysql on db1063 for decommissioning T232564 |
[production] |
07:40 |
<marostegui> |
Remove db1063 from puppet and zarcillo T232564 |
[production] |
07:29 |
<vgutierrez> |
repooling cp5007 without wikibase configuration - T99531 |
[production] |
07:23 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:21 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:19 |
<vgutierrez> |
depooling cp5007 to ensure that wikibase removal goes as expected - T99531 |
[production] |
07:10 |
<vgutierrez> |
getting rid of wikibase TLS certificate & nginx configuration on the text cache cluster - T99531 |
[production] |
06:56 |
<vgutierrez> |
upgrading ATS to 8.0.5-1wm8 on cp2002, cp4021 and cp5001 - T231849 |
[production] |
06:55 |
<vgutierrez> |
uploaded trafficserver 8.0.5-1wm8 to apt.wikimedia.org (stretch) - T231849 |
[production] |
06:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1066 T233071', diff saved to https://phabricator.wikimedia.org/P9114 and previous config saved to /var/cache/conftool/dbconfig/20190917-065342-marostegui.json |
[production] |
06:49 |
<moritzm> |
reimage restbase2010 to Stretch T224553 |
[production] |
05:57 |
<vgutierrez> |
upgrading ATS to 8.0.5-1wm7 on cp2002 and cp4021 - T232724 |
[production] |
05:56 |
<vgutierrez> |
uploaded trafficserver 8.0.5-1wm7 to apt.wikimedia.org (stretch) - T232298 T232724 |
[production] |
05:23 |
<effie> |
disable puppet on mw* servers for 536979 |
[production] |
05:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Promote db1122 to s2 master and remove read-only from s2 T230785', diff saved to https://phabricator.wikimedia.org/P9113 and previous config saved to /var/cache/conftool/dbconfig/20190917-050133-marostegui.json |
[production] |
05:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set s2 as read-only for maintenance T230785', diff saved to https://phabricator.wikimedia.org/P9112 and previous config saved to /var/cache/conftool/dbconfig/20190917-050043-marostegui.json |
[production] |
05:00 |
<marostegui> |
Starting s2 failover from db1066 to db1122 - T230785 |
[production] |
04:56 |
<effie> |
Downtiming HTTPS-blog on icing - T232412 |
[production] |