2020-02-19
ยง
|
17:40 |
<jynus> |
starting data check between db1078 and db1140:3313 T244958 |
[production] |
17:39 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Start reading for the new term store for clients up to Q4000 (T225057) (just incase of cache issue) (duration: 01m 04s) |
[production] |
17:26 |
<addshore@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: Start reading for the new term store for clients up to Q4000 (T225057) (duration: 01m 01s) |
[production] |
17:14 |
<ema> |
cp4026: repool after probe Connection:keep-alive experiment revert https://gerrit.wikimedia.org/r/573337 |
[production] |
17:12 |
<robh> |
cp1088 returned to service, cp1089 & cp1090 offline for firmware update via T243167 |
[production] |
16:44 |
<papaul> |
replacing ps1-a8-codfw mgmt in rack A8 will go down |
[production] |
16:37 |
<otto@deploy1001> |
Finished deploy [analytics/refinery@e23918a]: Updating eventgate-analytics port (T245203) and also eventlogging whitelist (duration: 12m 27s) |
[production] |
16:32 |
<ema> |
depool cp4026, 5xx |
[production] |
16:24 |
<otto@deploy1001> |
Started deploy [analytics/refinery@e23918a]: Updating eventgate-analytics port (T245203) and also eventlogging whitelist |
[production] |
16:13 |
<marostegui> |
Depool labsdb1011 to help replication to catch up |
[production] |
16:05 |
<elukey> |
Update analytics-in4 filter term eventgate for T245203 on cr1/cr2 eqiad |
[production] |
15:48 |
<ariel@deploy1001> |
Finished deploy [dumps/dumps@b42acb5]: fix temp stub generation, add pagerangeinfo cache, some unit tests (duration: 00m 03s) |
[production] |
15:48 |
<ariel@deploy1001> |
Started deploy [dumps/dumps@b42acb5]: fix temp stub generation, add pagerangeinfo cache, some unit tests |
[production] |
14:59 |
<marostegui> |
Stop mysql on es2021 - T243052 |
[production] |
14:31 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:29 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:29 |
<marostegui> |
Data checksum on db1084 T245621 |
[production] |
14:07 |
<marostegui> |
Upgrade and reboot db1084 - T245621 |
[production] |
14:02 |
<marostegui> |
Start mysql on db1084 without replication - T245621 |
[production] |
13:53 |
<jbond42> |
disable puppet to upgrade postgresql |
[production] |
13:30 |
<jynus@cumin1001> |
dbctl commit (dc=all): 'Depool db1084, lots of connection errors', diff saved to https://phabricator.wikimedia.org/P10458 and previous config saved to /var/cache/conftool/dbconfig/20200219-133057-jynus.json |
[production] |
12:25 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:573236|Start reading for the new term store for clients up to Q2000 (T225057)]], take II, the cache issue (duration: 01m 04s) |
[production] |
12:22 |
<ladsgroup@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:573236|Start reading for the new term store for clients up to Q2000 (T225057)]] (duration: 01m 06s) |
[production] |
11:56 |
<volans> |
better splay of periodic scripts that interact with Netbox - T244291 |
[production] |
11:43 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:41 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:08 |
<ladsgroup@deploy1001> |
Synchronized php-1.35.0-wmf.20/extensions/Wikibase/lib/includes/Store: Get rid of useless metrics in EntityTermLookupBase (T245592) (duration: 01m 04s) |
[production] |
11:06 |
<ladsgroup@deploy1001> |
Synchronized php-1.35.0-wmf.19/extensions/Wikibase/lib/includes/Store: Get rid of useless metrics in EntityTermLookupBase (T245592) (duration: 01m 12s) |
[production] |
11:01 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:58 |
<marostegui@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:58 |
<marostegui@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:58 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:58 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:58 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:45 |
<jynus> |
upgrading mariadb client on cumin hosts |
[production] |
10:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2089:3315, db2089:3316 after new package testing', diff saved to https://phabricator.wikimedia.org/P10457 and previous config saved to /var/cache/conftool/dbconfig/20200219-103806-marostegui.json |
[production] |
10:26 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:24 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:17 |
<jynus> |
stopping db2089 mariadb@s5 |
[production] |
10:12 |
<jiji@cumin1001> |
conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=apache2,name=mw135[0-5]*.eqiad.wmnet |
[production] |
10:12 |
<jiji@cumin1001> |
conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=nginx,name=mw135[0-5]*.eqiad.wmnet |
[production] |
10:11 |
<jiji@cumin1001> |
conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=nginx,name=mw1349.eqiad.wmnet |
[production] |
10:11 |
<jiji@cumin1001> |
conftool action : set/weight=30; selector: dc=eqiad,cluster=appserver,service=apache2,name=mw1349.eqiad.wmnet |
[production] |
10:09 |
<moritzm> |
updated tftpboot environment for stretch-bootif for the 9.12 point release T241359 |
[production] |
09:53 |
<jynus> |
stopping and upgrading db1140 instances |
[production] |
09:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2089:3315, db2089:3316 for new package testing', diff saved to https://phabricator.wikimedia.org/P10455 and previous config saved to /var/cache/conftool/dbconfig/20200219-095139-marostegui.json |
[production] |
09:51 |
<marostegui> |
Depool db2089:3315, db2089:3316 for new package testing |
[production] |
09:49 |
<akosiaris> |
T245516. Deploy mathoid chart version 0.0.27, removing logstash gelf configuration |
[production] |
09:46 |
<akosiaris@deploy1001> |
helmfile [EQIAD] Ran 'apply' command on namespace 'mathoid' for release 'production' . |
[production] |
09:43 |
<vgutierrez> |
test trafficserver 8.0.6-rc1 in cp40[26,32] |
[production] |