2020-03-06
ยง
|
23:50 |
<mutante> |
install1003/2003 - starting DHCP servers and letting puppet stop them again to clear systemd state |
[production] |
23:04 |
<mutante> |
signing puppet certs for install1003/install2003, initial puppet runs |
[production] |
22:33 |
<reedy@deploy1001> |
Synchronized wmf-config/interwiki-labs.php: T247091 (duration: 00m 57s) |
[production] |
22:09 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@18f13e4]: update to pyhton3.7, ship articletopic propagation (duration: 00m 36s) |
[production] |
22:08 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@18f13e4]: update to pyhton3.7, ship articletopic propagation |
[production] |
20:23 |
<ebernhardson> |
post-deploy restart mjolnir bulk and msearch daemons across eqiad and codfw |
[production] |
20:07 |
<ebernhardson@deploy1001> |
Finished deploy [search/mjolnir/deploy@dda3d28]: Re-deploy python3.7 upgrade (duration: 05m 14s) |
[production] |
20:02 |
<ebernhardson@deploy1001> |
Started deploy [search/mjolnir/deploy@dda3d28]: Re-deploy python3.7 upgrade |
[production] |
19:57 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
19:56 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
19:48 |
<mutante> |
re-creating install1003 and install2003 with same specs as before but public IP (T244390) |
[production] |
19:47 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:46 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:46 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |
19:46 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
18:54 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:53 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:52 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
18:52 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:46 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:44 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:07 |
<mutante> |
sudo -i cumin -b 15 'mw23[25-34].codfw.wmnet' 'sudo -u dzahn scap pull' |
[production] |
18:05 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw233[0-4].codfw.wmnet |
[production] |
18:05 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw232[5-9].codfw.wmnet |
[production] |
18:04 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw233[0-4].codfw.wmnet |
[production] |
18:04 |
<dzahn@cumin1001> |
conftool action : set/weight=15; selector: name=mw232[5-9].codfw.wmnet |
[production] |
17:42 |
<krinkle@deploy1001> |
Synchronized wmf-config/wgConf.php: I260bafdb8e (no-op) (duration: 01m 00s) |
[production] |
17:28 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:26 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:23 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
17:23 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:54 |
<reedy@deploy1001> |
Synchronized php-1.35.0-wmf.22/extensions/WikimediaMaintenance/dumpInterwiki.php: T247097 (duration: 01m 00s) |
[production] |
16:40 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
16:40 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:40 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
16:40 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
15:11 |
<moritzm> |
installing libtimedate-perl updates from Stretch point release |
[production] |
15:07 |
<reedy@deploy1001> |
Synchronized langlist-labs: T247091 (duration: 01m 05s) |
[production] |
14:53 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) |
[production] |
14:50 |
<elukey@cumin1001> |
START - Cookbook sre.aqs.roll-restart |
[production] |
14:44 |
<XioNoX> |
add cloud-out4 firewall filter in codfw - T246887 |
[production] |
11:56 |
<akosiaris> |
T238658. kubernetes1001 pooled for eventstreams, weight=1 which should account for 2.1% of traffic |
[production] |
11:51 |
<akosiaris@cumin1001> |
conftool action : set/pooled=yes; selector: dc=eqiad,service=eventstreams,name=kubernetes1001.* |
[production] |
11:50 |
<akosiaris@cumin1001> |
conftool action : set/weight=1; selector: dc=eqiad,service=eventstreams,name=kube.* |
[production] |
10:21 |
<hnowlan@deploy1001> |
helmfile [STAGING] Ran 'sync' command on namespace 'changeprop' for release 'staging' . |
[production] |
10:16 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.presto.roll-restart-workers (exit_code=0) |
[production] |
10:10 |
<moritzm> |
rolling restart of Exim on mx* to pick up libidn security updates |
[production] |
10:06 |
<elukey@cumin1001> |
START - Cookbook sre.presto.roll-restart-workers |
[production] |
10:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1074', diff saved to https://phabricator.wikimedia.org/P10648 and previous config saved to /var/cache/conftool/dbconfig/20200306-100628-marostegui.json |
[production] |
10:03 |
<moritzm> |
rolling restart of labweb* to pick up libidn security updates |
[production] |