2020-03-06
§
|
23:50 |
<mutante> |
install1003/2003 - starting DHCP servers and letting puppet stop them again to clear systemd state |
[production] |
23:04 |
<mutante> |
signing puppet certs for install1003/install2003, initial puppet runs |
[production] |
22:42 |
<James_F> |
Raised executor count on contint1001 from 3 to 12 for T247109 |
[releng] |
22:33 |
<reedy@deploy1001> |
Synchronized wmf-config/interwiki-labs.php: T247091 (duration: 00m 57s) |
[production] |
22:17 |
<jeh> |
delete shinken-puppetmaster-01 T241719 |
[shinken] |
22:15 |
<jeh> |
migrate existing VMs to new shinken-puppetmaster-02 (local commits restored from shinken-puppetmaster-01) T241719 |
[shinken] |
22:09 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@18f13e4]: update to pyhton3.7, ship articletopic propagation (duration: 00m 36s) |
[production] |
22:08 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@18f13e4]: update to pyhton3.7, ship articletopic propagation |
[production] |
21:09 |
<jeh> |
create new puppetmaster shinken-puppetmaster-02 T241719 |
[shinken] |
20:51 |
<jeh> |
add jhedden to CloudVPS project |
[shinken] |
20:39 |
<jeh> |
delete old puppetmaster cloudstore-puppetmaster-01 T241719 |
[cloudstore] |
20:23 |
<ebernhardson> |
post-deploy restart mjolnir bulk and msearch daemons across eqiad and codfw |
[production] |
20:07 |
<ebernhardson@deploy1001> |
Finished deploy [search/mjolnir/deploy@dda3d28]: Re-deploy python3.7 upgrade (duration: 05m 14s) |
[production] |
20:02 |
<ebernhardson@deploy1001> |
Started deploy [search/mjolnir/deploy@dda3d28]: Re-deploy python3.7 upgrade |
[production] |
19:57 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
19:56 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
19:48 |
<mutante> |
re-creating install1003 and install2003 with same specs as before but public IP (T244390) |
[production] |
19:47 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:46 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:46 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |
19:46 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:40 |
<marxarelli> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/577653 |
[releng] |
19:33 |
<marxarelli> |
creating 2 jenkins jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/577653 |
[releng] |
19:32 |
<zhuyifei1999_> |
changed to analytics replica for database queries and restarted celery workers T246970 |
[quarry] |
19:24 |
<jeh> |
create new puppetmaster cloudstore-puppetmaster-02 T241719 |
[cloudstore] |
19:05 |
<James_F> |
Manually pushing trigger-helloworldoid-pipeline-test over to contint1001 to test T247109 |
[releng] |
18:56 |
<marxarelli> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/577626 |
[releng] |
18:54 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:53 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:52 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
18:52 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:52 |
<marxarelli> |
creating 2 jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/577626 |
[releng] |
18:46 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:44 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:09 |
<jeh> |
restart designate_floating_ip_ptr_records_updater.service on cloudcontrol1003 |
[openstack] |
18:07 |
<mutante> |
sudo -i cumin -b 15 'mw23[25-34].codfw.wmnet' 'sudo -u dzahn scap pull' |
[production] |