2020-03-09
§
|
07:31 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
07:29 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
07:27 |
<elukey> |
deploy jupyterhub on notebook100[3,4] (manual venv re-creation) to allow the use of the user.slice - T247055 |
[analytics] |
07:26 |
<elukey> |
upgrade nodejs from 6->10 on stat1* and notebook1* |
[analytics] |
07:13 |
<marostegui> |
Stop MySQL on db2114 to upgrade to buster |
[production] |
07:09 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2114 for reimage to buster - T246604', diff saved to https://phabricator.wikimedia.org/P10654 and previous config saved to /var/cache/conftool/dbconfig/20200309-070937-marostegui.json |
[production] |
05:34 |
<vgutierrez> |
restart ats-tls, ats-be and varnish-fe on cp3053 to clean up daemon restart alerts - T247195 |
[production] |
2020-03-06
§
|
23:50 |
<mutante> |
install1003/2003 - starting DHCP servers and letting puppet stop them again to clear systemd state |
[production] |
23:04 |
<mutante> |
signing puppet certs for install1003/install2003, initial puppet runs |
[production] |
22:42 |
<James_F> |
Raised executor count on contint1001 from 3 to 12 for T247109 |
[releng] |
22:33 |
<reedy@deploy1001> |
Synchronized wmf-config/interwiki-labs.php: T247091 (duration: 00m 57s) |
[production] |
22:17 |
<jeh> |
delete shinken-puppetmaster-01 T241719 |
[shinken] |
22:15 |
<jeh> |
migrate existing VMs to new shinken-puppetmaster-02 (local commits restored from shinken-puppetmaster-01) T241719 |
[shinken] |
22:09 |
<ebernhardson@deploy1001> |
Finished deploy [wikimedia/discovery/analytics@18f13e4]: update to pyhton3.7, ship articletopic propagation (duration: 00m 36s) |
[production] |
22:08 |
<ebernhardson@deploy1001> |
Started deploy [wikimedia/discovery/analytics@18f13e4]: update to pyhton3.7, ship articletopic propagation |
[production] |
21:09 |
<jeh> |
create new puppetmaster shinken-puppetmaster-02 T241719 |
[shinken] |
20:51 |
<jeh> |
add jhedden to CloudVPS project |
[shinken] |
20:39 |
<jeh> |
delete old puppetmaster cloudstore-puppetmaster-01 T241719 |
[cloudstore] |
20:23 |
<ebernhardson> |
post-deploy restart mjolnir bulk and msearch daemons across eqiad and codfw |
[production] |
20:07 |
<ebernhardson@deploy1001> |
Finished deploy [search/mjolnir/deploy@dda3d28]: Re-deploy python3.7 upgrade (duration: 05m 14s) |
[production] |
20:02 |
<ebernhardson@deploy1001> |
Started deploy [search/mjolnir/deploy@dda3d28]: Re-deploy python3.7 upgrade |
[production] |
19:57 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
19:56 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) |
[production] |
19:48 |
<mutante> |
re-creating install1003 and install2003 with same specs as before but public IP (T244390) |
[production] |
19:47 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:46 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:46 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |
19:46 |
<dzahn@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
19:40 |
<marxarelli> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/577653 |
[releng] |
19:33 |
<marxarelli> |
creating 2 jenkins jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/577653 |
[releng] |
19:32 |
<zhuyifei1999_> |
changed to analytics replica for database queries and restarted celery workers T246970 |
[quarry] |
19:24 |
<jeh> |
create new puppetmaster cloudstore-puppetmaster-02 T241719 |
[cloudstore] |
19:05 |
<James_F> |
Manually pushing trigger-helloworldoid-pipeline-test over to contint1001 to test T247109 |
[releng] |
18:56 |
<marxarelli> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/577626 |
[releng] |
18:54 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
18:53 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:52 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) |
[production] |
18:52 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
18:52 |
<marxarelli> |
creating 2 jobs for https://gerrit.wikimedia.org/r/c/integration/config/+/577626 |
[releng] |