2019-09-25
|
14:35 |
<filippo@cumin1001> |
END (FAIL) - Cookbook sre.hosts.decommission (exit_code=True) |
[production] |
14:34 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
14:29 |
<moritzm> |
restarting apache on grafana1001 to pick up Expat security update |
[production] |
14:14 |
<moritzm> |
restarting apache on various services to pick up Expat security update (releases, netmon, miscweb, graphite, planet, puppetboard) |
[production] |
14:02 |
<marostegui> |
Deploy schema change on db2086:3318 |
[production] |
14:00 |
<effie> |
Rolling restart thumbor for Expat update |
[production] |
13:55 |
<moritzm> |
rolling restart of apache on webperf* to pick up Expat security update |
[production] |
13:53 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Fully repool db1075 after BBU replacement', diff saved to https://phabricator.wikimedia.org/P9183 and previous config saved to /var/cache/conftool/dbconfig/20190925-135317-marostegui.json |
[production] |
13:52 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=False) |
[production] |
13:51 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:51 |
<filippo@cumin1001> |
END (ERROR) - Cookbook sre.hosts.decommission (exit_code=97) |
[production] |
13:51 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:45 |
<_joe_> |
restarting trafficserver on cp1075 to pick up the change |
[production] |
13:41 |
<gilles@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: T230817 Remove origin trials config (duration: 01m 05s) |
[production] |
13:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase weight for db1075 after BBU replacement', diff saved to https://phabricator.wikimedia.org/P9182 and previous config saved to /var/cache/conftool/dbconfig/20190925-133146-marostegui.json |
[production] |
13:31 |
<moritzm> |
installing remaining expat security updates |
[production] |
13:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase weight for db1075 after BBU replacement', diff saved to https://phabricator.wikimedia.org/P9181 and previous config saved to /var/cache/conftool/dbconfig/20190925-132147-marostegui.json |
[production] |
13:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Increase weight for db1075 after BBU replacement', diff saved to https://phabricator.wikimedia.org/P9180 and previous config saved to /var/cache/conftool/dbconfig/20190925-131149-marostegui.json |
[production] |
13:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Slowly repool db1075 after replacing its BBU', diff saved to https://phabricator.wikimedia.org/P9179 and previous config saved to /var/cache/conftool/dbconfig/20190925-130613-marostegui.json |
[production] |
12:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2085:3311 T233625', diff saved to https://phabricator.wikimedia.org/P9178 and previous config saved to /var/cache/conftool/dbconfig/20190925-125601-marostegui.json |
[production] |
12:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): ' Depool for schema change on the logging table: db2088:3312 db2084:3315 db2087:3316 db2086:3317 T233625', diff saved to https://phabricator.wikimedia.org/P9177 and previous config saved to /var/cache/conftool/dbconfig/20190925-125140-marostegui.json |
[production] |
12:47 |
<akosiaris@> |
helmfile [STAGING] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
12:47 |
<akosiaris@> |
helmfile [CODFW] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
12:46 |
<akosiaris@> |
helmfile [EQIAD] Ran 'sync' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
12:45 |
<akosiaris@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
12:45 |
<akosiaris@> |
helmfile [CODFW] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
12:44 |
<akosiaris@> |
helmfile [STAGING] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . |
[production] |
12:44 |
<marostegui> |
Repool labsdb1011 T233766 |
[production] |
12:41 |
<marostegui> |
Shutdown db1075 for onsite maintenance T233534 |
[production] |
12:37 |
<marostegui> |
Stop MySQL on db1075 for BBU replacement T233534 |
[production] |
12:37 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1075 for BBU replacement T233534', diff saved to https://phabricator.wikimedia.org/P9176 and previous config saved to /var/cache/conftool/dbconfig/20190925-123736-marostegui.json |
[production] |
12:34 |
<onimisionipe> |
depool wdqs1005 to allow it to catch up on lag |
[production] |
12:32 |
<@> |
helmfile [STAGING] Ran 'apply' command on namespace 'restrouter' for release 'staging' . |
[production] |
12:29 |
<@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'restrouter' for release 'production' . |
[production] |
12:28 |
<@> |
helmfile [CODFW] Ran 'sync' command on namespace 'restrouter' for release 'production' . |
[production] |
12:18 |
<mholloway-shell@deploy1001> |
Finished deploy [mobileapps/deploy@241b284]: Performance tweaks: domUtil + addSectionEditButtons (T229286) (duration: 05m 17s) |
[production] |
12:13 |
<mholloway-shell@deploy1001> |
Started deploy [mobileapps/deploy@241b284]: Performance tweaks: domUtil + addSectionEditButtons (T229286) |
[production] |
12:05 |
<akosiaris> |
depool kubernetes1001 and disable puppet on it for rsyslog mmkubernetes testing |
[production] |
12:05 |
<akosiaris@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=kubernetes1001.* |
[production] |
11:57 |
<vgutierrez> |
switch cp1078 from nginx to ats-tls - T231433 |
[production] |
11:37 |
<vgutierrez> |
switch cp2005 from nginx to ats-tls - T231433 |
[production] |
11:29 |
<onimisionipe> |
restarted wdqs-blazegraph on wdqs1005 |
[production] |
11:15 |
<onimisionipe> |
repooled wdqs1004 to reduce load on the wdqs public cluster |
[production] |
11:15 |
<Urbanecm> |
EU SWAT done |
[production] |
11:13 |
<vgutierrez> |
switch cp3035 from nginx to ats-tls - T231433 |
[production] |
11:07 |
<urbanecm@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: 127485c: Fully close bgwikinews (T233322) (duration: 01m 06s) |
[production] |
10:48 |
<vgutierrez> |
Switch from nginx to ats-tls on cp4022 - T231433 |
[production] |
10:46 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:46 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:27 |
<twentyafterfour@deploy1001> |
Finished deploy [releng/phatality@8f05ba9]: (no justification provided) (duration: 00m 16s) |
[production] |