2021-01-21
|
06:54 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db1087', diff saved to https://phabricator.wikimedia.org/P13861 and previous config saved to /var/cache/conftool/dbconfig/20210121-065408-marostegui.json |
[production] |
06:49 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1087 and pool db1099:3318 into s8 vslow', diff saved to https://phabricator.wikimedia.org/P13860 and previous config saved to /var/cache/conftool/dbconfig/20210121-064903-marostegui.json |
[production] |
03:54 |
<milimetric@deploy1001> |
deploy aborted: Minor typo fix (duration: 01m 39s) |
[production] |
03:52 |
<milimetric@deploy1001> |
Started deploy [analytics/refinery@57589e7]: Minor typo fix |
[production] |
01:27 |
<ryankemper> |
[WDQS Deploy] Rollback complete, service health of `wdqs1003` is restored. Need to investigate source of 404 (possibly related to some recent changes we made in the `gui` repo) |
[production] |
01:26 |
<ryankemper@deploy1001> |
Finished deploy [wdqs/wdqs@70f9d37]: 0.3.60 (duration: 02m 53s) |
[production] |
01:26 |
<ryankemper> |
[WDQS Deploy] Rollback of canary `wdqs1003` initiated |
[production] |
01:25 |
<ryankemper> |
[WDQS Deploy] Automated tests passing on canary `wdqs1003` but manually visiting `http://localhost:9999` (my tunnel to `wdqs1003`) gives `404 Not Found` from nginx; aborting deploy |
[production] |
01:23 |
<ryankemper@deploy1001> |
Started deploy [wdqs/wdqs@70f9d37]: 0.3.60 |
[production] |
01:22 |
<ryankemper> |
[WDQS Deploy] Tests on canary `wdqs1003` passing before start of deploy, proceeding with deploy of wdqs `0.3.60` to canary |
[production] |
00:44 |
<legoktm> |
legoktm@mwmaint1002:~$ mwscript initSiteStats.php --wiki=trwikivoyage --update |
[production] |
00:19 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2369.codfw.wmnet |
[production] |
00:19 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2367.codfw.wmnet |
[production] |
00:19 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2365.codfw.wmnet |
[production] |
00:19 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2363.codfw.wmnet |
[production] |
00:18 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2369.codfw.wmnet |
[production] |
00:18 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2365.codfw.wmnet |
[production] |
00:18 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2367.codfw.wmnet |
[production] |
00:17 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2363.codfw.wmnet |
[production] |
2021-01-20
|
23:51 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2369.codfw.wmnet with reason: REIMAGE |
[production] |
23:51 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2365.codfw.wmnet with reason: REIMAGE |
[production] |
23:49 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2367.codfw.wmnet with reason: REIMAGE |
[production] |
23:47 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2363.codfw.wmnet with reason: REIMAGE |
[production] |
23:47 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2369.codfw.wmnet with reason: REIMAGE |
[production] |
23:47 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2367.codfw.wmnet with reason: REIMAGE |
[production] |
23:46 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2365.codfw.wmnet with reason: REIMAGE |
[production] |
23:45 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2363.codfw.wmnet with reason: REIMAGE |
[production] |
23:30 |
<mutante> |
releases2002 - rebooting VM |
[production] |
23:25 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2361.codfw.wmnet |
[production] |
23:25 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2359.codfw.wmnet |
[production] |
23:25 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2355.codfw.wmnet |
[production] |
23:25 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2357.codfw.wmnet |
[production] |
23:22 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on releases2002.codfw.wmnet with reason: rebooting to add a disk |
[production] |
23:22 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on releases2002.codfw.wmnet with reason: rebooting to add a disk |
[production] |
23:10 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2357.codfw.wmnet |
[production] |
23:07 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2361.codfw.wmnet |
[production] |
23:07 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2359.codfw.wmnet |
[production] |
23:06 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2355.codfw.wmnet |
[production] |
23:03 |
<legoktm> |
updated docker-registry.discovery.wmnet/wikimedia-buster image |
[production] |
23:01 |
<mutante> |
mw2331, mw2333 - scap pull |
[production] |
22:49 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mw2359.codfw.wmnet with reason: new install on buster |
[production] |
22:49 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 4:00:00 on mw2359.codfw.wmnet with reason: new install on buster |
[production] |
22:48 |
<dzahn@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw2359.codfw.wmnet with reason: REIMAGE |
[production] |
22:47 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2361.codfw.wmnet with reason: REIMAGE |
[production] |
22:45 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2357.codfw.wmnet with reason: REIMAGE |
[production] |
22:44 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2361.codfw.wmnet with reason: REIMAGE |
[production] |
22:43 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2359.codfw.wmnet with reason: REIMAGE |
[production] |
22:43 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2357.codfw.wmnet with reason: REIMAGE |
[production] |
22:43 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2355.codfw.wmnet with reason: REIMAGE |
[production] |
22:41 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2355.codfw.wmnet with reason: REIMAGE |
[production] |