2021-02-06
§
|
08:59 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
08:58 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
08:52 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hadoop.change-distro-from-cdh-clients (exit_code=0) for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
08:52 |
<elukey@cumin1001> |
START - Cookbook sre.hadoop.change-distro-from-cdh-clients for Hadoop test cluster: Change Hadoop distribution - elukey@cumin1001 |
[production] |
03:40 |
<ryankemper> |
Deleted dump taking up diskspace on `wdqs1009`, disk space warning will resolve now |
[production] |
01:30 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1319.eqiad.wmnet |
[production] |
01:29 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1313.eqiad.wmnet |
[production] |
01:25 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1319.eqiad.wmnet |
[production] |
01:25 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1313.eqiad.wmnet |
[production] |
01:00 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2265.codfw.wmnet |
[production] |
00:57 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1366.eqiad.wmnet |
[production] |
00:46 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1366.eqiad.wmnet |
[production] |
00:46 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw2265.codfw.wmnet |
[production] |
00:30 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1313.eqiad.wmnet with reason: REIMAGE |
[production] |
00:28 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1313.eqiad.wmnet with reason: REIMAGE |
[production] |
00:25 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1319.eqiad.wmnet with reason: REIMAGE |
[production] |
00:23 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1319.eqiad.wmnet with reason: REIMAGE |
[production] |
00:19 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2265.codfw.wmnet with reason: REIMAGE |
[production] |
00:17 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw2265.codfw.wmnet with reason: REIMAGE |
[production] |
00:15 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1366.eqiad.wmnet with reason: REIMAGE |
[production] |
00:13 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1366.eqiad.wmnet with reason: REIMAGE |
[production] |
2021-02-05
§
|
23:37 |
<legoktm@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1285.eqiad.wmnet |
[production] |
23:35 |
<ryankemper> |
T267927 Re-downloading latest dumps (main database, lexeme) in tmux session `downloads_dumps` on `ryankemper@wdqs1009.eqiad.wmnet` |
[production] |
23:15 |
<legoktm@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1285.eqiad.wmnet |
[production] |
22:56 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
22:56 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
22:50 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
22:50 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
22:46 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
22:46 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
22:42 |
<ryankemper> |
T267927 `sudo cookbook sre.wdqs.data-reload wdqs1009.eqiad.wmnet --reuse-downloaded-dump --reload-data wikidata --skolemize --reason 'T267927: Reload wikidata jnl from fresh dumps' --task-id T267927` failing with `ERROR org.wikidata.query.rdf.tool.Munge - Fatal error munging RDF` |
[production] |
22:41 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
22:41 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
22:38 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
22:38 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
22:37 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1269.eqiad.wmnet |
[production] |
22:32 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1269.eqiad.wmnet |
[production] |
22:19 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1306.eqiad.wmnet |
[production] |
22:16 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1306.eqiad.wmnet |
[production] |
22:03 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1285.eqiad.wmnet with reason: REIMAGE |
[production] |
22:01 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1285.eqiad.wmnet with reason: REIMAGE |
[production] |
21:52 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1393.eqiad.wmnet |
[production] |
21:51 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1393.eqiad.wmnet |
[production] |
21:49 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw1392.eqiad.wmnet |
[production] |
21:49 |
<dzahn@cumin1001> |
conftool action : set/pooled=no; selector: name=mw1392.eqiad.wmnet |
[production] |
21:48 |
<dzahn@cumin1001> |
conftool action : set/pooled=yes; selector: name=mw2266.codfw.wmnet |
[production] |
21:41 |
<dzahn@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1269.eqiad.wmnet with reason: REIMAGE |
[production] |
21:39 |
<dzahn@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mw1269.eqiad.wmnet with reason: REIMAGE |
[production] |