2021-04-15
ยง
|
22:46 |
<ryankemper> |
T280108 T267927 Manually re-enabled and ran puppet on `wdqs1005` (had closed the tmux pane which terminated the cookbook without letting it do its final cleanup) |
[production] |
22:33 |
<ryankemper> |
T280108 T267927 Data transfers completed successfully; small issue with new `wait_for_updater` logic is preventing termination so I ctrl+c'd manually |
[production] |
22:32 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
20:03 |
<herron> |
migrating kafka-logging broker logstash1012 to kafka-logging1003 T279342 |
[production] |
19:56 |
<Trey314159> |
reindexing wikidata on cloudelastic finished/failed (T274200) |
[production] |
19:43 |
<Trey314159> |
reindexing wikidata on cloudelastic (T274200) |
[production] |
19:42 |
<Trey314159> |
reindexing commons and wikidata on elastic@eqiad (T274200) |
[production] |
19:28 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:24 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:14 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: all wikis to 1.37.0-wmf.1 refs T278345 |
[production] |
18:49 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@ec37c43]: test deploy of trove dashboard to codfw1dev (duration: 01m 58s) |
[production] |
18:47 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@ec37c43]: test deploy of trove dashboard to codfw1dev |
[production] |
18:39 |
<jdrewniak@deploy1002> |
Synchronized private/readme.php: Config: [[gerrit:679614|Add $wgWMEVectorPrefDiffSalt to private/readme (T261842)]] (duration: 01m 08s) |
[production] |
18:32 |
<jdrewniak@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:679613|Add mediawiki.pref_diff stream to wgEventLoggingStreamNames/wgEventStreams (T261842)]] (duration: 01m 18s) |
[production] |
17:18 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
17:09 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
16:42 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:34 |
<crusnov@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
16:27 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1027.eqiad.wmnet |
[production] |
16:21 |
<ryankemper> |
T280108 T267927 Current wdqs transfers in progress: `wqds1004`->`wdqs1005`, `wdqs2008`->`wdqs2001` |
[production] |
16:21 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1027.eqiad.wmnet |
[production] |
16:17 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:17 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1026.eqiad.wmnet |
[production] |
16:17 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:17 |
<ryankemper> |
T280108 T267927 Merged https://gerrit.wikimedia.org/r/c/operations/cookbooks/+/679702 and ran puppet-agent on `cumin2001` before next round of wdqs `data-transfer`s |
[production] |
16:12 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1026.eqiad.wmnet |
[production] |
16:08 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1025.eqiad.wmnet |
[production] |
16:02 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1025.eqiad.wmnet |
[production] |
15:26 |
<otto@deploy1002> |
Finished deploy [analytics/refinery@497f6a5] (hadoop-test): (no justification provided) (duration: 04m 44s) |
[production] |
15:21 |
<otto@deploy1002> |
Started deploy [analytics/refinery@497f6a5] (hadoop-test): (no justification provided) |
[production] |
15:09 |
<elukey@deploy1002> |
Finished deploy [analytics/refinery@497f6a5]: Regular analytics weekly train (duration: 13m 12s) |
[production] |
15:09 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns1002.wikimedia.org |
[production] |
15:03 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns1002.wikimedia.org |
[production] |
14:59 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns1001.wikimedia.org |
[production] |
14:56 |
<elukey@deploy1002> |
Started deploy [analytics/refinery@497f6a5]: Regular analytics weekly train |
[production] |
14:53 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns1001.wikimedia.org |
[production] |
14:50 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns5002.wikimedia.org |
[production] |
14:47 |
<jayme> |
imported etcd-mirror_0.0.5-1 to buster-wikimedia |
[production] |
14:43 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns5002.wikimedia.org |
[production] |
14:41 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns5001.wikimedia.org |
[production] |
14:37 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1048.eqiad.wmnet with reason: REIMAGE |
[production] |
14:35 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1047.eqiad.wmnet with reason: REIMAGE |
[production] |
14:35 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wtp1048.eqiad.wmnet with reason: REIMAGE |
[production] |
14:34 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns5001.wikimedia.org |
[production] |
14:33 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1046.eqiad.wmnet with reason: REIMAGE |
[production] |
14:33 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wtp1047.eqiad.wmnet with reason: REIMAGE |
[production] |
14:32 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns2002.wikimedia.org |
[production] |
14:31 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wtp1046.eqiad.wmnet with reason: REIMAGE |
[production] |
14:27 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns2002.wikimedia.org |
[production] |
14:24 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns2001.wikimedia.org |
[production] |