2021-04-15
ยง
|
22:33 |
<ryankemper> |
T280108 T267927 Data transfers completed successfully; small issue with new `wait_for_updater` logic is preventing termination so I ctrl+c'd manually |
[production] |
22:32 |
<ryankemper@cumin2001> |
END (ERROR) - Cookbook sre.wdqs.data-transfer (exit_code=97) |
[production] |
21:13 |
<razzi> |
rebalance kafka partitions for webrequest_text partition 23 |
[analytics] |
20:03 |
<herron> |
migrating kafka-logging broker logstash1012 to kafka-logging1003 T279342 |
[production] |
19:56 |
<Trey314159> |
reindexing wikidata on cloudelastic finished/failed (T274200) |
[production] |
19:43 |
<Trey314159> |
reindexing wikidata on cloudelastic (T274200) |
[production] |
19:42 |
<Trey314159> |
reindexing commons and wikidata on elastic@eqiad (T274200) |
[production] |
19:28 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:26 |
<wm-bot> |
<lucaswerkmeister> deployed 051e3789a2 (l10n updates) |
[tools.lexeme-forms] |
19:24 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
19:14 |
<jhuneidi@deploy1002> |
rebuilt and synchronized wikiversions files: all wikis to 1.37.0-wmf.1 refs T278345 |
[production] |
18:49 |
<andrew@deploy1002> |
Finished deploy [horizon/deploy@ec37c43]: test deploy of trove dashboard to codfw1dev (duration: 01m 58s) |
[production] |
18:47 |
<andrew@deploy1002> |
Started deploy [horizon/deploy@ec37c43]: test deploy of trove dashboard to codfw1dev |
[production] |
18:39 |
<jdrewniak@deploy1002> |
Synchronized private/readme.php: Config: [[gerrit:679614|Add $wgWMEVectorPrefDiffSalt to private/readme (T261842)]] (duration: 01m 08s) |
[production] |
18:32 |
<jdrewniak@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:679613|Add mediawiki.pref_diff stream to wgEventLoggingStreamNames/wgEventStreams (T261842)]] (duration: 01m 18s) |
[production] |
18:26 |
<paladox> |
gerrit: created openstack/horizon/trove-dashboard per andrewbogott (with parent set as openstack/horizon/horizon) |
[releng] |
17:45 |
<bstorm> |
cleared error state from tools-sgeexec-0920.tools.eqiad.wmflabs for a failed job |
[tools] |
17:18 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
17:09 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
16:47 |
<Majavah> |
manually rebase deployment-puppetmaster04 due to local hacks having conflicts |
[releng] |
16:42 |
<crusnov@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
16:34 |
<crusnov@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
16:27 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1027.eqiad.wmnet |
[production] |
16:21 |
<ryankemper> |
T280108 T267927 Current wdqs transfers in progress: `wqds1004`->`wdqs1005`, `wdqs2008`->`wdqs2001` |
[production] |
16:21 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1027.eqiad.wmnet |
[production] |
16:17 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:17 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1026.eqiad.wmnet |
[production] |
16:17 |
<ryankemper@cumin2001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:17 |
<ryankemper> |
T280108 T267927 Merged https://gerrit.wikimedia.org/r/c/operations/cookbooks/+/679702 and ran puppet-agent on `cumin2001` before next round of wdqs `data-transfer`s |
[production] |
16:12 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1026.eqiad.wmnet |
[production] |
16:08 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1025.eqiad.wmnet |
[production] |
16:02 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host restbase1025.eqiad.wmnet |
[production] |
15:26 |
<otto@deploy1002> |
Finished deploy [analytics/refinery@497f6a5] (hadoop-test): (no justification provided) (duration: 04m 44s) |
[production] |
15:21 |
<otto@deploy1002> |
Started deploy [analytics/refinery@497f6a5] (hadoop-test): (no justification provided) |
[production] |
15:09 |
<elukey@deploy1002> |
Finished deploy [analytics/refinery@497f6a5]: Regular analytics weekly train (duration: 13m 12s) |
[production] |
15:09 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns1002.wikimedia.org |
[production] |
15:03 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns1002.wikimedia.org |
[production] |
14:59 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns1001.wikimedia.org |
[production] |
14:56 |
<elukey> |
deploy refinery via scap - weekly train |
[analytics] |
14:56 |
<elukey@deploy1002> |
Started deploy [analytics/refinery@497f6a5]: Regular analytics weekly train |
[production] |
14:53 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns1001.wikimedia.org |
[production] |
14:50 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns5002.wikimedia.org |
[production] |
14:47 |
<jayme> |
imported etcd-mirror_0.0.5-1 to buster-wikimedia |
[production] |
14:43 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns5002.wikimedia.org |
[production] |
14:41 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dns5001.wikimedia.org |
[production] |
14:37 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1048.eqiad.wmnet with reason: REIMAGE |
[production] |
14:35 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1047.eqiad.wmnet with reason: REIMAGE |
[production] |
14:35 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wtp1048.eqiad.wmnet with reason: REIMAGE |
[production] |
14:34 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host dns5001.wikimedia.org |
[production] |
14:33 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wtp1046.eqiad.wmnet with reason: REIMAGE |
[production] |