2023-01-03
|
10:39 |
<btullis> |
fail over hive services to an-coord1002 with change to the DNS CNAME for analytics-hive.eqiad.wmnet |
[analytics] |
10:37 |
<cgoubert@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on parse1002.eqiad.wmnet with reason: CPU1 machine check error |
[production] |
10:36 |
<cgoubert@cumin1001> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on parse1002.eqiad.wmnet with reason: CPU1 machine check error |
[production] |
10:36 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gerrit1001.wikimedia.org |
[production] |
10:31 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host gerrit1001.wikimedia.org |
[production] |
10:25 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host gerrit2002.wikimedia.org |
[production] |
10:20 |
<btullis> |
restart hive-server2 and hive-metastore services on an-coord1002 prior to failover |
[analytics] |
10:18 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host gerrit2002.wikimedia.org |
[production] |
09:27 |
<vgutierrez> |
restarting varnish on cp5032 to clear VarnishChildRestarted alert - T325797 |
[production] |
08:19 |
<kartik@deploy1002> |
Finished scap: Backport for [[gerrit:869347|Content Translation: Move ttwiki out of Beta (T319177)]] (duration: 16m 09s) |
[production] |
08:16 |
<jmm@puppetmaster1001> |
conftool action : set/pooled=inactive; selector: name=parse1002.eqiad.wmnet |
[production] |
08:12 |
<moritzm> |
installing Linux 4.19.269 on Buster hosts |
[production] |
08:12 |
<phedenskog@deploy1002> |
Finished deploy [performance/navtiming@4f8c010]: (no justification provided) (duration: 00m 08s) |
[production] |
08:12 |
<phedenskog@deploy1002> |
Started deploy [performance/navtiming@4f8c010]: (no justification provided) |
[production] |
08:05 |
<kartik@deploy1002> |
kartik and kartik: Backport for [[gerrit:869347|Content Translation: Move ttwiki out of Beta (T319177)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
08:03 |
<kartik@deploy1002> |
Started scap: Backport for [[gerrit:869347|Content Translation: Move ttwiki out of Beta (T319177)]] |
[production] |
07:52 |
<hashar> |
Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/874439/ Make Phonos depend on TimedMediaHandler # T322368 |
[releng] |
04:58 |
<mwpresync@deploy1002> |
Finished scap: testwikis wikis to 1.40.0-wmf.17 refs T325580 (duration: 55m 31s) |
[production] |
04:02 |
<mwpresync@deploy1002> |
Started scap: testwikis wikis to 1.40.0-wmf.17 refs T325580 |
[production] |
00:06 |
<wm-bot> |
<anticomposite> ./stewardbots/StewardBot/manage.sh restart # RC not working |
[tools.stewardbots] |
2022-12-29
|
23:26 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
23:25 |
<ryankemper@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
23:24 |
<ryankemper@cumin2002> |
START - Cookbook sre.wdqs.data-reload |
[production] |
23:22 |
<ryankemper@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
23:07 |
<legoktm> |
Marked for deletion in 40 days |
[tools.scfc-test-can-be-deleted-anytime] |
23:02 |
<TheresNoTime> |
`samtar@coibot:/home/billinghurst$ sudo rm syslog.script` for T326014 |
[linkwatcher] |
09:19 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on an-worker1084.eqiad.wmnet with reason: Avoid IRC spam |
[production] |
09:19 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on an-worker1084.eqiad.wmnet with reason: Avoid IRC spam |
[production] |