2024-01-02
10:22 |
<wm-bot2> |
fran@wmf3169 START - Cookbook wmcs.openstack.cloudvirt.vm_console |
[admin] |
10:13 |
<dhinus> |
hard reboot tools-harbor-1 |
[tools] |
09:24 |
<btullis> |
adding three days' downtime to dbstore1008, prior to switching its role to `mariadb::analytics_replica` for T351921 |
[analytics] |
09:23 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Commissioning new database server |
[production] |
09:23 |
<btullis@cumin1001> |
START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Commissioning new database server |
[production] |
09:17 |
<pfischer@deploy2002> |
Finished scap: Backport for [[gerrit:987028|configure message_key_fields for update_pipeline]] (duration: 15m 35s) |
[production] |
09:05 |
<pfischer@deploy2002> |
pfischer: Continuing with sync |
[production] |
09:04 |
<pfischer@deploy2002> |
pfischer: Backport for [[gerrit:987028|configure message_key_fields for update_pipeline]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
09:02 |
<moritzm> |
installing nodejs security updates on bookworm |
[production] |
09:02 |
<pfischer@deploy2002> |
Started scap: Backport for [[gerrit:987028|configure message_key_fields for update_pipeline]] |
[production] |
08:33 |
<akosiaris@cumin1001> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2448.mgmt.codfw.wmnet with reboot policy GRACEFUL |
[production] |
08:27 |
<jayme> |
restart prometheus@k8s prometheus@k8s-aux in eqiad - T343529 |
[production] |
08:26 |
<akosiaris@cumin1001> |
START - Cookbook sre.hosts.provision for host mw2448.mgmt.codfw.wmnet with reboot policy GRACEFUL |
[production] |
06:45 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2144.codfw.wmnet with OS bookworm |
[production] |
06:27 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2144.codfw.wmnet with reason: host reimage |
[production] |
06:24 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2144.codfw.wmnet with reason: host reimage |
[production] |
06:06 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.reimage for host db2144.codfw.wmnet with OS bookworm |
[production] |
05:00 |
<mwpresync@deploy2002> |
Finished scap: testwikis wikis to 1.42.0-wmf.12 refs T350088 (duration: 56m 48s) |
[production] |
04:03 |
<mwpresync@deploy2002> |
Started scap: testwikis wikis to 1.42.0-wmf.12 refs T350088 |
[production] |
00:04 |
<wm-bot> |
<jjmc89> disable all plagiabot jobs T354145 |
[tools.eranbot] |
2023-12-29
22:59 |
<pfischer@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
22:59 |
<pfischer@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
22:57 |
<pfischer@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
21:39 |
<andrewbogott> |
rebooting tools-sgeweblight-10-28.tools.eqiad1.wikimedia.cloud because previous reset didn't get the queue out of error state |
[tools] |
20:33 |
<wm-bot> |
<anticomposite> SULWatcher/manage.sh restart # Not connected. |
[tools.stewardbots] |
19:31 |
<andrewbogott> |
restarting sge_execd on tools-sgeweblight-10-28.tools.eqiad1.wikimedia.cloud in response to error state alert |
[tools] |
08:01 |
<pfischer@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:00 |
<pfischer@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
08:00 |
<pfischer@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
07:58 |
<pfischer@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
07:58 |
<pfischer@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
07:58 |
<pfischer@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |