2023-04-28
|
08:29 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-cache2003.codfw.wmnet with OS buster |
[production] |
08:27 |
<dcaro> |
rebooting tools-sgeweblight-10-28 (T335336) |
[tools] |
08:23 |
<jmm@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
08:14 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-cache2003.codfw.wmnet with reason: host reimage |
[production] |
08:11 |
<klausman@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2003.codfw.wmnet with reason: host reimage |
[production] |
07:57 |
<klausman@cumin2002> |
START - Cookbook sre.hosts.reimage for host ml-cache2003.codfw.wmnet with OS buster |
[production] |
07:55 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-cache2002.codfw.wmnet with OS buster |
[production] |
07:47 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1002.eqiad.wmnet with reason: host reimage |
[production] |
07:44 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1002.eqiad.wmnet with reason: host reimage |
[production] |
07:41 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-cache2002.codfw.wmnet with reason: host reimage |
[production] |
07:37 |
<klausman@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2002.codfw.wmnet with reason: host reimage |
[production] |
07:30 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm |
[production] |
07:23 |
<klausman@cumin2002> |
START - Cookbook sre.hosts.reimage for host ml-cache2002.codfw.wmnet with OS buster |
[production] |
07:22 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-cache2001.codfw.wmnet with OS buster |
[production] |
07:20 |
<dcaro> |
rebooting tools-sgegrid-shadow due to stale nfs mount |
[tools] |
07:04 |
<klausman@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-cache2001.codfw.wmnet with reason: host reimage |
[production] |
07:00 |
<klausman@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2001.codfw.wmnet with reason: host reimage |
[production] |
06:46 |
<klausman@cumin2002> |
START - Cookbook sre.hosts.reimage for host ml-cache2001.codfw.wmnet with OS buster |
[production] |
05:57 |
<XioNoX> |
push pfw policies - T335554 |
[production] |
05:30 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 112 |
[production] |
05:29 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.debug for Netbox circuit ID 112 |
[production] |
05:28 |
<ayounsi@cumin1001> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 393731 |
[production] |
05:27 |
<ayounsi@cumin1001> |
START - Cookbook sre.network.peering with action 'configure' for AS: 393731 |
[production] |
04:08 |
<eileen> |
config revision changed from b33fa934 to 2eef4039 |
[production] |
03:16 |
<ejegg> |
SmashPig upgraded from db9fa965 to a9fa7a2c |
[production] |
03:08 |
<ejegg> |
payments-wiki upgraded from 91582d93 to 61951572 |
[production] |
03:05 |
<eileen> |
config revision changed from 98f2afbb to b33fa934 |
[production] |
02:55 |
<eileen> |
civicrm upgraded from b4a05476 to e7904ea6 |
[production] |
02:13 |
<eileen> |
civicrm upgraded from 601d223e to b4a05476 |
[production] |
00:09 |
<bd808> |
`kubectl uncordon tools-k8s-worker-67` (T335543) |
[tools] |
00:07 |
<bd808> |
Hard reboot tools-k8s-worker-67.tools.eqiad1.wikimedia.cloud via horizon (T335543) |
[tools] |
00:04 |
<bd808> |
Rebooting tools-k8s-worker-67.tools.eqiad1.wikimedia.cloud (T335543) |
[tools] |
2023-04-27
|
23:59 |
<bd808> |
`kubectl drain --ignore-daemonsets --delete-emptydir-data --force tools-k8s-worker-67` (T335543) |
[tools] |
23:16 |
<wm-bot> |
<root> Hard stop && start cycle to reset Deployment and all dependent objects (T335520) |
[tools.glamtools] |
22:17 |
<zabe@deploy1002> |
Finished scap: T334295 (duration: 06m 58s) |
[production] |
22:10 |
<zabe@deploy1002> |
Started scap: T334295 |
[production] |
20:50 |
<bd808> |
Started process to rebuild all buster and bullseye based container images again. Prior problem seems to have been stale images in local cache on the build server. |
[tools] |
20:42 |
<bd808> |
Container image rebuild failed with GPG errors in buster-sssd base image. Will investigate and attempt to restart once resolved in a local dev environment. |
[tools] |
20:33 |
<bd808> |
Started process to rebuild all buster and bullseye based container images per https://wikitech.wikimedia.org/wiki/Portal:Toolforge/Admin/Kubernetes#Building_toolforge_specific_images |
[tools] |
20:29 |
<TheresNoTime> |
close UTC late backport window |
[production] |
20:27 |
<samtar@deploy1002> |
Finished scap: Backport for [[gerrit:912884|[cawikisource] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912888|[cawiktionary] Add a wordmark (Vector 2022) (T331823)]] (duration: 07m 19s) |
[production] |
20:21 |
<samtar@deploy1002> |
superpes and samtar: Backport for [[gerrit:912884|[cawikisource] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912888|[cawiktionary] Add a wordmark (Vector 2022) (T331823)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
20:20 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:912884|[cawikisource] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912888|[cawiktionary] Add a wordmark (Vector 2022) (T331823)]] |
[production] |
20:20 |
<samtar@deploy1002> |
Finished scap: Backport for [[gerrit:912874|[cawikibooks] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912877|[cawikinews] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912880|[cawikiquote] Add a wordmark (Vector 2022) (T331823)]] (duration: 09m 43s) |
[production] |
20:11 |
<samtar@deploy1002> |
samtar and superpes: Backport for [[gerrit:912874|[cawikibooks] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912877|[cawikinews] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912880|[cawikiquote] Add a wordmark (Vector 2022) (T331823)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet |
[production] |
20:10 |
<samtar@deploy1002> |
Started scap: Backport for [[gerrit:912874|[cawikibooks] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912877|[cawikinews] Add a wordmark (Vector 2022) (T331823)]], [[gerrit:912880|[cawikiquote] Add a wordmark (Vector 2022) (T331823)]] |
[production] |
19:27 |
<xcollazo@deploy1002> |
Finished deploy [airflow-dags/platform_eng@bc37201]: (no justification provided) (duration: 00m 10s) |
[production] |
19:27 |
<ejegg> |
payments-wiki upgraded from 7fa25437 to 91582d93 |
[production] |
19:27 |
<xcollazo@deploy1002> |
Started deploy [airflow-dags/platform_eng@bc37201]: (no justification provided) |
[production] |
19:16 |
<xcollazo@deploy1002> |
Finished deploy [airflow-dags/platform_eng@f162f4d]: Deploying T333001 on platform_eng Airflow instance. (duration: 12m 01s) |
[production] |