2024-11-27
§
|
13:20 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host wdqs1025.eqiad.wmnet with OS bullseye |
[production] |
13:20 |
<jclark@cumin1002> |
START - Cookbook sre.hosts.reimage for host wdqs1027.eqiad.wmnet with OS bullseye |
[production] |
13:16 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of durum7002.magru.wmnet to drbd |
[production] |
13:15 |
<kartik@deploy2002> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
13:15 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh7002.wikimedia.org to drbd |
[production] |
13:05 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of doh7002.wikimedia.org to drbd |
[production] |
13:05 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of prometheus7001.magru.wmnet to drbd |
[production] |
12:56 |
<jiji@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on kafka-main[1002,1007].eqiad.wmnet with reason: Hardware refresh |
[production] |
12:56 |
<jiji@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on kafka-main[1002,1007].eqiad.wmnet with reason: Hardware refresh |
[production] |
12:50 |
<moritzm> |
installing ghostscript security updates |
[production] |
12:39 |
<kartik@deploy2002> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:38 |
<effie> |
start replacing kafka-main1002 with kafka-main1007 - T363214 |
[production] |
12:24 |
<mvolz@deploy2002> |
helmfile [staging] DONE helmfile.d/services/citoid: apply |
[production] |
12:24 |
<mvolz@deploy2002> |
helmfile [staging] START helmfile.d/services/citoid: apply |
[production] |
12:24 |
<kart_> |
Updated cxserver to 2024-11-20-121713-production (T377966, T357950) |
[production] |
12:22 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
12:22 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
12:20 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
12:20 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
12:18 |
<moritzm> |
installing python-cryptography security updates |
[production] |
12:14 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
12:13 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
12:12 |
<moritzm> |
installing openssl security updates |
[production] |
12:08 |
<mvolz@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/zotero: apply |
[production] |
12:07 |
<mvolz@deploy2002> |
helmfile [eqiad] START helmfile.d/services/zotero: apply |
[production] |
12:06 |
<mvolz@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/zotero: apply |
[production] |
12:06 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of prometheus7001.magru.wmnet to drbd |
[production] |
12:06 |
<mvolz@deploy2002> |
helmfile [codfw] START helmfile.d/services/zotero: apply |
[production] |
12:05 |
<mvolz@deploy2002> |
helmfile [staging] DONE helmfile.d/services/zotero: apply |
[production] |
12:05 |
<mvolz@deploy2002> |
helmfile [staging] START helmfile.d/services/zotero: apply |
[production] |
12:05 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
12:05 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
12:03 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on ganeti2042.codfw.wmnet with reason: broken CPU |
[production] |
12:03 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on ganeti2042.codfw.wmnet with reason: broken CPU |
[production] |
11:53 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of bast7001.wikimedia.org to drbd |
[production] |
11:45 |
<ladsgroup@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1098484|Bump ratio of new parsercache key spec to 3 (T373037)]] (duration: 12m 51s) |
[production] |
11:38 |
<ladsgroup@deploy2002> |
ladsgroup: Continuing with sync |
[production] |
11:38 |
<ladsgroup@deploy2002> |
ladsgroup: Backport for [[gerrit:1098484|Bump ratio of new parsercache key spec to 3 (T373037)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
11:34 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of bast7001.wikimedia.org to drbd |
[production] |
11:32 |
<ladsgroup@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1098484|Bump ratio of new parsercache key spec to 3 (T373037)]] |
[production] |
11:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir7002.magru.wmnet to drbd |
[production] |
11:21 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dns7002.wikimedia.org with reason: T376737 |
[production] |
11:21 |
<fabfur@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on dns7002.wikimedia.org with reason: T376737 |
[production] |
11:21 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dns7001.wikimedia.org with reason: T376737 |
[production] |
11:20 |
<fabfur@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on dns7001.wikimedia.org with reason: T376737 |
[production] |
11:19 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs[7001-7003].magru.wmnet with reason: T376737 |
[production] |
11:19 |
<fabfur@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on lvs[7001-7003].magru.wmnet with reason: T376737 |
[production] |
11:19 |
<fabfur@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 16 hosts with reason: T376737 |
[production] |
11:19 |
<fabfur@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on 16 hosts with reason: T376737 |
[production] |
11:18 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir7002.magru.wmnet to drbd |
[production] |