2024-11-20
§
|
09:33 |
<fabfur@cumin1002> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_esams |
[production] |
09:28 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:27 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:27 |
<aklapper@deploy2002> |
aklapper, thiemowmde: Continuing with sync |
[production] |
09:26 |
<aklapper@deploy2002> |
aklapper, thiemowmde: Backport for [[gerrit:1093172|EditionLookup: Update EntityLookup calls (T380304)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
09:21 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of prometheus7001.magru.wmnet to plain |
[production] |
09:20 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of prometheus7001.magru.wmnet to plain |
[production] |
09:20 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:20 |
<aklapper@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1093172|EditionLookup: Update EntityLookup calls (T380304)]] |
[production] |
09:19 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
09:18 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of doh7002.wikimedia.org to plain |
[production] |
09:15 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of doh7002.wikimedia.org to plain |
[production] |
09:13 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir7002.magru.wmnet to plain |
[production] |
09:13 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir7002.magru.wmnet to plain |
[production] |
08:56 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of durum7002.magru.wmnet to plain |
[production] |
08:51 |
<jayme> |
disabling puppet on all k8s controll planes for rollout of T380142 |
[production] |
08:48 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of durum7002.magru.wmnet to plain |
[production] |
08:46 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of bast7001.wikimedia.org to plain |
[production] |
08:44 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of bast7001.wikimedia.org to plain |
[production] |
08:35 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti7004.magru.wmnet |
[production] |
08:35 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti7004.magru.wmnet |
[production] |
08:35 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti7004.magru.wmnet |
[production] |
08:34 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti7004.magru.wmnet |
[production] |
08:18 |
<hashar> |
Restarted CI Jenkins to upgrade Leastload plugin and remove the SSH server plugin |
[production] |
2024-11-19
§
|
22:50 |
<ryankemper@deploy2002> |
Started deploy [wdqs/wdqs@9927a5a] (wcqs): Deploy 0.3.150 to WCQS |
[production] |
22:00 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1092341|Enable experimental Parsoid fragment support on labs and test wikis (T374661)]], [[gerrit:1092850|Revert "editcheck: Remove try/catch around transaction squashing" (T333710 T380234)]], [[gerrit:1092851|Revert "editcheck: Remove try/catch around transaction squashing" (T333710 T380234)]] (duration: 20m 39s) |
[production] |
21:53 |
<urbanecm@deploy2002> |
cscott, kemayo, urbanecm: Continuing with sync |
[production] |
21:45 |
<urbanecm@deploy2002> |
cscott, kemayo, urbanecm: Backport for [[gerrit:1092341|Enable experimental Parsoid fragment support on labs and test wikis (T374661)]], [[gerrit:1092850|Revert "editcheck: Remove try/catch around transaction squashing" (T333710 T380234)]], [[gerrit:1092851|Revert "editcheck: Remove try/catch around transaction squashing" (T333710 T380234)]] synced to the testservers (https://wikitech.wikimedia.or |
[production] |
21:39 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es2041.codfw.wmnet with OS bookworm |
[production] |
21:39 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1092341|Enable experimental Parsoid fragment support on labs and test wikis (T374661)]], [[gerrit:1092850|Revert "editcheck: Remove try/catch around transaction squashing" (T333710 T380234)]], [[gerrit:1092851|Revert "editcheck: Remove try/catch around transaction squashing" (T333710 T380234)]] |
[production] |
21:38 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1092296|Promote Vector 2022 as default on 3 wikis (T379765)]], [[gerrit:1092912|Separate cache key space for test & production JsonConfig data (T380320)]] (duration: 14m 38s) |
[production] |
21:31 |
<urbanecm@deploy2002> |
bvibber, jdlrobson, urbanecm: Continuing with sync |
[production] |
21:29 |
<urbanecm@deploy2002> |
bvibber, jdlrobson, urbanecm: Backport for [[gerrit:1092296|Promote Vector 2022 as default on 3 wikis (T379765)]], [[gerrit:1092912|Separate cache key space for test & production JsonConfig data (T380320)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:23 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1092296|Promote Vector 2022 as default on 3 wikis (T379765)]], [[gerrit:1092912|Separate cache key space for test & production JsonConfig data (T380320)]] |
[production] |
21:16 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2038.codfw.wmnet with reason: Bootstrapping — T380236 |
[production] |
21:15 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2038.codfw.wmnet with reason: Bootstrapping — T380236 |
[production] |
21:15 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2037.codfw.wmnet with reason: Bootstrapping — T380236 |
[production] |
21:15 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2037.codfw.wmnet with reason: Bootstrapping — T380236 |
[production] |
21:15 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on restbase2036.codfw.wmnet with reason: Bootstrapping — T380236 |
[production] |
21:14 |
<eevans@cumin1002> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on restbase2036.codfw.wmnet with reason: Bootstrapping — T380236 |
[production] |
20:56 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host es2041.codfw.wmnet with OS bookworm |
[production] |
20:50 |
<jhathaway@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host thanos-be2005.codfw.wmnet with OS bullseye |
[production] |
20:40 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host thanos-be2005.codfw.wmnet with OS bullseye |
[production] |
20:40 |
<jhathaway@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host thanos-be2005.codfw.wmnet with OS bullseye |
[production] |
20:32 |
<sukhe@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp7007.magru.wmnet with OS bullseye |
[production] |
20:29 |
<sukhe@cumin1002> |
START - Cookbook sre.hosts.reimage for host cp7007.magru.wmnet with OS bullseye |
[production] |
20:24 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es2041.codfw.wmnet with OS bookworm |
[production] |
20:24 |
<jhathaway@cumin2002> |
START - Cookbook sre.hosts.reimage for host thanos-be2005.codfw.wmnet with OS bullseye |
[production] |
20:10 |
<jhathaway@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on ms-be2082.codfw.wmnet with reason: T371400 |
[production] |
20:10 |
<jhathaway@cumin1002> |
START - Cookbook sre.hosts.downtime for 3:00:00 on ms-be2082.codfw.wmnet with reason: T371400 |
[production] |