2025-01-21
ยง
|
14:23 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics: apply |
[production] |
14:22 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics: apply |
[production] |
14:16 |
<lucaswerkmeister-wmde@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1111932|Remove KartographerParsoidSupport flag from configuration (T340134)]] |
[production] |
14:10 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
14:09 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
14:09 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid analytics cluster: Roll restart of Druid jvm daemons. |
[production] |
13:55 |
<Emperor> |
hard-reboot ms-fe1014 |
[production] |
13:54 |
<mvernon@cumin2002> |
conftool action : set/pooled=no; selector: name=ms-fe1014.eqiad.wmnet |
[production] |
13:43 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics: apply |
[production] |
13:43 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics: apply |
[production] |
13:27 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics: apply |
[production] |
13:26 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics: apply |
[production] |
13:22 |
<btullis@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-launcher1002.eqiad.wmnet with reason: Migrating to kubernetes |
[production] |
13:22 |
<btullis@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-launcher1002.eqiad.wmnet with reason: Migrating to kubernetes |
[production] |
13:20 |
<btullis@cumin1002> |
START - Cookbook sre.druid.roll-restart-workers for Druid analytics cluster: Roll restart of Druid jvm daemons. |
[production] |
13:17 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
13:16 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
13:01 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
13:01 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
12:59 |
<hnowlan@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
12:59 |
<hnowlan@deploy2002> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
12:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72198 and previous config saved to /var/cache/conftool/dbconfig/20250121-124857-root.json |
[production] |
12:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72197 and previous config saved to /var/cache/conftool/dbconfig/20250121-124750-root.json |
[production] |
12:38 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet |
[production] |
12:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72194 and previous config saved to /var/cache/conftool/dbconfig/20250121-123352-root.json |
[production] |
12:33 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain |
[production] |
12:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72193 and previous config saved to /var/cache/conftool/dbconfig/20250121-123245-root.json |
[production] |
12:32 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain |
[production] |
12:32 |
<hnowlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
12:32 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2019.codfw.wmnet |
[production] |
12:32 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |
12:31 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet |
[production] |
12:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd |
[production] |
12:27 |
<hnowlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/changeprop: apply |
[production] |
12:27 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
12:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72190 and previous config saved to /var/cache/conftool/dbconfig/20250121-121847-root.json |
[production] |
12:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72189 and previous config saved to /var/cache/conftool/dbconfig/20250121-121739-root.json |
[production] |
12:15 |
<kart_> |
Updated cxserver to 2025-01-20-172318-production (T377966, T377813) |
[production] |
12:15 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
12:14 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
12:10 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
12:09 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
12:09 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2189.codfw.wmnet |
[production] |
12:08 |
<hnowlan@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on restbase2037.codfw.wmnet with reason: Memory issues, rebooting frequently. Depooled. T383820 |
[production] |
12:05 |
<federico3> |
updating db2189.codfw.wmnet for https://phabricator.wikimedia.org/T384202 |
[production] |
12:05 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
12:04 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2189.codfw.wmnet |
[production] |
12:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72187 and previous config saved to /var/cache/conftool/dbconfig/20250121-120341-root.json |
[production] |
12:03 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
12:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72186 and previous config saved to /var/cache/conftool/dbconfig/20250121-120234-root.json |
[production] |