2025-01-21
ยง
|
13:22 |
<btullis@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-launcher1002.eqiad.wmnet with reason: Migrating to kubernetes |
[production] |
13:22 |
<btullis@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-launcher1002.eqiad.wmnet with reason: Migrating to kubernetes |
[production] |
13:20 |
<btullis@cumin1002> |
START - Cookbook sre.druid.roll-restart-workers for Druid analytics cluster: Roll restart of Druid jvm daemons. |
[production] |
13:17 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
13:16 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply |
[production] |
13:01 |
<hnowlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
13:01 |
<hnowlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
12:59 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-9 |
[toolsbeta] |
12:59 |
<hnowlan@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
12:59 |
<hnowlan@deploy2002> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
12:56 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-9 |
[toolsbeta] |
12:56 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-8 |
[toolsbeta] |
12:52 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-8 |
[toolsbeta] |
12:52 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-7 |
[toolsbeta] |
12:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72198 and previous config saved to /var/cache/conftool/dbconfig/20250121-124857-root.json |
[production] |
12:48 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-7 |
[toolsbeta] |
12:48 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-5 |
[toolsbeta] |
12:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72197 and previous config saved to /var/cache/conftool/dbconfig/20250121-124750-root.json |
[production] |
12:44 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-5 |
[toolsbeta] |
12:42 |
<andrew@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-10 |
[toolsbeta] |
12:40 |
<andrewbogott> |
rebooting toolsbeta-nfs-3 and then restarting all k8s-nfs workers |
[toolsbeta] |
12:38 |
<andrew@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-10 |
[toolsbeta] |
12:38 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet |
[production] |
12:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72194 and previous config saved to /var/cache/conftool/dbconfig/20250121-123352-root.json |
[production] |
12:33 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain |
[production] |
12:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72193 and previous config saved to /var/cache/conftool/dbconfig/20250121-123245-root.json |
[production] |
12:32 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain |
[production] |
12:32 |
<hnowlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
12:32 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2019.codfw.wmnet |
[production] |
12:32 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |
12:31 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet |
[production] |
12:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd |
[production] |
12:27 |
<hnowlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/changeprop: apply |
[production] |
12:27 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop: apply |
[production] |
12:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72190 and previous config saved to /var/cache/conftool/dbconfig/20250121-121847-root.json |
[production] |
12:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72189 and previous config saved to /var/cache/conftool/dbconfig/20250121-121739-root.json |
[production] |
12:15 |
<kart_> |
Updated cxserver to 2025-01-20-172318-production (T377966, T377813) |
[production] |
12:15 |
<kartik@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
12:14 |
<kartik@deploy2002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
12:10 |
<kartik@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
12:09 |
<kartik@deploy2002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
12:09 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2189.codfw.wmnet |
[production] |
12:08 |
<hnowlan@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on restbase2037.codfw.wmnet with reason: Memory issues, rebooting frequently. Depooled. T383820 |
[production] |
12:05 |
<federico3> |
updating db2189.codfw.wmnet for https://phabricator.wikimedia.org/T384202 |
[production] |
12:05 |
<kartik@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
12:04 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2189.codfw.wmnet |
[production] |
12:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2203 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72187 and previous config saved to /var/cache/conftool/dbconfig/20250121-120341-root.json |
[production] |
12:03 |
<kartik@deploy2002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
12:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2216 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72186 and previous config saved to /var/cache/conftool/dbconfig/20250121-120234-root.json |
[production] |
12:01 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd |
[production] |