601-650 of 10000 results (21ms)
2025-01-21 ยง
13:22 <btullis@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-launcher1002.eqiad.wmnet with reason: Migrating to kubernetes [production]
13:22 <btullis@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-launcher1002.eqiad.wmnet with reason: Migrating to kubernetes [production]
13:20 <btullis@cumin1002> START - Cookbook sre.druid.roll-restart-workers for Druid analytics cluster: Roll restart of Druid jvm daemons. [production]
13:17 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-search: apply [production]
13:16 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-search: apply [production]
13:01 <hnowlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
13:01 <hnowlan@deploy2002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
12:59 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-9 [toolsbeta]
12:59 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
12:59 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
12:56 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-9 [toolsbeta]
12:56 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-8 [toolsbeta]
12:52 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-8 [toolsbeta]
12:52 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-7 [toolsbeta]
12:48 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72198 and previous config saved to /var/cache/conftool/dbconfig/20250121-124857-root.json [production]
12:48 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-7 [toolsbeta]
12:48 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-5 [toolsbeta]
12:47 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72197 and previous config saved to /var/cache/conftool/dbconfig/20250121-124750-root.json [production]
12:44 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-5 [toolsbeta]
12:42 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-10 [toolsbeta]
12:40 <andrewbogott> rebooting toolsbeta-nfs-3 and then restarting all k8s-nfs workers [toolsbeta]
12:38 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-10 [toolsbeta]
12:38 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet [production]
12:33 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72194 and previous config saved to /var/cache/conftool/dbconfig/20250121-123352-root.json [production]
12:33 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain [production]
12:32 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72193 and previous config saved to /var/cache/conftool/dbconfig/20250121-123245-root.json [production]
12:32 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain [production]
12:32 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
12:32 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2019.codfw.wmnet [production]
12:32 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
12:31 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet [production]
12:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd [production]
12:27 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
12:27 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
12:18 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72190 and previous config saved to /var/cache/conftool/dbconfig/20250121-121847-root.json [production]
12:17 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72189 and previous config saved to /var/cache/conftool/dbconfig/20250121-121739-root.json [production]
12:15 <kart_> Updated cxserver to 2025-01-20-172318-production (T377966, T377813) [production]
12:15 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
12:14 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
12:10 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
12:09 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
12:09 <fceratto@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2189.codfw.wmnet [production]
12:08 <hnowlan@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on restbase2037.codfw.wmnet with reason: Memory issues, rebooting frequently. Depooled. T383820 [production]
12:05 <federico3> updating db2189.codfw.wmnet for https://phabricator.wikimedia.org/T384202 [production]
12:05 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
12:04 <fceratto@cumin1002> START - Cookbook sre.mysql.upgrade for db2189.codfw.wmnet [production]
12:03 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72187 and previous config saved to /var/cache/conftool/dbconfig/20250121-120341-root.json [production]
12:03 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
12:02 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72186 and previous config saved to /var/cache/conftool/dbconfig/20250121-120234-root.json [production]
12:01 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd [production]