1451-1500 of 10000 results (40ms)
2025-01-21 ยง
12:48 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-5 [toolsbeta]
12:47 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72197 and previous config saved to /var/cache/conftool/dbconfig/20250121-124750-root.json [production]
12:44 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-5 [toolsbeta]
12:42 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for toolsbeta-test-k8s-worker-nfs-10 [toolsbeta]
12:40 <andrewbogott> rebooting toolsbeta-nfs-3 and then restarting all k8s-nfs workers [toolsbeta]
12:38 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-10 [toolsbeta]
12:38 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet [production]
12:33 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72194 and previous config saved to /var/cache/conftool/dbconfig/20250121-123352-root.json [production]
12:33 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain [production]
12:32 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72193 and previous config saved to /var/cache/conftool/dbconfig/20250121-123245-root.json [production]
12:32 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to plain [production]
12:32 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
12:32 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2019.codfw.wmnet [production]
12:32 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
12:31 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet [production]
12:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd [production]
12:27 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
12:27 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
12:18 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72190 and previous config saved to /var/cache/conftool/dbconfig/20250121-121847-root.json [production]
12:17 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72189 and previous config saved to /var/cache/conftool/dbconfig/20250121-121739-root.json [production]
12:15 <kart_> Updated cxserver to 2025-01-20-172318-production (T377966, T377813) [production]
12:15 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
12:14 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
12:10 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
12:09 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]
12:09 <fceratto@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2189.codfw.wmnet [production]
12:08 <hnowlan@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on restbase2037.codfw.wmnet with reason: Memory issues, rebooting frequently. Depooled. T383820 [production]
12:05 <federico3> updating db2189.codfw.wmnet for https://phabricator.wikimedia.org/T384202 [production]
12:05 <kartik@deploy2002> helmfile [staging] DONE helmfile.d/services/cxserver: apply [production]
12:04 <fceratto@cumin1002> START - Cookbook sre.mysql.upgrade for db2189.codfw.wmnet [production]
12:03 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72187 and previous config saved to /var/cache/conftool/dbconfig/20250121-120341-root.json [production]
12:03 <kartik@deploy2002> helmfile [staging] START helmfile.d/services/cxserver: apply [production]
12:02 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72186 and previous config saved to /var/cache/conftool/dbconfig/20250121-120234-root.json [production]
12:01 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of aux-k8s-etcd2004.codfw.wmnet to drbd [production]
11:59 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti2019.codfw.wmnet [production]
11:59 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet [production]
11:57 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2019.codfw.wmnet [production]
11:54 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2019.codfw.wmnet [production]
11:48 <marostegui@cumin1002> dbctl commit (dc=all): 'db2203 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P72185 and previous config saved to /var/cache/conftool/dbconfig/20250121-114836-root.json [production]
11:48 <hnowlan@cumin2002> conftool action : set/pooled=no; selector: name=restbase2037.codfw.wmnet [production]
11:47 <marostegui@cumin1002> dbctl commit (dc=all): 'db2216 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P72184 and previous config saved to /var/cache/conftool/dbconfig/20250121-114728-root.json [production]
11:44 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2189.codfw.wmnet with reason: rebuilding index [production]
11:34 <jiji@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-api-ext: apply [production]
11:32 <jiji@deploy2002> helmfile [codfw] START helmfile.d/services/mw-api-ext: apply [production]
11:32 <dcausse@deploy2002> helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
11:31 <dcausse@deploy2002> helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply [production]
11:30 <dcausse@deploy2002> helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
11:30 <dcausse@deploy2002> helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply [production]
11:29 <dcausse@deploy2002> helmfile [staging] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
11:29 <dcausse@deploy2002> helmfile [staging] START helmfile.d/services/rdf-streaming-updater: apply [production]