351-400 of 10000 results (125ms)
2025-04-30 ยง
13:01 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2041.codfw.wmnet [production]
13:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow7001.magru.wmnet [production]
12:59 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2040.codfw.wmnet [production]
12:58 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2040.codfw.wmnet [production]
12:57 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow6001.drmrs.wmnet [production]
12:55 <damilare> config revision changed from 817b0c94 to 45e49fec [production]
12:55 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P75703 and previous config saved to /var/cache/conftool/dbconfig/20250430-125525-fceratto.json [production]
12:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow6001.drmrs.wmnet [production]
12:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2040.codfw.wmnet [production]
12:50 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:50 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:50 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2040.codfw.wmnet [production]
12:49 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:49 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
12:49 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow5002.eqsin.wmnet [production]
12:48 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2039.codfw.wmnet [production]
12:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2039.codfw.wmnet [production]
12:44 <godog> reboot alert2002 [production]
12:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow5002.eqsin.wmnet [production]
12:43 <filippo@cumin1002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1:00:00 on alert2002.wikimedia.org with reason: new kernel [production]
12:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2039.codfw.wmnet [production]
12:42 <filippo@cumin1002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 4:00:00 on alert2002.wikimedia.org with reason: kernel [production]
12:41 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow4002.ulsfo.wmnet [production]
12:40 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1161 (T392806)', diff saved to https://phabricator.wikimedia.org/P75702 and previous config saved to /var/cache/conftool/dbconfig/20250430-124018-fceratto.json [production]
12:38 <filippo@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2002.wikimedia.org [production]
12:38 <filippo@cumin1002> START - Cookbook sre.hosts.reboot-single for host alert2002.wikimedia.org [production]
12:37 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2039.codfw.wmnet [production]
12:36 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow4002.ulsfo.wmnet [production]
12:33 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1161 (T392806)', diff saved to https://phabricator.wikimedia.org/P75701 and previous config saved to /var/cache/conftool/dbconfig/20250430-123327-fceratto.json [production]
12:33 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
12:33 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
12:32 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T392806)', diff saved to https://phabricator.wikimedia.org/P75700 and previous config saved to /var/cache/conftool/dbconfig/20250430-123255-fceratto.json [production]
12:30 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2038.codfw.wmnet [production]
12:30 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2038.codfw.wmnet [production]
12:28 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow3003.esams.wmnet [production]
12:25 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2038.codfw.wmnet [production]
12:24 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow3003.esams.wmnet [production]
12:18 <XioNoX> test `host-inbound-traffic system-services any-service` on mr1-ulsfo [production]
12:17 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P75699 and previous config saved to /var/cache/conftool/dbconfig/20250430-121749-fceratto.json [production]
12:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P75698 and previous config saved to /var/cache/conftool/dbconfig/20250430-120242-fceratto.json [production]
12:02 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow2003.codfw.wmnet [production]
12:01 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2038.codfw.wmnet [production]
11:58 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow2003.codfw.wmnet [production]
11:55 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host netflow1002.eqiad.wmnet [production]
11:51 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host netflow1002.eqiad.wmnet [production]
11:48 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2037.codfw.wmnet [production]
11:48 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2037.codfw.wmnet [production]
11:47 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T392806)', diff saved to https://phabricator.wikimedia.org/P75697 and previous config saved to /var/cache/conftool/dbconfig/20250430-114734-fceratto.json [production]
11:45 <kostajh> Deployed patches for T392976 to wmf.25 and wmf.27 [production]
11:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti2037.codfw.wmnet [production]