301-350 of 10000 results (113ms)
2025-06-10 ยง
12:06 <taavi@cumin1003> START - Cookbook sre.dns.netbox [production]
12:06 <taavi@cumin1003> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
12:06 <jmm@dns1004> END - running authdns-update [production]
12:05 <jmm@dns1004> START - running authdns-update [production]
12:04 <marostegui@cumin1002> dbctl commit (dc=all): 'db1168 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P77506 and previous config saved to /var/cache/conftool/dbconfig/20250610-120412-root.json [production]
12:03 <taavi@cumin1003> START - Cookbook sre.dns.netbox [production]
12:02 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T395241)', diff saved to https://phabricator.wikimedia.org/P77505 and previous config saved to /var/cache/conftool/dbconfig/20250610-120249-fceratto.json [production]
12:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P77504 and previous config saved to /var/cache/conftool/dbconfig/20250610-120103-marostegui.json [production]
11:59 <jmm@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host install7002.wikimedia.org [production]
11:54 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1191 (T395241)', diff saved to https://phabricator.wikimedia.org/P77503 and previous config saved to /var/cache/conftool/dbconfig/20250610-115444-fceratto.json [production]
11:54 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
11:54 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T395241)', diff saved to https://phabricator.wikimedia.org/P77502 and previous config saved to /var/cache/conftool/dbconfig/20250610-115419-fceratto.json [production]
11:49 <marostegui@cumin1002> dbctl commit (dc=all): 'db1168 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P77501 and previous config saved to /var/cache/conftool/dbconfig/20250610-114906-root.json [production]
11:48 <moritzm> installing qemu bugfix updates [production]
11:47 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dbstore1007.eqiad.wmnet with reason: Maintenance [production]
11:47 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host install7002.wikimedia.org [production]
11:46 <marostegui@cumin1002> dbctl commit (dc=all): 'db1180 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P77500 and previous config saved to /var/cache/conftool/dbconfig/20250610-114617-root.json [production]
11:45 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189 (T396130)', diff saved to https://phabricator.wikimedia.org/P77499 and previous config saved to /var/cache/conftool/dbconfig/20250610-114556-marostegui.json [production]
11:44 <cgoubert@deploy1003> Finished scap sync-world: mediawiki-cli: Fix the paths of some of the dumps scripts and config files - T394389 (duration: 08m 49s) [production]
11:39 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P77497 and previous config saved to /var/cache/conftool/dbconfig/20250610-113913-fceratto.json [production]
11:37 <moritzm> failover Ganeti master in codfw to ganeti2032 [production]
11:35 <cgoubert@deploy1003> Started scap sync-world: mediawiki-cli: Fix the paths of some of the dumps scripts and config files - T394389 [production]
11:34 <marostegui@cumin1002> dbctl commit (dc=all): 'db1168 (re)pooling @ 20%: Repooling', diff saved to https://phabricator.wikimedia.org/P77495 and previous config saved to /var/cache/conftool/dbconfig/20250610-113401-root.json [production]
11:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2189 (T396130)', diff saved to https://phabricator.wikimedia.org/P77494 and previous config saved to /var/cache/conftool/dbconfig/20250610-113328-marostegui.json [production]
11:33 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2189.codfw.wmnet with reason: Maintenance [production]
11:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175 (T396130)', diff saved to https://phabricator.wikimedia.org/P77493 and previous config saved to /var/cache/conftool/dbconfig/20250610-113306-marostegui.json [production]
11:31 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2044.codfw.wmnet [production]
11:31 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2044.codfw.wmnet [production]
11:31 <marostegui@cumin1002> dbctl commit (dc=all): 'db1180 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P77492 and previous config saved to /var/cache/conftool/dbconfig/20250610-113112-root.json [production]
11:26 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ganeti2044.codfw.wmnet [production]
11:24 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181', diff saved to https://phabricator.wikimedia.org/P77491 and previous config saved to /var/cache/conftool/dbconfig/20250610-112406-fceratto.json [production]
11:21 <jmm@cumin1003> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2044.codfw.wmnet [production]
11:21 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2043.codfw.wmnet [production]
11:20 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2043.codfw.wmnet [production]
11:18 <marostegui@cumin1002> dbctl commit (dc=all): 'db1168 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P77490 and previous config saved to /var/cache/conftool/dbconfig/20250610-111856-root.json [production]
11:17 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P77489 and previous config saved to /var/cache/conftool/dbconfig/20250610-111759-marostegui.json [production]
11:16 <marostegui@cumin1002> dbctl commit (dc=all): 'db1180 (re)pooling @ 60%: Repooling', diff saved to https://phabricator.wikimedia.org/P77488 and previous config saved to /var/cache/conftool/dbconfig/20250610-111606-root.json [production]
11:15 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ganeti2043.codfw.wmnet [production]
11:14 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1168.eqiad.wmnet with reason: Maintenance [production]
11:14 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1168 T395989', diff saved to https://phabricator.wikimedia.org/P77487 and previous config saved to /var/cache/conftool/dbconfig/20250610-111440-marostegui.json [production]
11:10 <marostegui@cumin1002> dbctl commit (dc=all): 'es2033 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P77486 and previous config saved to /var/cache/conftool/dbconfig/20250610-111054-root.json [production]
11:08 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T395241)', diff saved to https://phabricator.wikimedia.org/P77485 and previous config saved to /var/cache/conftool/dbconfig/20250610-110859-fceratto.json [production]
11:04 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
11:04 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
11:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2175', diff saved to https://phabricator.wikimedia.org/P77484 and previous config saved to /var/cache/conftool/dbconfig/20250610-110252-marostegui.json [production]
11:01 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
11:01 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
11:01 <marostegui@cumin1002> dbctl commit (dc=all): 'db1180 (re)pooling @ 40%: Repooling', diff saved to https://phabricator.wikimedia.org/P77483 and previous config saved to /var/cache/conftool/dbconfig/20250610-110101-root.json [production]
10:59 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1181 (T395241)', diff saved to https://phabricator.wikimedia.org/P77482 and previous config saved to /var/cache/conftool/dbconfig/20250610-105951-fceratto.json [production]
10:59 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1181.eqiad.wmnet with reason: Maintenance [production]