2025-06-17
13:33 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.upgrade (exit_code=0) restarting A:liberica-canary (T397053) [production]
13:33 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) pooling A:liberica-canary [production]
13:32 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ganeti1047.eqiad.wmnet [production]
13:32 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin pooling A:liberica-canary [production]
13:32 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling A:liberica-canary [production]
13:32 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling A:liberica-canary [production]
13:32 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.upgrade restarting A:liberica-canary (T397053) [production]
13:29 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P78186 and previous config saved to /var/cache/conftool/dbconfig/20250617-132902-marostegui.json [production]
13:28 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.upgrade (exit_code=0) restarting A:liberica-canary (T397053) [production]
13:28 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) pooling A:liberica-canary [production]
13:28 <tgr@deploy1003> tgr: Continuing with sync [production]
13:27 <isaranto@deploy1003> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
13:27 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin pooling A:liberica-canary [production]
13:27 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling A:liberica-canary [production]
13:27 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling A:liberica-canary [production]
13:27 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.upgrade restarting A:liberica-canary (T397053) [production]
13:27 <isaranto@deploy1003> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
13:25 <tgr@deploy1003> tgr: Backport for [[gerrit:1160138|Fix GetSecurityLogContext hook declaration (T395204)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:23 <tgr@deploy1003> Started scap sync-world: Backport for [[gerrit:1160138|Fix GetSecurityLogContext hook declaration (T395204)]] [production]
13:23 <jiji@cumin1002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host wikikube-worker-exp1001.eqiad.wmnet [production]
13:22 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker-exp1001.eqiad.wmnet with OS bookworm [production]
13:22 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage [production]
13:21 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1178 (T382778)', diff saved to https://phabricator.wikimedia.org/P78185 and previous config saved to /var/cache/conftool/dbconfig/20250617-132157-ladsgroup.json [production]
13:19 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.upgrade (exit_code=0) restarting A:liberica-canary (T397053) [production]
13:19 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) pooling A:liberica-canary [production]
13:19 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin pooling A:liberica-canary [production]
13:19 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling A:liberica-canary [production]
13:18 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin depooling A:liberica-canary [production]
13:18 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 22:00:00 on 10 hosts with reason: Maintenance [production]
13:18 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.upgrade restarting A:liberica-canary (T397053) [production]
13:18 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage [production]
13:18 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1178 (T382778)', diff saved to https://phabricator.wikimedia.org/P78184 and previous config saved to /var/cache/conftool/dbconfig/20250617-131824-ladsgroup.json [production]
13:18 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1178.eqiad.wmnet with reason: Maintenance [production]
13:18 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177 (T382778)', diff saved to https://phabricator.wikimedia.org/P78183 and previous config saved to /var/cache/conftool/dbconfig/20250617-131803-ladsgroup.json [production]
13:17 <tgr@deploy1003> Sync cancelled. [production]
13:13 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P78182 and previous config saved to /var/cache/conftool/dbconfig/20250617-131354-marostegui.json [production]
13:12 <tgr@deploy1003> tgr: Backport for [[gerrit:1153626|Use GetSecurityLogContext hook for goodpass/badpass logging (T395204)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:10 <tgr@deploy1003> Started scap sync-world: Backport for [[gerrit:1153626|Use GetSecurityLogContext hook for goodpass/badpass logging (T395204)]] [production]
13:10 <sukhe@cumin1002> START - Cookbook sre.cdn.roll-upgrade-ats Rolling upgrade of ATS on A:magru and not P{cp7002*} and A:cp - 9.2.10 upgrade (T390912) [production]
13:06 <jiji@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker-exp1001.eqiad.wmnet with reason: host reimage [production]
13:02 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P78180 and previous config saved to /var/cache/conftool/dbconfig/20250617-130256-ladsgroup.json [production]
13:02 <jiji@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker-exp1001.eqiad.wmnet with reason: host reimage [production]
12:58 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2176 (T396130)', diff saved to https://phabricator.wikimedia.org/P78179 and previous config saved to /var/cache/conftool/dbconfig/20250617-125847-marostegui.json [production]
12:56 <sukhe@cumin1002> START - Cookbook sre.hosts.reimage for host durum5002.eqsin.wmnet with OS bookworm [production]
12:56 <sukhe@cumin1002> START - Cookbook sre.hosts.reimage for host durum6001.drmrs.wmnet with OS bookworm [production]
12:50 <taavi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:50 <taavi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add eqiad1 auth v6 VIPs - taavi@cumin1003" [production]
12:50 <taavi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add eqiad1 auth v6 VIPs - taavi@cumin1003" [production]
12:50 <jiji@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker-exp1001.eqiad.wmnet with OS bookworm [production]
12:47 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1177', diff saved to https://phabricator.wikimedia.org/P78178 and previous config saved to /var/cache/conftool/dbconfig/20250617-124748-ladsgroup.json [production]