2025-06-17
12:21 <jmm@cumin1003> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2024.codfw.wmnet [production]
12:20 <jmm@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on ganeti2023.codfw.wmnet with reason: remove for decom [production]
12:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P78172 and previous config saved to /var/cache/conftool/dbconfig/20250617-121805-marostegui.json [production]
12:18 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply [production]
12:17 <samtar@deploy1003> samtar: Continuing with sync [production]
12:16 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply [production]
12:16 <samtar@deploy1003> samtar: Backport for [[gerrit:1155665|IS: Enable `wgTemplateDataEnableDiscovery` for mediawikiwiki (T377975)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
12:14 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P78171 and previous config saved to /var/cache/conftool/dbconfig/20250617-121412-ladsgroup.json [production]
12:14 <samtar@deploy1003> Started scap sync-world: Backport for [[gerrit:1155665|IS: Enable `wgTemplateDataEnableDiscovery` for mediawikiwiki (T377975)]] [production]
12:10 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on thanos-be2006.codfw.wmnet with reason: host reimage [production]
12:09 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply [production]
12:08 <mvernon@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on thanos-be2006.codfw.wmnet with reason: host reimage [production]
12:06 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply [production]
12:06 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2023.codfw.wmnet [production]
12:02 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P78170 and previous config saved to /var/cache/conftool/dbconfig/20250617-120257-marostegui.json [production]
11:59 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cuminunpriv1001.eqiad.wmnet [production]
11:59 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P78169 and previous config saved to /var/cache/conftool/dbconfig/20250617-115905-ladsgroup.json [production]
11:56 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host cuminunpriv1001.eqiad.wmnet [production]
11:48 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host thanos-be2006.codfw.wmnet with OS bullseye [production]
11:47 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2174 (T396130)', diff saved to https://phabricator.wikimedia.org/P78168 and previous config saved to /var/cache/conftool/dbconfig/20250617-114750-marostegui.json [production]
11:47 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
11:46 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
11:46 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
11:46 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
11:45 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
11:45 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
11:43 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T382778)', diff saved to https://phabricator.wikimedia.org/P78167 and previous config saved to /var/cache/conftool/dbconfig/20250617-114357-ladsgroup.json [production]
11:41 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
11:41 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
11:40 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
11:40 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1172 (T382778)', diff saved to https://phabricator.wikimedia.org/P78166 and previous config saved to /var/cache/conftool/dbconfig/20250617-114037-ladsgroup.json [production]
11:40 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
11:40 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1172.eqiad.wmnet with reason: Maintenance [production]
11:39 <ladsgroup@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1171.eqiad.wmnet with reason: Maintenance [production]
11:39 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167 (T382778)', diff saved to https://phabricator.wikimedia.org/P78165 and previous config saved to /var/cache/conftool/dbconfig/20250617-113915-ladsgroup.json [production]
11:38 <jiji@cumin1002> START - Cookbook sre.dns.netbox [production]
11:38 <jiji@cumin1002> START - Cookbook sre.ganeti.makevm for new host wikikube-worker-exp1001.eqiad.wmnet [production]
11:24 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P78164 and previous config saved to /var/cache/conftool/dbconfig/20250617-112408-ladsgroup.json [production]
11:22 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2174 (T396130)', diff saved to https://phabricator.wikimedia.org/P78163 and previous config saved to /var/cache/conftool/dbconfig/20250617-112222-marostegui.json [production]
11:22 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2174.codfw.wmnet with reason: Maintenance [production]
11:22 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2173 (T396130)', diff saved to https://phabricator.wikimedia.org/P78162 and previous config saved to /var/cache/conftool/dbconfig/20250617-112200-marostegui.json [production]
11:20 <samtar@deploy1003> Finished scap sync-world: Backport for [[gerrit:1151831|InitialiseSettings: wgTemplateDataEnableDiscovery on more wikis (T377975)]] (duration: 11m 36s) [production]
11:13 <samtar@deploy1003> samwilson, samtar: Continuing with sync [production]
11:10 <samtar@deploy1003> samwilson, samtar: Backport for [[gerrit:1151831|InitialiseSettings: wgTemplateDataEnableDiscovery on more wikis (T377975)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
11:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1167', diff saved to https://phabricator.wikimedia.org/P78161 and previous config saved to /var/cache/conftool/dbconfig/20250617-110900-ladsgroup.json [production]
11:08 <samtar@deploy1003> Started scap sync-world: Backport for [[gerrit:1151831|InitialiseSettings: wgTemplateDataEnableDiscovery on more wikis (T377975)]] [production]
11:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2173', diff saved to https://phabricator.wikimedia.org/P78160 and previous config saved to /var/cache/conftool/dbconfig/20250617-110652-marostegui.json [production]
11:01 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.upgrade (exit_code=0) upgradeing A:liberica-canary (T397053) [production]
11:00 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) pooling A:liberica-canary [production]
11:00 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.admin pooling A:liberica-canary [production]