651-700 of 10000 results (72ms)
2023-08-24 ยง
10:32 <fabfur> stopping pybal and rebooting lvs1019 (T344587) [production]
10:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P51285 and previous config saved to /var/cache/conftool/dbconfig/20230824-103153-ladsgroup.json [production]
10:28 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2170:3312 (T344589)', diff saved to https://phabricator.wikimedia.org/P51284 and previous config saved to /var/cache/conftool/dbconfig/20230824-102848-ladsgroup.json [production]
10:22 <jiji@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=codfw [production]
10:22 <mvolz@deploy1002> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
10:22 <effie> pool kartotherian on codfw [production]
10:21 <mvolz@deploy1002> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
10:21 <jayme@deploy1002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply [production]
10:20 <jayme@deploy1002> helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply [production]
10:19 <jiji@deploy1002> helmfile [codfw] DONE helmfile.d/services/tegola-vector-tiles: apply [production]
10:18 <jiji@deploy1002> helmfile [codfw] START helmfile.d/services/tegola-vector-tiles: apply [production]
10:17 <mvolz@deploy1002> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
10:16 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1194 (T343718)', diff saved to https://phabricator.wikimedia.org/P51283 and previous config saved to /var/cache/conftool/dbconfig/20230824-101647-ladsgroup.json [production]
10:16 <mvolz@deploy1002> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
10:15 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2170:3311 (T344589)', diff saved to https://phabricator.wikimedia.org/P51282 and previous config saved to /var/cache/conftool/dbconfig/20230824-101527-ladsgroup.json [production]
10:15 <effie> Disable puppet on thanos-fe (eqiad), rollout cfssl on thanos-fe in codfw [production]
10:14 <mvolz@deploy1002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
10:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1194 (T343718)', diff saved to https://phabricator.wikimedia.org/P51281 and previous config saved to /var/cache/conftool/dbconfig/20230824-101437-ladsgroup.json [production]
10:14 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
10:14 <mvolz@deploy1002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
10:14 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1194.eqiad.wmnet with reason: Maintenance [production]
10:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T343718)', diff saved to https://phabricator.wikimedia.org/P51280 and previous config saved to /var/cache/conftool/dbconfig/20230824-101405-ladsgroup.json [production]
10:08 <mvolz@deploy1002> helmfile [eqiad] DONE helmfile.d/services/zotero: apply [production]
10:08 <mvolz@deploy1002> helmfile [eqiad] START helmfile.d/services/zotero: apply [production]
10:06 <mvolz@deploy1002> helmfile [codfw] DONE helmfile.d/services/zotero: apply [production]
10:06 <mvolz@deploy1002> helmfile [codfw] START helmfile.d/services/zotero: apply [production]
10:04 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host pybal-test2003.codfw.wmnet [production]
10:03 <mvolz@deploy1002> helmfile [staging] DONE helmfile.d/services/zotero: apply [production]
10:03 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2150 (T343718)', diff saved to https://phabricator.wikimedia.org/P51279 and previous config saved to /var/cache/conftool/dbconfig/20230824-100321-ladsgroup.json [production]
10:03 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance [production]
10:03 <mvolz@deploy1002> helmfile [staging] START helmfile.d/services/zotero: apply [production]
10:03 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance [production]
10:03 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2122 (T343718)', diff saved to https://phabricator.wikimedia.org/P51278 and previous config saved to /var/cache/conftool/dbconfig/20230824-100259-ladsgroup.json [production]
10:02 <fabfur> end reboot of lvs1020 (pybal service enabled) (T344587) [production]
10:00 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host pybal-test2003.codfw.wmnet [production]
10:00 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2170:3311', diff saved to https://phabricator.wikimedia.org/P51277 and previous config saved to /var/cache/conftool/dbconfig/20230824-100021-ladsgroup.json [production]
09:58 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P51276 and previous config saved to /var/cache/conftool/dbconfig/20230824-095858-ladsgroup.json [production]
09:57 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet [production]
09:57 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1020.eqiad.wmnet [production]
09:54 <fabfur@cumin1001> START - Cookbook sre.hosts.reboot-single for host lvs1020.eqiad.wmnet [production]
09:53 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet [production]
09:52 <fabfur> reboot lvs1020 to apply patch (T344587) [production]
09:51 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]
09:51 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance [production]
09:51 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1222 (T344589)', diff saved to https://phabricator.wikimedia.org/P51275 and previous config saved to /var/cache/conftool/dbconfig/20230824-095117-ladsgroup.json [production]
09:49 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host karapace1002.eqiad.wmnet [production]
09:47 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P51274 and previous config saved to /var/cache/conftool/dbconfig/20230824-094753-ladsgroup.json [production]
09:45 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test2001.codfw.wmnet [production]
09:45 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host karapace1002.eqiad.wmnet [production]
09:45 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2170:3311', diff saved to https://phabricator.wikimedia.org/P51273 and previous config saved to /var/cache/conftool/dbconfig/20230824-094515-ladsgroup.json [production]