2051-2100 of 10000 results (112ms)
2023-08-30 ยง
10:24 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1170.eqiad.wmnet with reason: Maintenance [production]
10:24 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1170.eqiad.wmnet with reason: Maintenance [production]
10:24 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156 (T343718)', diff saved to https://phabricator.wikimedia.org/P52051 and previous config saved to /var/cache/conftool/dbconfig/20230830-102432-ladsgroup.json [production]
10:22 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2131 (T344589)', diff saved to https://phabricator.wikimedia.org/P52050 and previous config saved to /var/cache/conftool/dbconfig/20230830-102241-ladsgroup.json [production]
10:21 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1029.eqiad.wmnet [production]
10:21 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1029.eqiad.wmnet [production]
10:20 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
10:20 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
10:18 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
10:16 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
10:16 <hnowlan@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
10:16 <godog> +50g to prometheus eqiad 'services' instance [production]
10:15 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1029.eqiad.wmnet [production]
10:14 <hnowlan@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
10:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2131 (T344589)', diff saved to https://phabricator.wikimedia.org/P52049 and previous config saved to /var/cache/conftool/dbconfig/20230830-101437-ladsgroup.json [production]
10:14 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2131.codfw.wmnet with reason: Maintenance [production]
10:14 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2131.codfw.wmnet with reason: Maintenance [production]
10:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2096 (T344589)', diff saved to https://phabricator.wikimedia.org/P52048 and previous config saved to /var/cache/conftool/dbconfig/20230830-101410-ladsgroup.json [production]
10:13 <hnowlan@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
10:12 <hnowlan@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
10:09 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P52047 and previous config saved to /var/cache/conftool/dbconfig/20230830-100926-ladsgroup.json [production]
10:07 <jiji@cumin1001> START - Cookbook sre.k8s.reboot-nodes rolling reboot on A:wikikube-worker-codfw [production]
10:06 <effie> Rolling reboot codfw wikikube k8s nodes [production]
10:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db2138:3312 (T343718)', diff saved to https://phabricator.wikimedia.org/P52046 and previous config saved to /var/cache/conftool/dbconfig/20230830-100413-ladsgroup.json [production]
10:04 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2138.codfw.wmnet with reason: Maintenance [production]
10:03 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2138.codfw.wmnet with reason: Maintenance [production]
10:03 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2126 (T343718)', diff saved to https://phabricator.wikimedia.org/P52045 and previous config saved to /var/cache/conftool/dbconfig/20230830-100351-ladsgroup.json [production]
09:59 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2096', diff saved to https://phabricator.wikimedia.org/P52044 and previous config saved to /var/cache/conftool/dbconfig/20230830-095903-ladsgroup.json [production]
09:58 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1029.eqiad.wmnet [production]
09:56 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores1006.eqiad.wmnet [production]
09:54 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156', diff saved to https://phabricator.wikimedia.org/P52043 and previous config saved to /var/cache/conftool/dbconfig/20230830-095419-ladsgroup.json [production]
09:49 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores1006.eqiad.wmnet [production]
09:48 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2126', diff saved to https://phabricator.wikimedia.org/P52042 and previous config saved to /var/cache/conftool/dbconfig/20230830-094845-ladsgroup.json [production]
09:47 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores1005.eqiad.wmnet [production]
09:43 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2096', diff saved to https://phabricator.wikimedia.org/P52041 and previous config saved to /var/cache/conftool/dbconfig/20230830-094357-ladsgroup.json [production]
09:40 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores1005.eqiad.wmnet [production]
09:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores1004.eqiad.wmnet [production]
09:39 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1156 (T343718)', diff saved to https://phabricator.wikimedia.org/P52040 and previous config saved to /var/cache/conftool/dbconfig/20230830-093913-ladsgroup.json [production]
09:33 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2126', diff saved to https://phabricator.wikimedia.org/P52039 and previous config saved to /var/cache/conftool/dbconfig/20230830-093339-ladsgroup.json [production]
09:33 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ores1004.eqiad.wmnet [production]
09:32 <urbanecm@deploy1002> helmfile [codfw] DONE helmfile.d/services/linkrecommendation: apply [production]
09:32 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1027.eqiad.wmnet [production]
09:32 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1027.eqiad.wmnet [production]
09:31 <urbanecm@deploy1002> helmfile [codfw] START helmfile.d/services/linkrecommendation: apply [production]
09:31 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ores1003.eqiad.wmnet [production]
09:30 <urbanecm@deploy1002> helmfile [eqiad] DONE helmfile.d/services/linkrecommendation: apply [production]
09:28 <urbanecm@deploy1002> helmfile [eqiad] START helmfile.d/services/linkrecommendation: apply [production]
09:28 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2096 (T344589)', diff saved to https://phabricator.wikimedia.org/P52038 and previous config saved to /var/cache/conftool/dbconfig/20230830-092851-ladsgroup.json [production]
09:28 <urbanecm@deploy1002> helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply [production]
09:27 <urbanecm@deploy1002> helmfile [staging] START helmfile.d/services/linkrecommendation: apply [production]