2022-04-07
|
11:46 |
<jynus@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2101.codfw.wmnet with OS bullseye |
[production] |
11:45 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp6013.drmrs.wmnet with OS buster |
[production] |
11:35 |
<mmandere> |
depool cp6013 for reimage - T290005 |
[production] |
11:35 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1140.eqiad.wmnet with reason: host reimage |
[production] |
11:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P24240 and previous config saved to /var/cache/conftool/dbconfig/20220407-113455-ladsgroup.json |
[production] |
11:34 |
<mmandere@cumin1001> |
START - Cookbook sre.hosts.reimage for host cp3051.esams.wmnet with OS buster |
[production] |
11:32 |
<jforrester@deploy1002> |
Finished deploy [integration/docroot@d88e2fa]: d88e2fa19fd6 [WikiLambda] Fix link typo and re-group/re-word other links (duration: 00m 09s) |
[production] |
11:32 |
<jforrester@deploy1002> |
Started deploy [integration/docroot@d88e2fa]: d88e2fa19fd6 [WikiLambda] Fix link typo and re-group/re-word other links |
[production] |
11:31 |
<jynus@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2101.codfw.wmnet with reason: host reimage |
[production] |
11:31 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1140.eqiad.wmnet with reason: host reimage |
[production] |
11:28 |
<jynus@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2101.codfw.wmnet with reason: host reimage |
[production] |
11:23 |
<mmandere> |
depool cp3051 for reimage - T290005 |
[production] |
11:23 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.reimage for host db1140.eqiad.wmnet with OS bullseye |
[production] |
11:19 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1174 (T305300)', diff saved to https://phabricator.wikimedia.org/P24239 and previous config saved to /var/cache/conftool/dbconfig/20220407-111950-ladsgroup.json |
[production] |
11:17 |
<jynus@cumin2002> |
START - Cookbook sre.hosts.reimage for host db2101.codfw.wmnet with OS bullseye |
[production] |
11:17 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1139.eqiad.wmnet with OS bullseye |
[production] |
11:16 |
<mvolz@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/citoid: apply |
[production] |
11:15 |
<mvolz@deploy1002> |
helmfile [eqiad] START helmfile.d/services/citoid: apply |
[production] |
11:12 |
<jynus@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2100.codfw.wmnet with OS bullseye |
[production] |
11:03 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1139.eqiad.wmnet with reason: host reimage |
[production] |
10:59 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1139.eqiad.wmnet with reason: host reimage |
[production] |
10:59 |
<mmandere> |
pool cp3053 with HAProxy as TLS termination layer - T290005 |
[production] |
10:58 |
<jynus@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2100.codfw.wmnet with reason: host reimage |
[production] |
10:55 |
<jynus@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2100.codfw.wmnet with reason: host reimage |
[production] |
10:55 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3053.esams.wmnet with OS buster |
[production] |
10:51 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.reimage for host db1139.eqiad.wmnet with OS bullseye |
[production] |
10:45 |
<btullis@deploy1002> |
helmfile [staging] DONE helmfile.d/services/datahub: sync on main |
[production] |
10:44 |
<jynus@cumin2002> |
START - Cookbook sre.hosts.reimage for host db2100.codfw.wmnet with OS bullseye |
[production] |
10:43 |
<btullis@deploy1002> |
helmfile [staging] START helmfile.d/services/datahub: apply on main |
[production] |
10:41 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1116.eqiad.wmnet with OS bullseye |
[production] |
10:40 |
<mmandere> |
pool cp6006 with HAProxy as TLS termination layer - T290005 |
[production] |
10:37 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on kubemaster2002.codfw.wmnet with reason: reimage |
[production] |
10:37 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.downtime for 3:00:00 on kubemaster2002.codfw.wmnet with reason: reimage |
[production] |
10:37 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
10:37 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
10:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1099:3311 (T298565)', diff saved to https://phabricator.wikimedia.org/P24238 and previous config saved to /var/cache/conftool/dbconfig/20220407-103739-ladsgroup.json |
[production] |
10:37 |
<mmandere@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp6006.drmrs.wmnet with OS buster |
[production] |
10:36 |
<mvolz@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/citoid: apply |
[production] |
10:36 |
<mvolz@deploy1002> |
helmfile [codfw] START helmfile.d/services/citoid: apply |
[production] |
10:35 |
<btullis@deploy1002> |
helmfile [staging] DONE helmfile.d/services/datahub: apply on main |
[production] |
10:35 |
<btullis@deploy1002> |
helmfile [staging] START helmfile.d/services/datahub: apply on main |
[production] |
10:28 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1146:3312 (re)pooling @ 100%: After schema change', diff saved to https://phabricator.wikimedia.org/P24237 and previous config saved to /var/cache/conftool/dbconfig/20220407-102821-root.json |
[production] |
10:28 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1116.eqiad.wmnet with reason: host reimage |
[production] |
10:27 |
<jynus@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2099.codfw.wmnet with OS bullseye |
[production] |
10:25 |
<mvolz@deploy1002> |
helmfile [staging] DONE helmfile.d/services/citoid: apply |
[production] |
10:24 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1116.eqiad.wmnet with reason: host reimage |
[production] |
10:24 |
<mvolz@deploy1002> |
helmfile [staging] START helmfile.d/services/citoid: apply |
[production] |
10:22 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1099:3311', diff saved to https://phabricator.wikimedia.org/P24236 and previous config saved to /var/cache/conftool/dbconfig/20220407-102234-ladsgroup.json |
[production] |
10:20 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. |
[production] |
10:20 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. |
[production] |