2022-11-04
ยง
|
12:21 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 75%: After reboot', diff saved to https://phabricator.wikimedia.org/P38142 and previous config saved to /var/cache/conftool/dbconfig/20221104-122101-root.json |
[production] |
12:19 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
12:18 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
12:18 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2176 (T318955)', diff saved to https://phabricator.wikimedia.org/P38141 and previous config saved to /var/cache/conftool/dbconfig/20221104-121848-ladsgroup.json |
[production] |
12:12 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P38140 and previous config saved to /var/cache/conftool/dbconfig/20221104-121219-ladsgroup.json |
[production] |
12:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 50%: After reboot', diff saved to https://phabricator.wikimedia.org/P38139 and previous config saved to /var/cache/conftool/dbconfig/20221104-120556-root.json |
[production] |
12:03 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P38138 and previous config saved to /var/cache/conftool/dbconfig/20221104-120342-ladsgroup.json |
[production] |
11:57 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1169', diff saved to https://phabricator.wikimedia.org/P38137 and previous config saved to /var/cache/conftool/dbconfig/20221104-115713-ladsgroup.json |
[production] |
11:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 25%: After reboot', diff saved to https://phabricator.wikimedia.org/P38136 and previous config saved to /var/cache/conftool/dbconfig/20221104-115051-root.json |
[production] |
11:48 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P38135 and previous config saved to /var/cache/conftool/dbconfig/20221104-114835-ladsgroup.json |
[production] |
11:42 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1169 (T318955)', diff saved to https://phabricator.wikimedia.org/P38134 and previous config saved to /var/cache/conftool/dbconfig/20221104-114207-ladsgroup.json |
[production] |
11:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1169 (T318955)', diff saved to https://phabricator.wikimedia.org/P38133 and previous config saved to /var/cache/conftool/dbconfig/20221104-113929-ladsgroup.json |
[production] |
11:39 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1169.eqiad.wmnet with reason: Maintenance |
[production] |
11:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1169.eqiad.wmnet with reason: Maintenance |
[production] |
11:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1140.eqiad.wmnet with reason: Maintenance |
[production] |
11:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1140.eqiad.wmnet with reason: Maintenance |
[production] |
11:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1139.eqiad.wmnet with reason: Maintenance |
[production] |
11:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1139.eqiad.wmnet with reason: Maintenance |
[production] |
11:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1135 (T318955)', diff saved to https://phabricator.wikimedia.org/P38132 and previous config saved to /var/cache/conftool/dbconfig/20221104-113725-ladsgroup.json |
[production] |
11:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 10%: After reboot', diff saved to https://phabricator.wikimedia.org/P38131 and previous config saved to /var/cache/conftool/dbconfig/20221104-113546-root.json |
[production] |
11:33 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2176 (T318955)', diff saved to https://phabricator.wikimedia.org/P38130 and previous config saved to /var/cache/conftool/dbconfig/20221104-113329-ladsgroup.json |
[production] |
11:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2176 (T318955)', diff saved to https://phabricator.wikimedia.org/P38129 and previous config saved to /var/cache/conftool/dbconfig/20221104-113048-ladsgroup.json |
[production] |
11:30 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2176.codfw.wmnet with reason: Maintenance |
[production] |
11:30 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2176.codfw.wmnet with reason: Maintenance |
[production] |
11:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2174 (T318955)', diff saved to https://phabricator.wikimedia.org/P38128 and previous config saved to /var/cache/conftool/dbconfig/20221104-113027-ladsgroup.json |
[production] |
11:27 |
<elukey> |
restart kube-apiserver on ml-serve-ctrl2002 - high latencies for LIST (knative resources) |
[production] |
11:22 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P38127 and previous config saved to /var/cache/conftool/dbconfig/20221104-112218-ladsgroup.json |
[production] |
11:20 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 5%: After reboot', diff saved to https://phabricator.wikimedia.org/P38125 and previous config saved to /var/cache/conftool/dbconfig/20221104-112041-root.json |
[production] |
11:15 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P38124 and previous config saved to /var/cache/conftool/dbconfig/20221104-111521-ladsgroup.json |
[production] |
11:07 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1135', diff saved to https://phabricator.wikimedia.org/P38123 and previous config saved to /var/cache/conftool/dbconfig/20221104-110712-ladsgroup.json |
[production] |
11:00 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2174', diff saved to https://phabricator.wikimedia.org/P38122 and previous config saved to /var/cache/conftool/dbconfig/20221104-110014-ladsgroup.json |
[production] |
10:52 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1135 (T318955)', diff saved to https://phabricator.wikimedia.org/P38121 and previous config saved to /var/cache/conftool/dbconfig/20221104-105205-ladsgroup.json |
[production] |
10:50 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'es2020 (re)pooling @ 1%: After reboot', diff saved to https://phabricator.wikimedia.org/P38120 and previous config saved to /var/cache/conftool/dbconfig/20221104-105031-root.json |
[production] |
10:49 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1135 (T318955)', diff saved to https://phabricator.wikimedia.org/P38119 and previous config saved to /var/cache/conftool/dbconfig/20221104-104927-ladsgroup.json |
[production] |
10:49 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1135.eqiad.wmnet with reason: Maintenance |
[production] |
10:49 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1135.eqiad.wmnet with reason: Maintenance |
[production] |
10:45 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2174 (T318955)', diff saved to https://phabricator.wikimedia.org/P38117 and previous config saved to /var/cache/conftool/dbconfig/20221104-104508-ladsgroup.json |
[production] |
10:42 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2174 (T318955)', diff saved to https://phabricator.wikimedia.org/P38116 and previous config saved to /var/cache/conftool/dbconfig/20221104-104227-ladsgroup.json |
[production] |
10:42 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2174.codfw.wmnet with reason: Maintenance |
[production] |
10:42 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2174.codfw.wmnet with reason: Maintenance |
[production] |
07:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2121 (re)pooling @ 100%: After schema change', diff saved to https://phabricator.wikimedia.org/P38115 and previous config saved to /var/cache/conftool/dbconfig/20221104-072722-root.json |
[production] |
07:12 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2121 (re)pooling @ 75%: After schema change', diff saved to https://phabricator.wikimedia.org/P38114 and previous config saved to /var/cache/conftool/dbconfig/20221104-071217-root.json |
[production] |
06:57 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2121 (re)pooling @ 50%: After schema change', diff saved to https://phabricator.wikimedia.org/P38113 and previous config saved to /var/cache/conftool/dbconfig/20221104-065712-root.json |
[production] |
06:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db2121 (re)pooling @ 25%: After schema change', diff saved to https://phabricator.wikimedia.org/P38112 and previous config saved to /var/cache/conftool/dbconfig/20221104-064207-root.json |
[production] |
06:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Give weight to es2021', diff saved to https://phabricator.wikimedia.org/P38111 and previous config saved to /var/cache/conftool/dbconfig/20221104-063250-root.json |
[production] |
06:32 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool es2020 T322389', diff saved to https://phabricator.wikimedia.org/P38110 and previous config saved to /var/cache/conftool/dbconfig/20221104-063224-root.json |
[production] |
06:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Promote es2021 to es4 primary and set section read-write T322389', diff saved to https://phabricator.wikimedia.org/P38109 and previous config saved to /var/cache/conftool/dbconfig/20221104-063128-root.json |
[production] |
06:30 |
<marostegui> |
Starting es4 codfw failover from es2020 to es2021 - T322389 |
[production] |
06:27 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set es2021 with weight 0 T322389', diff saved to https://phabricator.wikimedia.org/P38108 and previous config saved to /var/cache/conftool/dbconfig/20221104-062740-root.json |
[production] |
06:27 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 6 hosts with reason: Primary switchover es4 T322389 |
[production] |