2022-06-30
12:36 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.decommission for hosts db2083.codfw.wmnet |
[production] |
12:29 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db1128 (re)pooling @ 100%: Maint done', diff saved to https://phabricator.wikimedia.org/P30660 and previous config saved to /var/cache/conftool/dbconfig/20220630-122931-ladsgroup.json |
[production] |
12:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db1128 (re)pooling @ 75%: Maint done', diff saved to https://phabricator.wikimedia.org/P30659 and previous config saved to /var/cache/conftool/dbconfig/20220630-121427-ladsgroup.json |
[production] |
11:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db1128 (re)pooling @ 25%: Maint done', diff saved to https://phabricator.wikimedia.org/P30658 and previous config saved to /var/cache/conftool/dbconfig/20220630-115923-ladsgroup.json |
[production] |
11:44 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'db1128 (re)pooling @ 10%: Maint done', diff saved to https://phabricator.wikimedia.org/P30657 and previous config saved to /var/cache/conftool/dbconfig/20220630-114419-ladsgroup.json |
[production] |
09:47 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-cache2003.codfw.wmnet with OS buster |
[production] |
09:42 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove db2083 from dbctl', diff saved to https://phabricator.wikimedia.org/P30655 and previous config saved to /var/cache/conftool/dbconfig/20220630-094239-marostegui.json |
[production] |
09:36 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-cache2002.codfw.wmnet with OS buster |
[production] |
09:35 |
<volans@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on sretest1001.eqiad.wmnet with reason: Testing |
[production] |
09:35 |
<volans@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:20:00 on sretest1001.eqiad.wmnet with reason: Testing |
[production] |
08:57 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-cache2001.codfw.wmnet with OS buster |
[production] |
08:56 |
<elukey@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on ml-cache2003.codfw.wmnet with reason: host reimage |
[production] |
08:56 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2003.codfw.wmnet with reason: host reimage |
[production] |
08:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove weight from x1 master - not needed anymore', diff saved to https://phabricator.wikimedia.org/P30654 and previous config saved to /var/cache/conftool/dbconfig/20220630-084621-marostegui.json |
[production] |
08:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 100%: After reimage', diff saved to https://phabricator.wikimedia.org/P30653 and previous config saved to /var/cache/conftool/dbconfig/20220630-084550-root.json |
[production] |
08:44 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-cache2002.codfw.wmnet with reason: host reimage |
[production] |
08:42 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host ml-cache2003.codfw.wmnet with OS buster |
[production] |
08:42 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2002.codfw.wmnet with reason: host reimage |
[production] |
08:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 100%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30652 and previous config saved to /var/cache/conftool/dbconfig/20220630-084148-root.json |
[production] |
08:33 |
<elukey@deploy1002> |
Finished deploy [ores/deploy@dfaec93]: Update ores submodule to its latest commit and scap canary settings (duration: 14m 48s) |
[production] |
08:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 75%: After reimage', diff saved to https://phabricator.wikimedia.org/P30651 and previous config saved to /var/cache/conftool/dbconfig/20220630-083046-root.json |
[production] |
08:28 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-cache2001.codfw.wmnet with reason: host reimage |
[production] |
08:28 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host ml-cache2002.codfw.wmnet with OS buster |
[production] |
08:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 75%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30650 and previous config saved to /var/cache/conftool/dbconfig/20220630-082644-root.json |
[production] |
08:26 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2001.codfw.wmnet with reason: host reimage |
[production] |
08:19 |
<elukey@deploy1002> |
Started deploy [ores/deploy@dfaec93]: Update ores submodule to its latest commit and scap canary settings |
[production] |
08:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 50%: After reimage', diff saved to https://phabricator.wikimedia.org/P30649 and previous config saved to /var/cache/conftool/dbconfig/20220630-081542-root.json |
[production] |
08:12 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reimage for host ml-cache2001.codfw.wmnet with OS buster |
[production] |
08:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 50%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30648 and previous config saved to /var/cache/conftool/dbconfig/20220630-081140-root.json |
[production] |
08:00 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 25%: After reimage', diff saved to https://phabricator.wikimedia.org/P30647 and previous config saved to /var/cache/conftool/dbconfig/20220630-080038-root.json |
[production] |
07:56 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 25%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30646 and previous config saved to /var/cache/conftool/dbconfig/20220630-075637-root.json |
[production] |
07:45 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 10%: After reimage', diff saved to https://phabricator.wikimedia.org/P30645 and previous config saved to /var/cache/conftool/dbconfig/20220630-074534-root.json |
[production] |
07:42 |
<slyngs> |
Move apt repository to Apache2, from Nginx https://gerrit.wikimedia.org/r/c/operations/puppet/+/807983 |
[production] |
07:41 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 10%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30644 and previous config saved to /var/cache/conftool/dbconfig/20220630-074133-root.json |
[production] |
07:30 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 2%: After reimage', diff saved to https://phabricator.wikimedia.org/P30643 and previous config saved to /var/cache/conftool/dbconfig/20220630-073030-root.json |
[production] |
07:26 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 5%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30642 and previous config saved to /var/cache/conftool/dbconfig/20220630-072629-root.json |
[production] |
07:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 (re)pooling @ 1%: After reimage', diff saved to https://phabricator.wikimedia.org/P30641 and previous config saved to /var/cache/conftool/dbconfig/20220630-071526-root.json |
[production] |
07:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1103 weight', diff saved to https://phabricator.wikimedia.org/P30640 and previous config saved to /var/cache/conftool/dbconfig/20220630-071522-marostegui.json |
[production] |
07:11 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30639 and previous config saved to /var/cache/conftool/dbconfig/20220630-071125-root.json |
[production] |
06:51 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30636 and previous config saved to /var/cache/conftool/dbconfig/20220630-065126-root.json |
[production] |
06:37 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1103.eqiad.wmnet with reason: host reimage |
[production] |
06:36 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1173 (re)pooling @ 1%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30635 and previous config saved to /var/cache/conftool/dbconfig/20220630-063622-root.json |
[production] |
06:33 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1103.eqiad.wmnet with reason: host reimage |
[production] |
06:06 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Promote db1120 to x1 primary and set section read-write T300472', diff saved to https://phabricator.wikimedia.org/P30633 and previous config saved to /var/cache/conftool/dbconfig/20220630-060601-root.json |
[production] |
06:03 |
<marostegui> |
Starting x1 eqiad failover from db1103 to db1120 - T300472 |
[production] |
05:23 |
<eileen> |
civicrm upgraded from 9e5a5310 to 55bc690b |
[production] |
05:17 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Set db1120 with weight 0 T300472', diff saved to https://phabricator.wikimedia.org/P30632 and previous config saved to /var/cache/conftool/dbconfig/20220630-051730-root.json |
[production] |
05:17 |
<marostegui@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 10 hosts with reason: Primary switchover x1 T300472 |
[production] |
05:17 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 10 hosts with reason: Primary switchover x1 T300472 |
[production] |
02:59 |
<pt1979@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db2160.codfw.wmnet with OS bullseye |
[production] |