1801-1850 of 10000 results (75ms)
2022-06-30 ยง
08:28 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-cache2001.codfw.wmnet with reason: host reimage [production]
08:28 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host ml-cache2002.codfw.wmnet with OS buster [production]
08:26 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 75%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30650 and previous config saved to /var/cache/conftool/dbconfig/20220630-082644-root.json [production]
08:26 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-cache2001.codfw.wmnet with reason: host reimage [production]
08:19 <elukey@deploy1002> Started deploy [ores/deploy@dfaec93]: Update ores submodule to its latest commit and scap canary settings [production]
08:15 <marostegui@cumin1001> dbctl commit (dc=all): 'db1103 (re)pooling @ 50%: After reimage', diff saved to https://phabricator.wikimedia.org/P30649 and previous config saved to /var/cache/conftool/dbconfig/20220630-081542-root.json [production]
08:12 <elukey@cumin1001> START - Cookbook sre.hosts.reimage for host ml-cache2001.codfw.wmnet with OS buster [production]
08:11 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 50%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30648 and previous config saved to /var/cache/conftool/dbconfig/20220630-081140-root.json [production]
08:00 <marostegui@cumin1001> dbctl commit (dc=all): 'db1103 (re)pooling @ 25%: After reimage', diff saved to https://phabricator.wikimedia.org/P30647 and previous config saved to /var/cache/conftool/dbconfig/20220630-080038-root.json [production]
07:56 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 25%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30646 and previous config saved to /var/cache/conftool/dbconfig/20220630-075637-root.json [production]
07:45 <marostegui@cumin1001> dbctl commit (dc=all): 'db1103 (re)pooling @ 10%: After reimage', diff saved to https://phabricator.wikimedia.org/P30645 and previous config saved to /var/cache/conftool/dbconfig/20220630-074534-root.json [production]
07:42 <slyngs> Move apt repository to Apache2, from Nginx https://gerrit.wikimedia.org/r/c/operations/puppet/+/807983 [production]
07:41 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 10%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30644 and previous config saved to /var/cache/conftool/dbconfig/20220630-074133-root.json [production]
07:30 <marostegui@cumin1001> dbctl commit (dc=all): 'db1103 (re)pooling @ 2%: After reimage', diff saved to https://phabricator.wikimedia.org/P30643 and previous config saved to /var/cache/conftool/dbconfig/20220630-073030-root.json [production]
07:26 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 5%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30642 and previous config saved to /var/cache/conftool/dbconfig/20220630-072629-root.json [production]
07:15 <marostegui@cumin1001> dbctl commit (dc=all): 'db1103 (re)pooling @ 1%: After reimage', diff saved to https://phabricator.wikimedia.org/P30641 and previous config saved to /var/cache/conftool/dbconfig/20220630-071526-root.json [production]
07:15 <marostegui@cumin1001> dbctl commit (dc=all): 'db1103 weight', diff saved to https://phabricator.wikimedia.org/P30640 and previous config saved to /var/cache/conftool/dbconfig/20220630-071522-marostegui.json [production]
07:11 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30639 and previous config saved to /var/cache/conftool/dbconfig/20220630-071125-root.json [production]
06:51 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 2%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30636 and previous config saved to /var/cache/conftool/dbconfig/20220630-065126-root.json [production]
06:37 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1103.eqiad.wmnet with reason: host reimage [production]
06:36 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 1%: After on-site maintenance', diff saved to https://phabricator.wikimedia.org/P30635 and previous config saved to /var/cache/conftool/dbconfig/20220630-063622-root.json [production]
06:33 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1103.eqiad.wmnet with reason: host reimage [production]
06:06 <marostegui@cumin1001> dbctl commit (dc=all): 'Promote db1120 to x1 primary and set section read-write T300472', diff saved to https://phabricator.wikimedia.org/P30633 and previous config saved to /var/cache/conftool/dbconfig/20220630-060601-root.json [production]
06:03 <marostegui> Starting x1 eqiad failover from db1103 to db1120 - T300472 [production]
05:23 <eileen> civicrm upgraded from 9e5a5310 to 55bc690b [production]
05:17 <marostegui@cumin1001> dbctl commit (dc=all): 'Set db1120 with weight 0 T300472', diff saved to https://phabricator.wikimedia.org/P30632 and previous config saved to /var/cache/conftool/dbconfig/20220630-051730-root.json [production]
05:17 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 10 hosts with reason: Primary switchover x1 T300472 [production]
05:17 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on 10 hosts with reason: Primary switchover x1 T300472 [production]
02:59 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db2160.codfw.wmnet with OS bullseye [production]
02:58 <eileen> civicrm upgraded from f48fe112 to 9e5a5310 [production]
02:50 <bmansurov@deploy1002> Finished deploy [airflow-dags/research@b3fe77c]: (no justification provided) (duration: 00m 08s) [production]
02:49 <bmansurov@deploy1002> Started deploy [airflow-dags/research@b3fe77c]: (no justification provided) [production]
02:49 <bmansurov@deploy1002> Finished deploy [airflow-dags/research@b3fe77c]: (no justification provided) (duration: 00m 08s) [production]
02:48 <bmansurov@deploy1002> Started deploy [airflow-dags/research@b3fe77c]: (no justification provided) [production]
02:48 <bmansurov@deploy1002> deploy aborted: (no justification provided) (duration: 00m 02s) [production]
02:48 <bmansurov@deploy1002> Started deploy [airflow-dags/research@b3fe77c]: (no justification provided) [production]
02:47 <bmansurov@deploy1002> Finished deploy [airflow-dags/research@b3fe77c]: (no justification provided) (duration: 00m 03s) [production]
02:47 <bmansurov@deploy1002> Started deploy [airflow-dags/research@b3fe77c]: (no justification provided) [production]
02:18 <bmansurov@deploy1002> Finished deploy [airflow-dags/research@b3fe77c]: (no justification provided) (duration: 00m 08s) [production]
02:18 <bmansurov@deploy1002> Started deploy [airflow-dags/research@b3fe77c]: (no justification provided) [production]
02:17 <bmansurov@deploy1002> deploy aborted: (no justification provided) (duration: 02m 03s) [production]
02:15 <bmansurov@deploy1002> Started deploy [airflow-dags/research@b3fe77c]: (no justification provided) [production]
02:11 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host db2160.codfw.wmnet with OS bullseye [production]
01:48 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2159.codfw.wmnet with OS bullseye [production]
01:36 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2158.codfw.wmnet with OS bullseye [production]
01:34 <eileen> civicrm upgraded from 3cb5e6dd to f48fe112 [production]
01:32 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2159.codfw.wmnet with reason: host reimage [production]
01:27 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2159.codfw.wmnet with reason: host reimage [production]
01:20 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2158.codfw.wmnet with reason: host reimage [production]
01:17 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2158.codfw.wmnet with reason: host reimage [production]