2022-11-03
ยง
|
13:01 |
<marostegui@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance |
[production] |
13:01 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1174 (T321123)', diff saved to https://phabricator.wikimedia.org/P37955 and previous config saved to /var/cache/conftool/dbconfig/20221103-130117-marostegui.json |
[production] |
13:01 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1106', diff saved to https://phabricator.wikimedia.org/P37954 and previous config saved to /var/cache/conftool/dbconfig/20221103-130106-ladsgroup.json |
[production] |
12:55 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1100', diff saved to https://phabricator.wikimedia.org/P37953 and previous config saved to /var/cache/conftool/dbconfig/20221103-125555-ladsgroup.json |
[production] |
12:46 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P37952 and previous config saved to /var/cache/conftool/dbconfig/20221103-124646-ladsgroup.json |
[production] |
12:46 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P37951 and previous config saved to /var/cache/conftool/dbconfig/20221103-124607-marostegui.json |
[production] |
12:45 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1106', diff saved to https://phabricator.wikimedia.org/P37950 and previous config saved to /var/cache/conftool/dbconfig/20221103-124557-ladsgroup.json |
[production] |
12:45 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2123 (T318605)', diff saved to https://phabricator.wikimedia.org/P37949 and previous config saved to /var/cache/conftool/dbconfig/20221103-124516-ladsgroup.json |
[production] |
12:45 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2123.codfw.wmnet with reason: Maintenance |
[production] |
12:44 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2123.codfw.wmnet with reason: Maintenance |
[production] |
12:44 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2111 (T318605)', diff saved to https://phabricator.wikimedia.org/P37948 and previous config saved to /var/cache/conftool/dbconfig/20221103-124454-ladsgroup.json |
[production] |
12:44 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.misc-clusters.roll-restart-reboot-docker-registry (exit_code=0) rolling restart_daemons on A:docker-registry |
[production] |
12:43 |
<jelto@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on gitlab1004.wikimedia.org with reason: upgrade gitlab1004 to new version |
[production] |
12:43 |
<jelto@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:20:00 on gitlab1004.wikimedia.org with reason: upgrade gitlab1004 to new version |
[production] |
12:41 |
<jmm@cumin2002> |
START - Cookbook sre.misc-clusters.roll-restart-reboot-docker-registry rolling restart_daemons on A:docker-registry |
[production] |
12:40 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1100 (T318605)', diff saved to https://phabricator.wikimedia.org/P37947 and previous config saved to /var/cache/conftool/dbconfig/20221103-124047-ladsgroup.json |
[production] |
12:35 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1025.eqiad.wmnet |
[production] |
12:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2130 (T318955)', diff saved to https://phabricator.wikimedia.org/P37946 and previous config saved to /var/cache/conftool/dbconfig/20221103-123137-ladsgroup.json |
[production] |
12:31 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1174', diff saved to https://phabricator.wikimedia.org/P37945 and previous config saved to /var/cache/conftool/dbconfig/20221103-123101-marostegui.json |
[production] |
12:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1106 (T318955)', diff saved to https://phabricator.wikimedia.org/P37944 and previous config saved to /var/cache/conftool/dbconfig/20221103-123048-ladsgroup.json |
[production] |
12:29 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2111', diff saved to https://phabricator.wikimedia.org/P37943 and previous config saved to /var/cache/conftool/dbconfig/20221103-122944-ladsgroup.json |
[production] |
12:28 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2130 (T318955)', diff saved to https://phabricator.wikimedia.org/P37942 and previous config saved to /var/cache/conftool/dbconfig/20221103-122854-ladsgroup.json |
[production] |
12:28 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2130.codfw.wmnet with reason: Maintenance |
[production] |
12:28 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2130.codfw.wmnet with reason: Maintenance |
[production] |
12:28 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2116 (T318955)', diff saved to https://phabricator.wikimedia.org/P37941 and previous config saved to /var/cache/conftool/dbconfig/20221103-122831-ladsgroup.json |
[production] |
12:27 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host ganeti1025.eqiad.wmnet |
[production] |
12:27 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1106 (T318955)', diff saved to https://phabricator.wikimedia.org/P37940 and previous config saved to /var/cache/conftool/dbconfig/20221103-122709-ladsgroup.json |
[production] |
12:27 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
12:26 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 16:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance |
[production] |
12:26 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1106.eqiad.wmnet with reason: Maintenance |
[production] |
12:26 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db1106.eqiad.wmnet with reason: Maintenance |
[production] |
12:26 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1105:3311 (T318955)', diff saved to https://phabricator.wikimedia.org/P37939 and previous config saved to /var/cache/conftool/dbconfig/20221103-122640-ladsgroup.json |
[production] |
12:16 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
12:16 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
12:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1174 (T321123)', diff saved to https://phabricator.wikimedia.org/P37938 and previous config saved to /var/cache/conftool/dbconfig/20221103-121553-marostegui.json |
[production] |
12:15 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
12:15 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
12:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1100 (T318605)', diff saved to https://phabricator.wikimedia.org/P37937 and previous config saved to /var/cache/conftool/dbconfig/20221103-121458-ladsgroup.json |
[production] |
12:14 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2111', diff saved to https://phabricator.wikimedia.org/P37936 and previous config saved to /var/cache/conftool/dbconfig/20221103-121436-ladsgroup.json |
[production] |
12:14 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1100.eqiad.wmnet with reason: Maintenance |
[production] |
12:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1096:3315 (T318605)', diff saved to https://phabricator.wikimedia.org/P37935 and previous config saved to /var/cache/conftool/dbconfig/20221103-121423-ladsgroup.json |
[production] |
12:13 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P37934 and previous config saved to /var/cache/conftool/dbconfig/20221103-121320-ladsgroup.json |
[production] |
12:11 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1105:3311', diff saved to https://phabricator.wikimedia.org/P37932 and previous config saved to /var/cache/conftool/dbconfig/20221103-121133-ladsgroup.json |
[production] |
12:11 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1025.eqiad.wmnet with OS bullseye |
[production] |
12:08 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
12:08 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
12:07 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
12:06 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
12:05 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |