2151-2200 of 10000 results (121ms)
2024-04-03 §
05:46 <marostegui@cumin1002> dbctl commit (dc=all): 'db1222 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P59241 and previous config saved to /var/cache/conftool/dbconfig/20240403-054641-root.json [production]
05:44 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db2148.codfw.wmnet with OS bookworm [production]
05:43 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2148 T361543', diff saved to https://phabricator.wikimedia.org/P59240 and previous config saved to /var/cache/conftool/dbconfig/20240403-054310-root.json [production]
05:28 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1222.eqiad.wmnet with reason: host reimage [production]
05:25 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1222.eqiad.wmnet with reason: host reimage [production]
05:13 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db1222.eqiad.wmnet with OS bookworm [production]
05:11 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db1222 T361543', diff saved to https://phabricator.wikimedia.org/P59239 and previous config saved to /var/cache/conftool/dbconfig/20240403-051149-root.json [production]
05:10 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1167 (T356166)', diff saved to https://phabricator.wikimedia.org/P59238 and previous config saved to /var/cache/conftool/dbconfig/20240403-051029-marostegui.json [production]
05:10 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
05:10 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 16:00:00 on clouddb[1016,1020-1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
05:10 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
05:09 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
01:58 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
01:58 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
01:51 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
01:51 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
01:40 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-hd2002.codfw.wmnet with OS bookworm [production]
01:19 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-hd2002.codfw.wmnet with reason: host reimage [production]
01:15 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logging-hd2002.codfw.wmnet with reason: host reimage [production]
01:06 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
01:06 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
01:00 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
01:00 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:46 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logging-hd2002.codfw.wmnet with OS bookworm [production]
00:44 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:44 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:43 <cwhite@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host logging-hd2002.codfw.wmnet with OS bookworm [production]
00:37 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:36 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:30 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:30 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:25 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logging-hd2003.codfw.wmnet with OS bookworm [production]
00:25 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:25 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:23 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:23 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:17 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:17 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:13 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:13 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:07 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
00:07 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
00:03 <cwhite@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-hd2003.codfw.wmnet with reason: host reimage [production]
2024-04-02 §
23:59 <cwhite@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on logging-hd2003.codfw.wmnet with reason: host reimage [production]
23:59 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
23:59 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
23:50 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logging-hd2002.codfw.wmnet with OS bookworm [production]
23:48 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
23:48 <logmsgbot> @deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
23:42 <logmsgbot> @deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]