1151-1200 of 10000 results (89ms)
2024-05-31 ยง
12:15 <dcausse@deploy1002> Started deploy [airflow-dags/search@45de44b]: search: bump rdf-spark-tools to 0.3.141 [production]
12:13 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
12:13 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
12:09 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1209.eqiad.wmnet with reason: host reimage [production]
12:08 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
12:08 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
12:06 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
12:06 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
12:06 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db1209.eqiad.wmnet with reason: host reimage [production]
12:03 <jiji@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2038.codfw.wmnet with reason: host reimage [production]
12:00 <jiji@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on mc2038.codfw.wmnet with reason: host reimage [production]
12:00 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
12:00 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:58 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:58 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:56 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:56 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:54 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:53 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:53 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2197.codfw.wmnet with reason: Maintenance [production]
11:52 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2197.codfw.wmnet with reason: Maintenance [production]
11:52 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189 (T352010)', diff saved to https://phabricator.wikimedia.org/P63764 and previous config saved to /var/cache/conftool/dbconfig/20240531-115244-ladsgroup.json [production]
11:52 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:51 <marostegui@cumin1002> START - Cookbook sre.hosts.reimage for host db1209.eqiad.wmnet with OS bookworm [production]
11:51 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:49 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:49 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:47 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:47 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:46 <jiji@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host mc1039.eqiad.wmnet with OS bookworm [production]
11:45 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:45 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:43 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:43 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:42 <jiji@cumin2002> START - Cookbook sre.hosts.reimage for host mc2038.codfw.wmnet with OS bookworm [production]
11:41 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:41 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:39 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:39 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:37 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:37 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:37 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P63763 and previous config saved to /var/cache/conftool/dbconfig/20240531-113735-ladsgroup.json [production]
11:28 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:27 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:26 <jiji@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc2039.codfw.wmnet with OS bookworm [production]
11:25 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:25 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:24 <logmsgbot> @deploy1002 helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
11:23 <logmsgbot> @deploy1002 helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
11:22 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2189', diff saved to https://phabricator.wikimedia.org/P63762 and previous config saved to /var/cache/conftool/dbconfig/20240531-112227-ladsgroup.json [production]