2751-2800 of 10000 results (132ms)
2025-09-22 §
17:55 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1024.eqiad.wmnet with reason: host reimage [production]
17:38 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1024.eqiad.wmnet with OS bookworm [production]
17:36 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1023.eqiad.wmnet with OS bookworm [production]
17:24 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply [production]
17:23 <sfaci@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply [production]
17:18 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage [production]
17:11 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1023.eqiad.wmnet with reason: host reimage [production]
16:54 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1023.eqiad.wmnet with OS bookworm [production]
16:48 <andrew@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:45 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1020.eqiad.wmnet with OS bookworm [production]
16:43 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:41 <andrew@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet'] [production]
16:32 <andrew@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet'] [production]
16:31 <andrew@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:28 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
16:22 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
16:12 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:10 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on sretest2001.codfw.wmnet with reason: T383173 [production]
16:05 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1020.eqiad.wmnet with OS bookworm [production]
16:01 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1019.eqiad.wmnet with OS bookworm [production]
15:45 <toyofuku@deploy1003> Finished scap sync-world: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] (duration: 11m 35s) [production]
15:43 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage [production]
15:40 <toyofuku@deploy1003> jdlrobson, toyofuku: Continuing with sync [production]
15:39 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage [production]
15:37 <toyofuku@deploy1003> jdlrobson, toyofuku: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:33 <toyofuku@deploy1003> Started scap sync-world: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] [production]
15:22 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1019.eqiad.wmnet with OS bookworm [production]
15:19 <pt1979@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on fasw2-c8a-codfw,fasw2-c8b-codfw with reason: pfw1-codfw relocation [production]
15:17 <pt1979@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pfw1-codfw with reason: pfw1-codfw relocation [production]
15:15 <moritzm> installing clamav security updates [production]
15:11 <pt1979@cumin2002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on ‘pfw1-codfw’ with reason: ‘pfw1 [production]
14:32 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
14:32 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
14:31 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
14:31 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply [production]
14:22 <brouberol@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
14:21 <brouberol@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
14:20 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:20 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:12 <brouberol@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
14:11 <brouberol@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
14:10 <brouberol@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
14:09 <brouberol@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
14:08 <brouberol@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:07 <brouberol@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
13:58 <sukhe> delete list: sectrainings@lists.wikimedia.org [no archives, project obsolete since 2022] [production]
13:54 <phuedx@deploy1003> Finished scap sync-world: Backport for [[gerrit:1190280|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]] (duration: 12m 25s) [production]
13:49 <phuedx@deploy1003> phuedx: Continuing with sync [production]
13:48 <phuedx@deploy1003> phuedx: Backport for [[gerrit:1190280|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:42 <phuedx@deploy1003> Started scap sync-world: Backport for [[gerrit:1190280|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]] [production]