1451-1500 of 10000 results (22ms)
2025-09-22 §
16:41 <andrew@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet'] [production]
16:41 <James_F> Zuul: [mediawiki/extensions/ReaderExperiments] Add WikimediaMessages dependency [releng]
16:37 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api [tools]
16:32 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [tools]
16:32 <andrew@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcephosd1025.eqiad.wmnet'] [production]
16:31 <andrew@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:28 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
16:22 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1020.eqiad.wmnet with reason: host reimage [production]
16:16 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api [toolsbeta]
16:12 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1025.eqiad.wmnet with OS bookworm [production]
16:11 <dcaro@cloudcumin1001> START - Cookbook wmcs.toolforge.component.deploy for component components-api [toolsbeta]
16:10 <jhathaway@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on sretest2001.codfw.wmnet with reason: T383173 [production]
16:05 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1020.eqiad.wmnet with OS bookworm [production]
16:02 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
16:01 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1019.eqiad.wmnet with OS bookworm [production]
16:01 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
15:59 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.ceph.osd.reactivate (exit_code=99) [admin]
15:59 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
15:59 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.ceph.osd.reactivate (exit_code=0) [admin]
15:59 <andrew@cloudcumin1001> START - Cookbook wmcs.ceph.osd.reactivate [admin]
15:45 <toyofuku@deploy1003> Finished scap sync-world: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] (duration: 11m 35s) [production]
15:43 <andrew@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage [production]
15:40 <toyofuku@deploy1003> jdlrobson, toyofuku: Continuing with sync [production]
15:39 <andrew@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1019.eqiad.wmnet with reason: host reimage [production]
15:37 <toyofuku@deploy1003> jdlrobson, toyofuku: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:33 <toyofuku@deploy1003> Started scap sync-world: Backport for [[gerrit:1187052|Enable search recommendation on Wikipedia (T402048)]] [production]
15:22 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1019.eqiad.wmnet with OS bookworm [production]
15:19 <pt1979@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on fasw2-c8a-codfw,fasw2-c8b-codfw with reason: pfw1-codfw relocation [production]
15:17 <pt1979@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pfw1-codfw with reason: pfw1-codfw relocation [production]
15:15 <moritzm> installing clamav security updates [production]
15:11 <pt1979@cumin2002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on ‘pfw1-codfw’ with reason: ‘pfw1 [production]
14:32 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
14:32 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
14:31 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/rdf-streaming-updater: apply [production]
14:31 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/rdf-streaming-updater: apply [production]
14:22 <brouberol@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
14:21 <brouberol@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
14:20 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:20 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:15 <btullis> restarting the hadoop-yarn-resourcemanager.service on an-master1003 and then an-master1004 for T404871 [analytics]
14:12 <brouberol@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
14:11 <brouberol@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
14:10 <brouberol@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
14:09 <brouberol@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
14:08 <brouberol@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:07 <brouberol@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
14:07 <Lucas_WMDE> ssh integration-castor05.integration.eqiad1.wikimedia.cloud sudo -u jenkins-deploy rm -rf /srv/castor/castor-mw-ext-and-skins/master/mwext-node20-rundoc # fix failure seen in mwext-node20-rundoc 18248 + 18249 [releng]
13:58 <sukhe> delete list: sectrainings@lists.wikimedia.org [no archives, project obsolete since 2022] [production]
13:54 <phuedx@deploy1003> Finished scap sync-world: Backport for [[gerrit:1190280|Revert^2 "WikimediaEvents: Disable client-side error logging for certain wikis"]] (duration: 12m 25s) [production]
13:49 <phuedx@deploy1003> phuedx: Continuing with sync [production]