451-500 of 10000 results (113ms)
2025-03-27 ยง
10:12 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
10:11 <joal@deploy1003> Started deploy [analytics/refinery@bc1b576] (hadoop-test): Analytics webrequest_frontend update TEST [analytics/refinery@bc1b5761] [production]
10:10 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
10:10 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
10:09 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti4005.ulsfo.wmnet with reason: host reimage [production]
10:08 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
10:08 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
10:06 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti4005.ulsfo.wmnet with reason: host reimage [production]
10:01 <dcausse@deploy1003> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
10:01 <dcausse@deploy1003> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
10:01 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve-ctrl2002.codfw.wmnet with reason: host reimage [production]
09:58 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve-ctrl2002.codfw.wmnet with reason: host reimage [production]
09:48 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti4005.ulsfo.wmnet with OS bookworm [production]
09:45 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:45 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:45 <jmm@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ganeti4005.ulsfo.wmnet [production]
09:45 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4005.ulsfo.wmnet [production]
09:41 <aklapper@deploy1003> rebuilt and synchronized wikiversions files: group2 to 1.44.0-wmf.22 refs T386217 [production]
09:41 <elukey@cumin1002> START - Cookbook sre.hosts.reimage for host ml-serve-ctrl2002.codfw.wmnet with OS bookworm [production]
09:33 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4005.ulsfo.wmnet [production]
09:32 <dcausse@deploy1003> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
09:32 <dcausse@deploy1003> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
09:30 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:29 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
09:29 <godog> silence LogstashKafkaConsumerLag and LogstashIndexingFailures for today for 1d - T390140 [production]
09:29 <dcausse@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
09:29 <dcausse@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
09:28 <dcausse@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
09:28 <dcausse@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
09:27 <dcausse@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
09:27 <dcausse@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
09:14 <aklapper@deploy1003> rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.22 refs T386217 [production]
09:01 <aklapper@deploy1003> Finished scap sync-world: Backport for [[gerrit:1131348|Instead of calling deprecated parserOptions(), parse content ourselves (T390032)]] (duration: 12m 24s) [production]
08:54 <aklapper@deploy1003> aklapper, jforrester: Continuing with sync [production]
08:53 <aklapper@deploy1003> aklapper, jforrester: Backport for [[gerrit:1131348|Instead of calling deprecated parserOptions(), parse content ourselves (T390032)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:50 <jmm@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ganeti4005.ulsfo.wmnet [production]
08:48 <aklapper@deploy1003> Started scap sync-world: Backport for [[gerrit:1131348|Instead of calling deprecated parserOptions(), parse content ourselves (T390032)]] [production]
08:44 <aklapper@deploy1003> Finished scap sync-world: Backport for [[gerrit:1131018|Allow arwikisource bureaucrat to manage "import" (T389952)]] (duration: 13m 28s) [production]
08:42 <jmm@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ganeti4005.ulsfo.wmnet with reason: remove from cluster for reimage [production]
08:41 <fabfur@cumin1002> conftool action : set/pooled=yes; selector: name=cp7001.magru.wmnet [production]
08:41 <fabfur@cumin1002> conftool action : set/pooled=yes; selector: name=cp7009.magru.wmnet [production]
08:41 <fabfur> repooling cp7001 and cp7009 with new TLS certificate path (T384227) [production]
08:40 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4005.ulsfo.wmnet [production]
08:37 <aklapper@deploy1003> hubaishan, aklapper: Continuing with sync [production]
08:37 <aklapper@deploy1003> hubaishan, aklapper: Backport for [[gerrit:1131018|Allow arwikisource bureaucrat to manage "import" (T389952)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:30 <aklapper@deploy1003> Started scap sync-world: Backport for [[gerrit:1131018|Allow arwikisource bureaucrat to manage "import" (T389952)]] [production]
08:29 <aklapper@deploy1003> Finished scap sync-world: Backport for [[gerrit:1131410|Make officewiki readonly after moving flow pages (T380909)]] (duration: 14m 14s) [production]
08:28 <fabfur@cumin1002> conftool action : set/pooled=no; selector: name=cp7009.magru.wmnet [production]
08:28 <fabfur@cumin1002> conftool action : set/pooled=no; selector: name=cp7001.magru.wmnet [production]
08:28 <fabfur> depooling cp7001 and cp7009 to apply https://gerrit.wikimedia.org/r/c/operations/puppet/+/1131052 (T384227) [production]