251-300 of 10000 results (119ms)
2026-06-10 ยง
07:23 <fceratto@cumin1003> START - Cookbook sre.hosts.reimage for host db1215.eqiad.wmnet with OS trixie [production]
07:23 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
07:22 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1215.eqiad.wmnet with reason: Reimage [production]
07:21 <fceratto@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
07:21 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
07:20 <fceratto@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
07:20 <fceratto@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
07:17 <atsuko@deploy1003> atsuko: Backport for [[gerrit:1299556|ElasticSearchTtmServer: drop include_type_name and support int replicas (T428168)]], [[gerrit:1299561|ElasticSearchTtmServer: clean stale _doc usage and version error output (T428168)]], [[gerrit:1299529|translate: adding separate read/write endpoints (T425377)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be veri [production]
07:16 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
07:15 <atsuko@deploy1003> Started scap sync-world: Backport for [[gerrit:1299556|ElasticSearchTtmServer: drop include_type_name and support int replicas (T428168)]], [[gerrit:1299561|ElasticSearchTtmServer: clean stale _doc usage and version error output (T428168)]], [[gerrit:1299529|translate: adding separate read/write endpoints (T425377)]] [production]
07:14 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
07:12 <atsukoito> backporting extensions/Translate to wmf/1.47.0-wmf.5 and applying the config [production]
07:12 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
07:11 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
07:11 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
06:45 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti5004.eqsin.wmnet [production]
06:45 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti5004.eqsin.wmnet [production]
05:43 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti5004.eqsin.wmnet [production]
05:43 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti5004.eqsin.wmnet [production]
05:42 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti5004.eqsin.wmnet [production]
05:41 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti5004.eqsin.wmnet [production]
02:07 <mwpresync@deploy1003> Finished scap build-images: Publishing wmf/next image (duration: 06m 47s) [production]
02:07 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main1008.eqiad.wmnet with OS trixie [production]
02:03 <jasmine@deploy1003> helmfile [eqiad] DONE helmfile.d/services/eventgate-main: sync [production]
02:02 <jasmine@deploy1003> helmfile [eqiad] START helmfile.d/services/eventgate-main: sync [production]
02:00 <mwpresync@deploy1003> Started scap build-images: Publishing wmf/next image [production]
01:52 <jasmine@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
01:51 <jasmine@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
01:51 <jasmine@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
01:50 <jasmine@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
01:50 <jasmine@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
01:49 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-main1008.eqiad.wmnet with reason: host reimage [production]
01:49 <jasmine@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
01:49 <jasmine@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
01:49 <jasmine@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
01:49 <jasmine@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
01:48 <jasmine@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
01:48 <jasmine@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
01:47 <jasmine@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
01:47 <jasmine@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
01:46 <jasmine@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
01:46 <jasmine@deploy1003> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
01:45 <jasmine@deploy1003> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
01:45 <jasmine@deploy1003> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
01:45 <jasmine@deploy1003> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
01:45 <jasmine@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
01:44 <jasmine@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
01:44 <jasmine@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
01:43 <jasmine@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
01:43 <jasmine@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-main1008.eqiad.wmnet with reason: host reimage [production]