101-150 of 10000 results (119ms)
2026-04-28 ยง
16:22 <btullis@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/opensearch-test: apply [production]
16:21 <btullis@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/opensearch-test: apply [production]
15:52 <gkyziridis@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
15:34 <herron@cumin1003> END (PASS) - Cookbook sre.kafka.roll-restart-reboot-brokers (exit_code=0) rolling restart_daemons on A:kafka-logging-eqiad [production]
15:28 <otto@deploy1003> Finished scap sync-world: Backport for [[gerrit:1278476|EventStreamConfig - Declare .v1 streams for html content and feature counts (T423920)]] (duration: 08m 21s) [production]
15:24 <otto@deploy1003> otto: Continuing with deployment [production]
15:21 <otto@deploy1003> otto: Backport for [[gerrit:1278476|EventStreamConfig - Declare .v1 streams for html content and feature counts (T423920)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
15:19 <otto@deploy1003> Started scap sync-world: Backport for [[gerrit:1278476|EventStreamConfig - Declare .v1 streams for html content and feature counts (T423920)]] [production]
15:19 <klausman@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:19 <klausman@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
15:19 <klausman@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
15:18 <klausman@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
15:18 <klausman@deploy1003> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. [production]
15:17 <klausman@deploy1003> helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. [production]
15:16 <klausman@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:16 <klausman@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. [production]
15:14 <herron@cumin1003> START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-logging-eqiad [production]
15:05 <brennen@deploy1003> Finished deploy [phabricator/deployment@ce0b865]: deploy phab1004 for T424656 (duration: 00m 52s) [production]
15:04 <brennen@deploy1003> Started deploy [phabricator/deployment@ce0b865]: deploy phab1004 for T424656 [production]
15:04 <brennen@deploy1003> Finished deploy [phabricator/deployment@ce0b865]: deploy phab2002 for T424656 (duration: 00m 44s) [production]
15:03 <brennen@deploy1003> Started deploy [phabricator/deployment@ce0b865]: deploy phab2002 for T424656 [production]
15:02 <jelto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab1004.eqiad.wmnet with reason: Phabricator deploy [production]
15:02 <jelto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab2002.codfw.wmnet with reason: Phabricator deploy [production]
14:47 <herron@cumin1003> END (PASS) - Cookbook sre.kafka.change-confluent-distro-version (exit_code=0) Change Confluent distribution for Kafka A:kafka-logging-eqiad cluster: Change Confluent distribution. [production]
14:37 <aokoth@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host phab2003.codfw.wmnet with OS bullseye [production]
14:26 <herron@cumin1003> START - Cookbook sre.kafka.change-confluent-distro-version Change Confluent distribution for Kafka A:kafka-logging-eqiad cluster: Change Confluent distribution. [production]
13:58 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T419961)', diff saved to https://phabricator.wikimedia.org/P91807 and previous config saved to /var/cache/conftool/dbconfig/20260428-135847-fceratto.json [production]
13:54 <klausman@deploy1003> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:54 <klausman@deploy1003> helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. [production]
13:52 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1259: after reimage to trixie [production]
13:49 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db2226: after reimage to trixie [production]
13:49 <aokoth@cumin1003> START - Cookbook sre.hosts.reimage for host phab2003.codfw.wmnet with OS bullseye [production]
13:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P91804 and previous config saved to /var/cache/conftool/dbconfig/20260428-134838-fceratto.json [production]
13:43 <Lucas_WMDE> UTC afternoon backport+config window done [production]
13:42 <lucaswerkmeister-wmde@deploy1003> Finished scap sync-world: Backport for [[gerrit:1278435|testwiki: allow sysops to add/remove electionadmin (T423962)]] (duration: 07m 37s) [production]
13:38 <lucaswerkmeister-wmde@deploy1003> lucaswerkmeister-wmde, novemlinguae: Continuing with deployment [production]
13:38 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P91803 and previous config saved to /var/cache/conftool/dbconfig/20260428-133830-fceratto.json [production]
13:36 <lucaswerkmeister-wmde@deploy1003> lucaswerkmeister-wmde, novemlinguae: Backport for [[gerrit:1278435|testwiki: allow sysops to add/remove electionadmin (T423962)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:34 <lucaswerkmeister-wmde@deploy1003> Started scap sync-world: Backport for [[gerrit:1278435|testwiki: allow sysops to add/remove electionadmin (T423962)]] [production]
13:31 <stran@deploy1003> Finished scap sync-world: Backport for [[gerrit:1278381|Update action parameter for bulk blocking instrumented events (T420517)]], [[gerrit:1278442|Resources: Define required message for 'oojs-ui-windows' module (T424653)]] (duration: 15m 10s) [production]
13:28 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dborch1001.wikimedia.org [production]
13:28 <fceratto@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:28 <fceratto@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dborch1001.wikimedia.org decommissioned, removing all IPs except the asset tag one - fceratto@cumin1003" [production]
13:28 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1170 (T419961)', diff saved to https://phabricator.wikimedia.org/P91800 and previous config saved to /var/cache/conftool/dbconfig/20260428-132822-fceratto.json [production]
13:26 <fceratto@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: dborch1001.wikimedia.org decommissioned, removing all IPs except the asset tag one - fceratto@cumin1003" [production]
13:26 <jynus@cumin1003> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db2141.codfw.wmnet [production]
13:26 <jynus@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:25 <stran@deploy1003> dreamyjazz, stran: Continuing with deployment [production]
13:23 <jynus@cumin1003> START - Cookbook sre.dns.netbox [production]
13:22 <stran@deploy1003> dreamyjazz, stran: Backport for [[gerrit:1278381|Update action parameter for bulk blocking instrumented events (T420517)]], [[gerrit:1278442|Resources: Define required message for 'oojs-ui-windows' module (T424653)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]