1-50 of 10000 results (107ms)
2026-06-25 ยง
09:15 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2236.codfw.wmnet with reason: host reimage [production]
09:13 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host kafka-logging2008.codfw.wmnet with OS trixie [production]
09:12 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host kafka-logging2007.codfw.wmnet with OS trixie [production]
09:11 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1221.eqiad.wmnet with reason: host reimage [production]
09:11 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db2236.codfw.wmnet with reason: host reimage [production]
09:08 <elukey@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging2006.codfw.wmnet with reason: host reimage [production]
09:06 <marostegui@cumin1003> conftool action : set/weight=100; selector: name=clouddb1026.eqiad.wmnet [production]
09:06 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1221.eqiad.wmnet with reason: host reimage [production]
09:05 <jforrester@deploy1003> Finished scap sync-world: Backport for [[gerrit:1305599|On AW article deletion, clear all AWArticleStore from sections and metadata (T429873)]], [[gerrit:1305600|AWStorage: Use global stash keys (T430060)]] (duration: 07m 29s) [production]
09:05 <elukey@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2006.codfw.wmnet with reason: host reimage [production]
09:00 <jforrester@deploy1003> jforrester: Continuing with deployment [production]
09:00 <jforrester@deploy1003> jforrester: Backport for [[gerrit:1305599|On AW article deletion, clear all AWArticleStore from sections and metadata (T429873)]], [[gerrit:1305600|AWStorage: Use global stash keys (T430060)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:58 <brouberol@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
08:58 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1305599|On AW article deletion, clear all AWArticleStore from sections and metadata (T429873)]], [[gerrit:1305600|AWStorage: Use global stash keys (T430060)]] [production]
08:57 <brouberol@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
08:57 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
08:56 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
08:55 <marostegui@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db2234.codfw.wmnet with OS trixie [production]
08:54 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db2236.codfw.wmnet with OS trixie [production]
08:53 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2236: Upgrading db2236.codfw.wmnet [production]
08:52 <cwilliams@cumin1003> START - Cookbook sre.mysql.depool depool db2236: Upgrading db2236.codfw.wmnet [production]
08:52 <cwilliams@cumin1003> dbmaint on s4@codfw T429893 [production]
08:52 <cwilliams@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
08:50 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db1221.eqiad.wmnet with OS trixie [production]
08:48 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1221: Upgrading db1221.eqiad.wmnet [production]
08:48 <cwilliams@cumin1003> START - Cookbook sre.mysql.depool depool db1221: Upgrading db1221.eqiad.wmnet [production]
08:47 <cwilliams@cumin1003> dbmaint on s4@eqiad T429893 [production]
08:47 <cwilliams@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
08:47 <elukey@cumin1003> START - Cookbook sre.hosts.reimage for host kafka-logging2006.codfw.wmnet with OS trixie [production]
08:45 <marostegui@cumin1003> conftool action : set/weight=30; selector: name=clouddb1026.eqiad.wmnet [production]
08:44 <cwilliams@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1015,1024-1025].eqiad.wmnet,db1155.eqiad.wmnet with reason: Reimaging db1221 [production]
08:10 <jnuche@deploy1003> Finished deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 deploy to Jenkins primary (duration: 00m 52s) [production]
08:10 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 deploy to Jenkins primary [production]
08:07 <jnuche@deploy1003> Finished deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 retry Jenkins secondary (duration: 00m 53s) [production]
08:07 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 retry Jenkins secondary [production]
08:03 <marostegui> Pool clouddb1026:s1 with a bit of weight T409557 [production]
08:03 <marostegui@cumin1003> conftool action : set/pooled=yes; selector: name=clouddb1026.eqiad.wmnet,service=s1 [production]
08:02 <marostegui@cumin1003> conftool action : set/weight=10; selector: name=clouddb1026.eqiad.wmnet [production]
07:52 <filippo@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:52 <filippo@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Allocate IPs for cloudvirt1077 - filippo@cumin1003" [production]
07:52 <filippo@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Allocate IPs for cloudvirt1077 - filippo@cumin1003" [production]
07:51 <arnaudb@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on releases2003.codfw.wmnet with reason: T410849 [production]
07:47 <filippo@cumin1003> START - Cookbook sre.dns.netbox [production]
07:41 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2160.codfw.wmnet with reason: Upgrading [production]
07:35 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db2234.codfw.wmnet with OS trixie [production]
07:35 <marostegui@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host db2234.codfw.wmnet with OS trixie [production]
07:29 <jnuche@deploy1003> Finished deploy [releng/jenkins-deploy@86ab691] (releasing): T430110 Test on Jenkins secondary (duration: 00m 50s) [production]
07:29 <jnuche@deploy1003> Started deploy [releng/jenkins-deploy@86ab691] (releasing): T430110 Test on Jenkins secondary [production]
07:24 <moritzm> installing nginx security updates [production]
07:20 <dcausse> T423993: dropping ttmserver indices from the cirrussearch opensearch clusters [production]