351-400 of 10000 results (141ms)
2026-05-07 ยง
08:29 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of prometheus4003.ulsfo.wmnet to drbd [production]
08:28 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir4004.ulsfo.wmnet to drbd [production]
08:28 <marostegui@cumin1003> dbctl commit (dc=all): 'Remove db2144 T425522', diff saved to https://phabricator.wikimedia.org/P92389 and previous config saved to /var/cache/conftool/dbconfig/20260507-082822-marostegui.json [production]
08:23 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db2208: After reimage [production]
08:23 <marostegui@cumin1003> END (ERROR) - Cookbook sre.mysql.pool (exit_code=97) pool db2208: After reimage [production]
08:23 <XioNoX> drmrs remove old v6 gateway IP [production]
08:22 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:22 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: drmrs v6 gateway IPs change - ayounsi@cumin1003" [production]
08:22 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db2208: After reimage [production]
08:21 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: drmrs v6 gateway IPs change - ayounsi@cumin1003" [production]
08:17 <ayounsi@cumin1003> START - Cookbook sre.dns.netbox [production]
08:14 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir4004.ulsfo.wmnet to drbd [production]
08:13 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.addnode (exit_code=0) for new host ganeti4008.ulsfo.wmnet to cluster ulsfo02 and group 01 [production]
08:12 <elukey@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifunctions: sync [production]
08:12 <elukey@deploy1003> helmfile [staging] START helmfile.d/services/wikifunctions: sync [production]
08:12 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4008.ulsfo.wmnet to cluster ulsfo02 and group 01 [production]
08:12 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti4008.ulsfo.wmnet [production]
08:04 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti4008.ulsfo.wmnet [production]
08:03 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti4008.ulsfo.wmnet to cluster ulsfo02 and group 01 [production]
08:03 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4008.ulsfo.wmnet to cluster ulsfo02 and group 01 [production]
07:54 <dcausse@deploy1003> Finished scap sync-world: Backport for [[gerrit:1269465|search: add alt. completion indices to test keyword tokenizer (2/2) (T420427)]] (duration: 09m 46s) [production]
07:49 <dcausse@deploy1003> dcausse: Continuing with deployment [production]
07:46 <dcausse@deploy1003> dcausse: Backport for [[gerrit:1269465|search: add alt. completion indices to test keyword tokenizer (2/2) (T420427)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:44 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of netflow4003.ulsfo.wmnet to drbd [production]
07:44 <dcausse@deploy1003> Started scap sync-world: Backport for [[gerrit:1269465|search: add alt. completion indices to test keyword tokenizer (2/2) (T420427)]] [production]
07:32 <moritzm> installing apache2 security updates [production]
07:30 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of netflow4003.ulsfo.wmnet to drbd [production]
07:27 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM testvm2005.codfw.wmnet [production]
07:23 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM testvm2005.codfw.wmnet [production]
07:02 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of ncredir4003.ulsfo.wmnet to drbd [production]
06:48 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir4003.ulsfo.wmnet to drbd [production]
06:46 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.changedisk (exit_code=99) for changing disk type of ncredir4003.ulsfo.wmnet to drbd [production]
06:46 <jmm@cumin2002> START - Cookbook sre.ganeti.changedisk for changing disk type of ncredir4003.ulsfo.wmnet to drbd [production]
06:42 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti4006.ulsfo.wmnet to cluster ulsfo02 and group 01 [production]
06:41 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti4006.ulsfo.wmnet to cluster ulsfo02 and group 01 [production]
06:39 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db2207: after reimage to trixie [production]
05:54 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db2207: after reimage to trixie [production]
05:51 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2207.codfw.wmnet with OS trixie [production]
05:33 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2208.codfw.wmnet with OS trixie [production]
05:28 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2207.codfw.wmnet with reason: host reimage [production]
05:23 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db2207.codfw.wmnet with reason: host reimage [production]
05:09 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2208.codfw.wmnet with reason: host reimage [production]
05:04 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db2208.codfw.wmnet with reason: host reimage [production]
05:03 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db2207.codfw.wmnet with OS trixie [production]
05:01 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2207: Reimage to Trixie [production]
05:01 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool db2207: Reimage to Trixie [production]
05:01 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2207.codfw.wmnet with reason: Reimage to Trixie [production]
04:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Depool db2207 T424848', diff saved to https://phabricator.wikimedia.org/P92383 and previous config saved to /var/cache/conftool/dbconfig/20260507-045219-marostegui.json [production]
04:51 <marostegui@cumin1003> dbctl commit (dc=all): 'Promote db2204 to s2 primary T424848', diff saved to https://phabricator.wikimedia.org/P92382 and previous config saved to /var/cache/conftool/dbconfig/20260507-045141-marostegui.json [production]
04:51 <marostegui> Starting s2 codfw failover from db2207 to db2204 - T424848 [production]