|
2026-04-30
§
|
| 11:50 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P92042 and previous config saved to /var/cache/conftool/dbconfig/20260430-115028-fceratto.json |
[production] |
| 11:50 |
<jnuche@deploy1003> |
Finished deploy [releng/jenkins-deploy@fb711fc] (releasing): Update production releases Jenkins (duration: 01m 04s) |
[production] |
| 11:49 |
<jnuche@deploy1003> |
Started deploy [releng/jenkins-deploy@fb711fc] (releasing): Update production releases Jenkins |
[production] |
| 11:47 |
<jnuche@deploy1003> |
Finished deploy [releng/jenkins-deploy@fb711fc] (releasing): Update backup releases Jenkins (duration: 00m 33s) |
[production] |
| 11:47 |
<jnuche@deploy1003> |
Started deploy [releng/jenkins-deploy@fb711fc] (releasing): Update backup releases Jenkins |
[production] |
| 11:47 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2203 (T419961)', diff saved to https://phabricator.wikimedia.org/P92041 and previous config saved to /var/cache/conftool/dbconfig/20260430-114703-fceratto.json |
[production] |
| 11:46 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host wikikube-worker1378.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:46 |
<jclark@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wikikube-worker1378.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:45 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on bast5005.wikimedia.org with reason: host reimage |
[production] |
| 11:44 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wikikube-worker1377.eqiad.wmnet with OS trixie |
[production] |
| 11:42 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker1377.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:40 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on bast5005.wikimedia.org with reason: host reimage |
[production] |
| 11:40 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1159 (T419635)', diff saved to https://phabricator.wikimedia.org/P92040 and previous config saved to /var/cache/conftool/dbconfig/20260430-114020-fceratto.json |
[production] |
| 11:39 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db2203 (T419961)', diff saved to https://phabricator.wikimedia.org/P92039 and previous config saved to /var/cache/conftool/dbconfig/20260430-113948-fceratto.json |
[production] |
| 11:39 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2203.codfw.wmnet with reason: Maintenance |
[production] |
| 11:39 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db1159 (T419635)', diff saved to https://phabricator.wikimedia.org/P92038 and previous config saved to /var/cache/conftool/dbconfig/20260430-113910-fceratto.json |
[production] |
| 11:39 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool pool db2205: after reimage to trixie |
[production] |
| 11:39 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1159.eqiad.wmnet with reason: Maintenance |
[production] |
| 11:38 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host wikikube-worker1378.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:37 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188 (T419961)', diff saved to https://phabricator.wikimedia.org/P92036 and previous config saved to /var/cache/conftool/dbconfig/20260430-113704-fceratto.json |
[production] |
| 11:36 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:35 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:35 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.provision for host wikikube-worker1377.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:33 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2205.codfw.wmnet with OS trixie |
[production] |
| 11:28 |
<elukey@cumin1003> |
END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:27 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host kafka-logging1006.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
| 11:27 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for wikikube-worker1039.eqiad.wmnet |
[production] |
| 11:27 |
<jayme@cumin1003> |
START - Cookbook sre.hosts.remove-downtime for wikikube-worker1039.eqiad.wmnet |
[production] |
| 11:27 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker1039.eqiad.wmnet |
[production] |
| 11:27 |
<jayme@cumin1003> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker1039.eqiad.wmnet |
[production] |
| 11:26 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P92035 and previous config saved to /var/cache/conftool/dbconfig/20260430-112656-fceratto.json |
[production] |
| 11:20 |
<moritzm> |
installing policykit-1 security updates |
[production] |
| 11:19 |
<elukey> |
upgrade spicerack on cumin hosts to 12.5.0 |
[production] |
| 11:16 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188', diff saved to https://phabricator.wikimedia.org/P92034 and previous config saved to /var/cache/conftool/dbconfig/20260430-111648-fceratto.json |
[production] |
| 11:10 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2205.codfw.wmnet with reason: host reimage |
[production] |
| 11:06 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db2205.codfw.wmnet with reason: host reimage |
[production] |
| 11:06 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2188 (T419961)', diff saved to https://phabricator.wikimedia.org/P92033 and previous config saved to /var/cache/conftool/dbconfig/20260430-110640-fceratto.json |
[production] |
| 11:02 |
<atsuko@deploy1003> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
| 11:01 |
<atsuko@deploy1003> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
| 11:00 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
| 10:59 |
<atsuko@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
| 10:59 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db2188 (T419961)', diff saved to https://phabricator.wikimedia.org/P92032 and previous config saved to /var/cache/conftool/dbconfig/20260430-105924-fceratto.json |
[production] |
| 10:59 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2188.codfw.wmnet with reason: Maintenance |
[production] |
| 10:58 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2176 (T419961)', diff saved to https://phabricator.wikimedia.org/P92031 and previous config saved to /var/cache/conftool/dbconfig/20260430-105854-fceratto.json |
[production] |
| 10:48 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host db2205.codfw.wmnet with OS trixie |
[production] |
| 10:48 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P92030 and previous config saved to /var/cache/conftool/dbconfig/20260430-104846-fceratto.json |
[production] |
| 10:47 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2205: Reimage to Trixie |
[production] |
| 10:47 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.depool depool db2205: Reimage to Trixie |
[production] |
| 10:46 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5:00:00 on db2205.codfw.wmnet with reason: Reimage to Trixie |
[production] |
| 10:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti4008.ulsfo.wmnet |
[production] |