|
2025-10-08
ยง
|
| 09:19 |
<godog> |
shut down nfs while investigating T406688 |
[toolsbeta] |
| 09:14 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component builds-builder |
[tools] |
| 09:11 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component jobs-api |
[toolsbeta] |
| 09:08 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component builds-builder |
[tools] |
| 09:08 |
<topranks> |
disable BGP to asw*-esams from cr1-esams as the CR external links are also down |
[production] |
| 09:02 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: depool site esams [reason: no reason specified, ] |
[production] |
| 09:02 |
<Emperor> |
depool esams |
[production] |
| 09:02 |
<mvernon@cumin1002> |
START - Cookbook sre.dns.admin DNS admin: depool site esams [reason: no reason specified, ] |
[production] |
| 09:00 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component builds-builder |
[toolsbeta] |
| 08:55 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component builds-builder |
[toolsbeta] |
| 08:52 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.pool es2027 gradually with 4 steps - Pool es2027.codfw.wmnet in after cloning |
[production] |
| 08:50 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P83669 and previous config saved to /var/cache/conftool/dbconfig/20251008-085005-root.json |
[production] |
| 08:44 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2058.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 08:35 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P83667 and previous config saved to /var/cache/conftool/dbconfig/20251008-083459-root.json |
[production] |
| 08:33 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2058.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 08:31 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2057.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 08:30 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api |
[tools] |
| 08:24 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[tools] |
| 08:21 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2057.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 08:19 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P83666 and previous config saved to /var/cache/conftool/dbconfig/20251008-081953-root.json |
[production] |
| 08:14 |
<jnuche@deploy2002> |
rebuilt and synchronized wikiversions files: group1 to 1.45.0-wmf.22 refs T405678 |
[production] |
| 08:07 |
<dcaro@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.component.deploy (exit_code=0) for component components-api |
[toolsbeta] |
| 08:04 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db2172 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P83665 and previous config saved to /var/cache/conftool/dbconfig/20251008-080448-root.json |
[production] |
| 08:03 |
<slyngshede@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host idp-test2005.wikimedia.org with OS trixie |
[production] |
| 08:02 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2055.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 08:01 |
<dcaro@cloudcumin1001> |
START - Cookbook wmcs.toolforge.component.deploy for component components-api |
[toolsbeta] |
| 08:00 |
<moritzm> |
installing libxml2 security updates |
[production] |
| 07:56 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db2172 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83664 and previous config saved to /var/cache/conftool/dbconfig/20251008-075612-marostegui.json |
[production] |
| 07:56 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2172.codfw.wmnet with reason: Maintenance |
[production] |
| 07:52 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2055.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:49 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2054.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:47 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2054.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:46 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2053.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:44 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2053.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:37 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2052.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:27 |
<stevemunene@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wikidata: apply |
[production] |
| 07:22 |
<slyngshede@cumin1003> |
START - Cookbook sre.hosts.reimage for host idp-test2005.wikimedia.org with OS trixie |
[production] |
| 07:21 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.clone_es of es1030.eqiad.wmnet onto es1053.eqiad.wmnet |
[production] |
| 07:17 |
<stevemunene@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wikidata: apply |
[production] |
| 07:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool es1030 T406488', diff saved to https://phabricator.wikimedia.org/P83663 and previous config saved to /var/cache/conftool/dbconfig/20251008-071656-marostegui.json |
[production] |
| 07:16 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2052.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:15 |
<elukey@cumin1003> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host cp2051.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 07:05 |
<elukey@cumin1003> |
START - Cookbook sre.hosts.provision for host cp2051.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED |
[production] |
| 06:57 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1018.eqiad.wmnet with OS bullseye |
[production] |
| 06:55 |
<filippo@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-13, tools-k8s-worker-nfs-71 |
[tools] |
| 06:55 |
<slyngshede@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host idp-test2005.wikimedia.org with OS trixie |
[production] |
| 06:53 |
<slyngshede@cumin1003> |
START - Cookbook sre.hosts.reimage for host idp-test2005.wikimedia.org with OS trixie |
[production] |
| 06:43 |
<filippo@cloudcumin1001> |
START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-13, tools-k8s-worker-nfs-71 |
[tools] |
| 06:31 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.clone_es of es1027.eqiad.wmnet onto es1050.eqiad.wmnet |
[production] |
| 06:29 |
<moritzm> |
installing openssl security updates |
[production] |