2025-07-24
ยง
|
18:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P79870 and previous config saved to /var/cache/conftool/dbconfig/20250724-180258-root.json |
[production] |
17:58 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1025.eqiad.wmnet with reason: host reimage |
[production] |
17:52 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2239.codfw.wmnet with reason: Maintenance |
[production] |
17:52 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1025.eqiad.wmnet with reason: host reimage |
[production] |
17:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227 (T399728)', diff saved to https://phabricator.wikimedia.org/P79869 and previous config saved to /var/cache/conftool/dbconfig/20250724-175242-fceratto.json |
[production] |
17:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P79868 and previous config saved to /var/cache/conftool/dbconfig/20250724-174752-root.json |
[production] |
17:38 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.reimage for host logstash1025.eqiad.wmnet with OS bookworm |
[production] |
17:37 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227', diff saved to https://phabricator.wikimedia.org/P79867 and previous config saved to /var/cache/conftool/dbconfig/20250724-173734-fceratto.json |
[production] |
17:34 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1024.eqiad.wmnet with OS bookworm |
[production] |
17:23 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
17:22 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227', diff saved to https://phabricator.wikimedia.org/P79866 and previous config saved to /var/cache/conftool/dbconfig/20250724-172227-fceratto.json |
[production] |
17:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1202 (T399249)', diff saved to https://phabricator.wikimedia.org/P79865 and previous config saved to /var/cache/conftool/dbconfig/20250724-172140-marostegui.json |
[production] |
17:21 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1202.eqiad.wmnet with reason: Maintenance |
[production] |
17:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194 (T399249)', diff saved to https://phabricator.wikimedia.org/P79864 and previous config saved to /var/cache/conftool/dbconfig/20250724-172117-marostegui.json |
[production] |
17:17 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host es2036 |
[production] |
17:17 |
<jhancock@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host es2036 |
[production] |
17:16 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
17:14 |
<jhancock@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
17:07 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227 (T399728)', diff saved to https://phabricator.wikimedia.org/P79863 and previous config saved to /var/cache/conftool/dbconfig/20250724-170719-fceratto.json |
[production] |
17:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P79862 and previous config saved to /var/cache/conftool/dbconfig/20250724-170608-marostegui.json |
[production] |
16:53 |
<hnowlan> |
delete thumbor pod where all instances displayed signs of T374350 |
[production] |
16:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2227 (T399728)', diff saved to https://phabricator.wikimedia.org/P79860 and previous config saved to /var/cache/conftool/dbconfig/20250724-165228-fceratto.json |
[production] |
16:52 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2227.codfw.wmnet with reason: Maintenance |
[production] |
16:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209 (T399728)', diff saved to https://phabricator.wikimedia.org/P79859 and previous config saved to /var/cache/conftool/dbconfig/20250724-165205-fceratto.json |
[production] |
16:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P79858 and previous config saved to /var/cache/conftool/dbconfig/20250724-165100-marostegui.json |
[production] |
16:48 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1024.eqiad.wmnet with reason: host reimage |
[production] |
16:43 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1024.eqiad.wmnet with reason: host reimage |
[production] |
16:36 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209', diff saved to https://phabricator.wikimedia.org/P79857 and previous config saved to /var/cache/conftool/dbconfig/20250724-163658-fceratto.json |
[production] |
16:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194 (T399249)', diff saved to https://phabricator.wikimedia.org/P79856 and previous config saved to /var/cache/conftool/dbconfig/20250724-163553-marostegui.json |
[production] |
16:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es2036 T399927', diff saved to https://phabricator.wikimedia.org/P79855 and previous config saved to /var/cache/conftool/dbconfig/20250724-163439-root.json |
[production] |
16:33 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2036.codfw.wmnet with reason: Maintenance |
[production] |
16:27 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.reimage for host logstash1024.eqiad.wmnet with OS bookworm |
[production] |
16:21 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209', diff saved to https://phabricator.wikimedia.org/P79854 and previous config saved to /var/cache/conftool/dbconfig/20250724-162150-fceratto.json |
[production] |
16:08 |
<dancy@deploy1003> |
Installation of scap version "4.190.0" completed for 2 hosts |
[production] |
16:06 |
<dancy@deploy1003> |
Installing scap version "4.190.0" for 2 host(s) |
[production] |
16:06 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209 (T399728)', diff saved to https://phabricator.wikimedia.org/P79852 and previous config saved to /var/cache/conftool/dbconfig/20250724-160643-fceratto.json |
[production] |
15:58 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['cirrussearch2079'] |
[production] |
15:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1194 (T399249)', diff saved to https://phabricator.wikimedia.org/P79851 and previous config saved to /var/cache/conftool/dbconfig/20250724-155206-marostegui.json |
[production] |
15:52 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1194.eqiad.wmnet with reason: Maintenance |
[production] |
15:51 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2209 (T399728)', diff saved to https://phabricator.wikimedia.org/P79850 and previous config saved to /var/cache/conftool/dbconfig/20250724-155151-fceratto.json |
[production] |
15:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1191 (T399249)', diff saved to https://phabricator.wikimedia.org/P79849 and previous config saved to /var/cache/conftool/dbconfig/20250724-155144-marostegui.json |
[production] |
15:51 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2209.codfw.wmnet with reason: Maintenance |
[production] |
15:51 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2194 (T399728)', diff saved to https://phabricator.wikimedia.org/P79848 and previous config saved to /var/cache/conftool/dbconfig/20250724-155128-fceratto.json |
[production] |
15:51 |
<jhancock@cumin1003> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cirrussearch2079'] |
[production] |
15:48 |
<isaranto@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
15:48 |
<isaranto@deploy1003> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
15:46 |
<isaranto@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . |
[production] |
15:37 |
<jhancock@cumin1003> |
END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['cirrussearch2079'] |
[production] |
15:37 |
<jhancock@cumin1003> |
START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cirrussearch2079'] |
[production] |
15:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P79847 and previous config saved to /var/cache/conftool/dbconfig/20250724-153637-marostegui.json |
[production] |