2025-07-24
§
|
19:03 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1030.eqiad.wmnet with reason: host reimage |
[production] |
18:54 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.reimage for host clouddb1022.eqiad.wmnet with OS bookworm |
[production] |
18:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1227 (T399249)', diff saved to https://phabricator.wikimedia.org/P79878 and previous config saved to /var/cache/conftool/dbconfig/20250724-185343-marostegui.json |
[production] |
18:53 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1227.eqiad.wmnet with reason: Maintenance |
[production] |
18:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1202 (T399249)', diff saved to https://phabricator.wikimedia.org/P79877 and previous config saved to /var/cache/conftool/dbconfig/20250724-185320-marostegui.json |
[production] |
18:52 |
<vriley@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host clouddb1022.eqiad.wmnet with OS bookworm |
[production] |
18:48 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P79876 and previous config saved to /var/cache/conftool/dbconfig/20250724-184815-root.json |
[production] |
18:45 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.reimage for host logstash1030.eqiad.wmnet with OS bookworm |
[production] |
18:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P79875 and previous config saved to /var/cache/conftool/dbconfig/20250724-183813-marostegui.json |
[production] |
18:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P79874 and previous config saved to /var/cache/conftool/dbconfig/20250724-183309-root.json |
[production] |
18:29 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1025.eqiad.wmnet with OS bookworm |
[production] |
18:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P79873 and previous config saved to /var/cache/conftool/dbconfig/20250724-182306-marostegui.json |
[production] |
18:18 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P79872 and previous config saved to /var/cache/conftool/dbconfig/20250724-181803-root.json |
[production] |
18:15 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.reimage for host clouddb1022.eqiad.wmnet with OS bookworm |
[production] |
18:13 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
18:12 |
<dduvall@deploy1003> |
rebuilt and synchronized wikiversions files: group2 to 1.45.0-wmf.11 refs T396372 |
[production] |
18:07 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1202 (T399249)', diff saved to https://phabricator.wikimedia.org/P79871 and previous config saved to /var/cache/conftool/dbconfig/20250724-180758-marostegui.json |
[production] |
18:06 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
18:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P79870 and previous config saved to /var/cache/conftool/dbconfig/20250724-180258-root.json |
[production] |
17:58 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1025.eqiad.wmnet with reason: host reimage |
[production] |
17:52 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2239.codfw.wmnet with reason: Maintenance |
[production] |
17:52 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1025.eqiad.wmnet with reason: host reimage |
[production] |
17:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227 (T399728)', diff saved to https://phabricator.wikimedia.org/P79869 and previous config saved to /var/cache/conftool/dbconfig/20250724-175242-fceratto.json |
[production] |
17:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2036 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P79868 and previous config saved to /var/cache/conftool/dbconfig/20250724-174752-root.json |
[production] |
17:38 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.reimage for host logstash1025.eqiad.wmnet with OS bookworm |
[production] |
17:37 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227', diff saved to https://phabricator.wikimedia.org/P79867 and previous config saved to /var/cache/conftool/dbconfig/20250724-173734-fceratto.json |
[production] |
17:34 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1024.eqiad.wmnet with OS bookworm |
[production] |
17:23 |
<fceratto@deploy1003> |
helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . |
[production] |
17:22 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227', diff saved to https://phabricator.wikimedia.org/P79866 and previous config saved to /var/cache/conftool/dbconfig/20250724-172227-fceratto.json |
[production] |
17:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1202 (T399249)', diff saved to https://phabricator.wikimedia.org/P79865 and previous config saved to /var/cache/conftool/dbconfig/20250724-172140-marostegui.json |
[production] |
17:21 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1202.eqiad.wmnet with reason: Maintenance |
[production] |
17:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194 (T399249)', diff saved to https://phabricator.wikimedia.org/P79864 and previous config saved to /var/cache/conftool/dbconfig/20250724-172117-marostegui.json |
[production] |
17:17 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host es2036 |
[production] |
17:17 |
<jhancock@cumin1003> |
START - Cookbook sre.network.configure-switch-interfaces for host es2036 |
[production] |
17:16 |
<jhancock@cumin1003> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
17:14 |
<jhancock@cumin1003> |
START - Cookbook sre.dns.netbox |
[production] |
17:07 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2227 (T399728)', diff saved to https://phabricator.wikimedia.org/P79863 and previous config saved to /var/cache/conftool/dbconfig/20250724-170719-fceratto.json |
[production] |
17:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P79862 and previous config saved to /var/cache/conftool/dbconfig/20250724-170608-marostegui.json |
[production] |
16:53 |
<hnowlan> |
delete thumbor pod where all instances displayed signs of T374350 |
[production] |
16:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2227 (T399728)', diff saved to https://phabricator.wikimedia.org/P79860 and previous config saved to /var/cache/conftool/dbconfig/20250724-165228-fceratto.json |
[production] |
16:52 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2227.codfw.wmnet with reason: Maintenance |
[production] |
16:52 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209 (T399728)', diff saved to https://phabricator.wikimedia.org/P79859 and previous config saved to /var/cache/conftool/dbconfig/20250724-165205-fceratto.json |
[production] |
16:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P79858 and previous config saved to /var/cache/conftool/dbconfig/20250724-165100-marostegui.json |
[production] |
16:48 |
<cwhite@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1024.eqiad.wmnet with reason: host reimage |
[production] |
16:43 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1024.eqiad.wmnet with reason: host reimage |
[production] |
16:36 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2209', diff saved to https://phabricator.wikimedia.org/P79857 and previous config saved to /var/cache/conftool/dbconfig/20250724-163658-fceratto.json |
[production] |
16:35 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1194 (T399249)', diff saved to https://phabricator.wikimedia.org/P79856 and previous config saved to /var/cache/conftool/dbconfig/20250724-163553-marostegui.json |
[production] |
16:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool es2036 T399927', diff saved to https://phabricator.wikimedia.org/P79855 and previous config saved to /var/cache/conftool/dbconfig/20250724-163439-root.json |
[production] |
16:33 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2036.codfw.wmnet with reason: Maintenance |
[production] |
16:27 |
<cwhite@cumin2002> |
START - Cookbook sre.hosts.reimage for host logstash1024.eqiad.wmnet with OS bookworm |
[production] |