101-150 of 10000 results (72ms)
2025-07-11 ยง
12:20 <fceratto@cumin1002> START - Cookbook sre.mysql.pool es1034 gradually with 4 steps - Pooling in [production]
12:19 <gmodena@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:18 <gmodena@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:17 <andrew@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1037.eqiad.wmnet with OS bullseye [production]
12:17 <fceratto@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for es1034.eqiad.wmnet [production]
12:16 <fceratto@cumin1002> START - Cookbook sre.hosts.remove-downtime for es1034.eqiad.wmnet [production]
12:06 <gmodena@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:06 <gmodena@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:04 <gmodena@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:04 <gmodena@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:03 <fceratto@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host es1034.eqiad.wmnet [production]
12:01 <gmodena@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
12:01 <gmodena@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
11:52 <fceratto@cumin1002> START - Cookbook sre.hosts.reboot-single for host es1034.eqiad.wmnet [production]
11:44 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78916 and previous config saved to /var/cache/conftool/dbconfig/20250711-114439-root.json [production]
11:35 <fceratto@cumin1002> dbctl commit (dc=all): 'Depool es1034 for upgrade', diff saved to https://phabricator.wikimedia.org/P78915 and previous config saved to /var/cache/conftool/dbconfig/20250711-113532-fceratto.json [production]
11:34 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1034.eqiad.wmnet with reason: Maintenance [production]
11:31 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1032.eqiad.wmnet with reason: Maintenance [production]
11:30 <fceratto@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for es1031.eqiad.wmnet [production]
11:30 <fceratto@cumin1002> START - Cookbook sre.hosts.remove-downtime for es1031.eqiad.wmnet [production]
11:29 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2192.codfw.wmnet with reason: Maintenance [production]
11:29 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78914 and previous config saved to /var/cache/conftool/dbconfig/20250711-112933-root.json [production]
11:26 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcephosd1037.eqiad.wmnet with OS bullseye [production]
11:26 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on es1031.eqiad.wmnet with reason: Maintenance [production]
11:14 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P78913 and previous config saved to /var/cache/conftool/dbconfig/20250711-111428-root.json [production]
10:59 <marostegui@cumin1002> dbctl commit (dc=all): 'es1039 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78912 and previous config saved to /var/cache/conftool/dbconfig/20250711-105922-root.json [production]
10:50 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P78911 and previous config saved to /var/cache/conftool/dbconfig/20250711-105039-root.json [production]
10:35 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P78910 and previous config saved to /var/cache/conftool/dbconfig/20250711-103533-root.json [production]
10:32 <hnowlan@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
10:32 <hnowlan@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
10:31 <hnowlan@deploy1003> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
10:31 <hnowlan@deploy1003> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
10:30 <hnowlan@deploy1003> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
10:30 <hnowlan@deploy1003> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
10:26 <jmm@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1003.eqiad.wmnet with OS trixie [production]
10:20 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P78909 and previous config saved to /var/cache/conftool/dbconfig/20250711-102027-root.json [production]
10:05 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P78908 and previous config saved to /var/cache/conftool/dbconfig/20250711-100522-root.json [production]
10:01 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2192', diff saved to https://phabricator.wikimedia.org/P78907 and previous config saved to /var/cache/conftool/dbconfig/20250711-100106-root.json [production]
10:00 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P78906 and previous config saved to /var/cache/conftool/dbconfig/20250711-100033-root.json [production]
09:45 <marostegui@cumin1002> dbctl commit (dc=all): 'db2192 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P78905 and previous config saved to /var/cache/conftool/dbconfig/20250711-094527-root.json [production]
09:39 <root@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2192.codfw.wmnet with reason: Maintenance [production]
09:31 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2192 T399280', diff saved to https://phabricator.wikimedia.org/P78904 and previous config saved to /var/cache/conftool/dbconfig/20250711-093115-root.json [production]
09:30 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote db2213 to s5 primary T399280', diff saved to https://phabricator.wikimedia.org/P78903 and previous config saved to /var/cache/conftool/dbconfig/20250711-093006-marostegui.json [production]
09:29 <marostegui> Starting s5 codfw failover from db2192 to db2213 - T399280 [production]
09:27 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host sretest1003.eqiad.wmnet with OS trixie [production]
09:25 <moritzm> imported perccli for trixie-wikimedia T391083 [production]
09:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Remove db2213 from API/vslow/dump T399280', diff saved to https://phabricator.wikimedia.org/P78902 and previous config saved to /var/cache/conftool/dbconfig/20250711-091812-root.json [production]
09:15 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 22 hosts with reason: Primary switchover s5 T399280 [production]
09:12 <marostegui@cumin1002> dbctl commit (dc=all): 'db2223 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P78901 and previous config saved to /var/cache/conftool/dbconfig/20250711-091242-root.json [production]
09:04 <jmm@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1003.eqiad.wmnet with OS trixie [production]