2025-01-03
§
|
09:05 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2236 (re)pooling @ 10%: Repooling after upgrade', diff saved to https://phabricator.wikimedia.org/P71772 and previous config saved to /var/cache/conftool/dbconfig/20250103-090541-root.json |
[production] |
09:03 |
<marostegui> |
Upgrade db2236 to 10.11.10 s4 codfw dbmaint T378940 |
[production] |
09:02 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2236.codfw.wmnet with reason: upgrade |
[production] |
09:02 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on db2236.codfw.wmnet with reason: upgrade |
[production] |
09:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2236 to upgrade to 10.11.10 T378940', diff saved to https://phabricator.wikimedia.org/P71771 and previous config saved to /var/cache/conftool/dbconfig/20250103-090215-marostegui.json |
[production] |
08:35 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db2115.codfw.wmnet |
[production] |
08:35 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:35 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db2115.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" |
[production] |
08:33 |
<marostegui@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db2115.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" |
[production] |
08:29 |
<marostegui@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
08:24 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts db2115.codfw.wmnet |
[production] |
07:36 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on db1193.eqiad.wmnet with reason: maintenance |
[production] |
07:36 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on db1193.eqiad.wmnet with reason: maintenance |
[production] |
07:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Remove db2115 from dbctl T362949', diff saved to https://phabricator.wikimedia.org/P71770 and previous config saved to /var/cache/conftool/dbconfig/20250103-072349-marostegui.json |
[production] |
06:32 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts db2116.codfw.wmnet |
[production] |
06:32 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
06:32 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db2116.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" |
[production] |
06:32 |
<marostegui@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: db2116.codfw.wmnet decommissioned, removing all IPs except the asset tag one - marostegui@cumin1002" |
[production] |
06:28 |
<marostegui@cumin1002> |
START - Cookbook sre.dns.netbox |
[production] |
06:23 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.decommission for hosts db2116.codfw.wmnet |
[production] |
2025-01-02
§
|
22:23 |
<wfan> |
SmashPig upgraded from 17ac74f2 to 1d060a11 |
[production] |
21:26 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1106316|[Growth] Remove Marketing campaign (T382499)]], [[gerrit:1107561|gomwiki: Use wikitext talk pages by default (T382810)]] (duration: 21m 42s) |
[production] |
21:18 |
<urbanecm@deploy2002> |
urbanecm: Continuing with sync |
[production] |
21:17 |
<urbanecm@deploy2002> |
urbanecm: Backport for [[gerrit:1106316|[Growth] Remove Marketing campaign (T382499)]], [[gerrit:1107561|gomwiki: Use wikitext talk pages by default (T382810)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
21:04 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1106316|[Growth] Remove Marketing campaign (T382499)]], [[gerrit:1107561|gomwiki: Use wikitext talk pages by default (T382810)]] |
[production] |
18:23 |
<bd808@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:23 |
<bd808@deploy2002> |
helmfile [codfw] START helmfile.d/services/developer-portal: apply |
[production] |
18:23 |
<bd808@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:22 |
<bd808@deploy2002> |
helmfile [eqiad] START helmfile.d/services/developer-portal: apply |
[production] |
18:22 |
<bd808@deploy2002> |
helmfile [staging] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:22 |
<bd808@deploy2002> |
helmfile [staging] START helmfile.d/services/developer-portal: apply |
[production] |
18:22 |
<bd808@deploy2002> |
helmfile [staging] DONE helmfile.d/services/developer-portal: apply |
[production] |
18:21 |
<bd808@deploy2002> |
helmfile [staging] START helmfile.d/services/developer-portal: apply |
[production] |
14:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Remove db2116 from dbctl T362950', diff saved to https://phabricator.wikimedia.org/P71768 and previous config saved to /var/cache/conftool/dbconfig/20250102-142806-marostegui.json |
[production] |
11:49 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be1066.eqiad.wmnet |
[production] |
11:49 |
<mvernon@cumin1002> |
START - Cookbook sre.hosts.remove-downtime for ms-be1066.eqiad.wmnet |
[production] |
11:31 |
<marostegui> |
dbmaint s8 db1193 eqiad rebuild pagelinks and recentchanges and deploy schema change on revision table T367856 T382842 |
[production] |
11:29 |
<mvernon@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on ms-be1066.eqiad.wmnet with reason: vacuum three container dbs |
[production] |
11:28 |
<mvernon@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on ms-be1066.eqiad.wmnet with reason: vacuum three container dbs |
[production] |
11:22 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on db1193.eqiad.wmnet with reason: maintenance |
[production] |
11:22 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on db1193.eqiad.wmnet with reason: maintenance |
[production] |
11:11 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1193 T381993', diff saved to https://phabricator.wikimedia.org/P71766 and previous config saved to /var/cache/conftool/dbconfig/20250102-111105-marostegui.json |
[production] |
11:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Promote db1209 to s8 primary T381993', diff saved to https://phabricator.wikimedia.org/P71765 and previous config saved to /var/cache/conftool/dbconfig/20250102-110923-marostegui.json |
[production] |
11:09 |
<marostegui> |
Starting s8 eqiad failover from db1193 to db1209 - T381993 |
[production] |
11:03 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Remove db1209 from API/vslow/dump T381993', diff saved to https://phabricator.wikimedia.org/P71764 and previous config saved to /var/cache/conftool/dbconfig/20250102-110305-root.json |
[production] |
11:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Set db1209 with weight 0 T381993', diff saved to https://phabricator.wikimedia.org/P71763 and previous config saved to /var/cache/conftool/dbconfig/20250102-110232-root.json |
[production] |
11:00 |
<root@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 32 hosts with reason: Primary switchover s8 T381993 |
[production] |
11:00 |
<root@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on 32 hosts with reason: Primary switchover s8 T381993 |
[production] |
10:56 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts dbproxy2004.codfw.wmnet |
[production] |
10:56 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |