2025-07-24
§
|
09:58 |
<cgoubert@dns1004> |
END - running authdns-update |
[production] |
09:58 |
<hnowlan@deploy1003> |
helmfile [staging] DONE helmfile.d/services/thumbor: sync |
[production] |
09:58 |
<hnowlan@deploy1003> |
helmfile [staging] START helmfile.d/services/thumbor: sync |
[production] |
09:57 |
<hnowlan@deploy1003> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
09:57 |
<hnowlan@deploy1003> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |
09:57 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs1013.eqiad.wmnet with OS bookworm |
[production] |
09:57 |
<cgoubert@dns1004> |
START - running authdns-update |
[production] |
09:55 |
<hnowlan@deploy1003> |
helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
09:54 |
<hnowlan@deploy1003> |
helmfile [staging-eqiad] START helmfile.d/admin 'apply'. |
[production] |
09:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P79803 and previous config saved to /var/cache/conftool/dbconfig/20250724-094706-marostegui.json |
[production] |
09:42 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1013.eqiad.wmnet with reason: host reimage |
[production] |
09:37 |
<vgutierrez> |
disable BGP for lvs1013 on lsw1-e1-eqiad.mgmt.eqiad.wmnet - T400259 |
[production] |
09:36 |
<vgutierrez@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on lvs1013.eqiad.wmnet with reason: host reimage |
[production] |
09:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1170 (T399249)', diff saved to https://phabricator.wikimedia.org/P79801 and previous config saved to /var/cache/conftool/dbconfig/20250724-093158-marostegui.json |
[production] |
09:22 |
<vgutierrez@cumin1002> |
START - Cookbook sre.hosts.reimage for host lvs1013.eqiad.wmnet with OS bookworm |
[production] |
09:13 |
<vgutierrez@cumin1002> |
END (PASS) - Cookbook sre.loadbalancer.admin (exit_code=0) depooling P{lvs1013.eqiad.wmnet} and A:liberica (T400259) |
[production] |
09:12 |
<vgutierrez@cumin1002> |
START - Cookbook sre.loadbalancer.admin depooling P{lvs1013.eqiad.wmnet} and A:liberica (T400259) |
[production] |
08:22 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1170 (T399249)', diff saved to https://phabricator.wikimedia.org/P79800 and previous config saved to /var/cache/conftool/dbconfig/20250724-082213-marostegui.json |
[production] |
08:22 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1170.eqiad.wmnet with reason: Maintenance |
[production] |
08:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T399249)', diff saved to https://phabricator.wikimedia.org/P79799 and previous config saved to /var/cache/conftool/dbconfig/20250724-082150-marostegui.json |
[production] |
08:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P79798 and previous config saved to /var/cache/conftool/dbconfig/20250724-080643-marostegui.json |
[production] |
08:06 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1227 (re)pooling @ 100%: 10', diff saved to https://phabricator.wikimedia.org/P79797 and previous config saved to /var/cache/conftool/dbconfig/20250724-080617-root.json |
[production] |
07:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P79796 and previous config saved to /var/cache/conftool/dbconfig/20250724-075135-marostegui.json |
[production] |
07:51 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1227 (re)pooling @ 75%: 10', diff saved to https://phabricator.wikimedia.org/P79795 and previous config saved to /var/cache/conftool/dbconfig/20250724-075112-root.json |
[production] |
07:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1158 (T399249)', diff saved to https://phabricator.wikimedia.org/P79794 and previous config saved to /var/cache/conftool/dbconfig/20250724-073628-marostegui.json |
[production] |
07:36 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1227 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P79793 and previous config saved to /var/cache/conftool/dbconfig/20250724-073606-root.json |
[production] |
07:21 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db1227 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P79792 and previous config saved to /var/cache/conftool/dbconfig/20250724-072100-root.json |
[production] |
07:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db1227 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P79791 and previous config saved to /var/cache/conftool/dbconfig/20250724-071300-marostegui.json |
[production] |
07:12 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1227.eqiad.wmnet with reason: Maintenance |
[production] |
06:52 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depooling db1158 (T399249)', diff saved to https://phabricator.wikimedia.org/P79790 and previous config saved to /var/cache/conftool/dbconfig/20250724-065222-marostegui.json |
[production] |
06:52 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1014,1018].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
06:51 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1158.eqiad.wmnet with reason: Maintenance |
[production] |
06:33 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P79789 and previous config saved to /var/cache/conftool/dbconfig/20250724-063300-root.json |
[production] |
06:17 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P79788 and previous config saved to /var/cache/conftool/dbconfig/20250724-061755-root.json |
[production] |
06:02 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P79787 and previous config saved to /var/cache/conftool/dbconfig/20250724-060249-root.json |
[production] |
05:47 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P79786 and previous config saved to /var/cache/conftool/dbconfig/20250724-054743-root.json |
[production] |
05:32 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'es2035 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P79785 and previous config saved to /var/cache/conftool/dbconfig/20250724-053236-root.json |
[production] |
05:28 |
<marostegui@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2218.codfw.wmnet with reason: Maintenance |
[production] |
01:28 |
<ryankemper> |
[Cirrus] `ryankemper@cirrussearch2071:~$ sudo systemctl restart opensearch-disable-readahead-production-search-psi-codfw.service` |
[production] |
01:01 |
<ryankemper@cumin1002> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_codfw: activate new plugins packages - ryankemper@cumin1002 - T397227 |
[production] |
2025-07-23
§
|
23:54 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: security release 20250723 |
[production] |
23:48 |
<ryankemper@cumin1002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_codfw: activate new plugins packages - ryankemper@cumin1002 - T397227 |
[production] |
23:46 |
<ryankemper> |
[Cirrus] Depooled codfw in anticipation of rolling restart. Hopefully minimal noise on this one :) |
[production] |
23:46 |
<ryankemper@cumin1002> |
conftool action : set/pooled=false; selector: dnsdisc=search,name=codfw |
[production] |
23:15 |
<inflatador> |
pool cirrussearch eqiad, will resume investigations tomorrow T400160 |
[production] |
23:14 |
<bking@cumin2002> |
conftool action : set/pooled=true; selector: dnsdisc=search,name=eqiad |
[production] |
23:08 |
<bking@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 55 hosts with reason: testing cluster quorum |
[production] |
22:53 |
<bking@cumin1002> |
END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: activate new plugins packages - bking@cumin1002 - T397227 |
[production] |
22:17 |
<vriley@cumin1002> |
END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |
22:05 |
<vriley@cumin1002> |
START - Cookbook sre.hosts.provision for host clouddb1022.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART |
[production] |