|
2026-06-30
§
|
| 11:13 |
<moritzm> |
installing Linux 6.12.94 on Trixie hosts |
[production] |
| 11:03 |
<javiermonton@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/pageview-trending-relative-next: apply |
[production] |
| 11:03 |
<javiermonton@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/pageview-trending-relative-next: apply |
[production] |
| 10:56 |
<javiermonton@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/webrequest-page-view-next: apply |
[production] |
| 10:55 |
<javiermonton@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/webrequest-page-view-next: apply |
[production] |
| 10:50 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub-next: apply |
[production] |
| 10:43 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbproxy[1026,1028].eqiad.wmnet with reason: cloning |
[production] |
| 10:40 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1228.eqiad.wmnet with reason: cloning |
[production] |
| 10:40 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub-next: apply |
[production] |
| 10:38 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1217.eqiad.wmnet with reason: cloning |
[production] |
| 10:35 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1261.eqiad.wmnet with OS trixie |
[production] |
| 10:33 |
<javiermonton@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1306647|stream: webrequest.page_view.dev0 (T426091)]] (duration: 07m 56s) |
[production] |
| 10:32 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s_services/services/datahub: apply
[production] |
| 10:30 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s_services/services/datahub: apply |
[production] |
| 10:28 |
<javiermonton@deploy1003> |
javiermonton: Continuing with deployment |
[production] |
| 10:27 |
<javiermonton@deploy1003> |
javiermonton: Backport for [[gerrit:1306647|stream: webrequest.page_view.dev0 (T426091)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 10:25 |
<daniel@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
| 10:25 |
<daniel@deploy1003> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
| 10:25 |
<javiermonton@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1306647|stream: webrequest.page_view.dev0 (T426091)]] |
[production] |
| 10:20 |
<daniel@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
| 10:19 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1261.eqiad.wmnet with reason: host reimage |
[production] |
| 10:18 |
<daniel@deploy1003> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
| 10:15 |
<cwilliams@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on db1261.eqiad.wmnet with reason: host reimage |
[production] |
| 10:12 |
<daniel@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
| 10:07 |
<daniel@deploy1003> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |
| 10:00 |
<cwilliams@cumin1003> |
START - Cookbook sre.hosts.reimage for host db1261.eqiad.wmnet with OS trixie |
[production] |
| 09:56 |
<godog> |
restart pybal on A:lvs-high-traffic2-eqiad |
[production] |
| 09:53 |
<godog> |
restart pybal on A:lvs-secondary-eqiad |
[production] |
| 09:51 |
<aikochou@deploy1003> |
helmfile [staging] DONE helmfile.d/services/changeprop: sync |
[production] |
| 09:51 |
<aikochou@deploy1003> |
helmfile [staging] START helmfile.d/services/changeprop: sync |
[production] |
| 09:49 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2204 (T426633)', diff saved to https://phabricator.wikimedia.org/P94616 and previous config saved to /var/cache/conftool/dbconfig/20260630-094938-fceratto.json |
[production] |
| 09:39 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2204', diff saved to https://phabricator.wikimedia.org/P94615 and previous config saved to /var/cache/conftool/dbconfig/20260630-093931-fceratto.json |
[production] |
| 09:38 |
<aklapper@deploy1003> |
rebuilt and synchronized wikiversions files: group0 to 1.47.0-wmf.9 refs T423918 |
[production] |
| 09:35 |
<cwilliams@cumin1003> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1261: Upgrading db1261.eqiad.wmnet |
[production] |
| 09:34 |
<cwilliams@cumin1003> |
START - Cookbook sre.mysql.depool depool db1261: Upgrading db1261.eqiad.wmnet |
[production] |
| 09:34 |
<cwilliams@cumin1003> |
dbmaint on s4@eqiad T429893 |
[production] |
| 09:34 |
<cwilliams@cumin1003> |
START - Cookbook sre.mysql.major-upgrade |
[production] |
| 09:29 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2204', diff saved to https://phabricator.wikimedia.org/P94613 and previous config saved to /var/cache/conftool/dbconfig/20260630-092923-fceratto.json |
[production] |
| 09:23 |
<aklapper@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1306629|Fix overflow menu for non-advanced users (T428220)]] (duration: 11m 40s) |
[production] |
| 09:19 |
<filippo@puppetserver1001> |
conftool action : set/pooled=yes:weight=100; selector: service=dumps-nfs |
[production] |
| 09:19 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2204 (T426633)', diff saved to https://phabricator.wikimedia.org/P94612 and previous config saved to /var/cache/conftool/dbconfig/20260630-091915-fceratto.json |
[production] |
| 09:17 |
<aklapper@deploy1003> |
aklapper: Continuing with deployment |
[production] |
| 09:16 |
<aklapper@deploy1003> |
aklapper: Backport for [[gerrit:1306629|Fix overflow menu for non-advanced users (T428220)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 09:13 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db2204 (T426633)', diff saved to https://phabricator.wikimedia.org/P94611 and previous config saved to /var/cache/conftool/dbconfig/20260630-091307-fceratto.json |
[production] |
| 09:13 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2204.codfw.wmnet with reason: Maintenance |
[production] |
| 09:12 |
<aklapper@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1306629|Fix overflow menu for non-advanced users (T428220)]] |
[production] |
| 09:08 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Set weight db2204 T430624', diff saved to https://phabricator.wikimedia.org/P94610 and previous config saved to /var/cache/conftool/dbconfig/20260630-090841-fceratto.json |
[production] |
| 09:05 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Promote db2207 to s2 primary T430624', diff saved to https://phabricator.wikimedia.org/P94609 and previous config saved to /var/cache/conftool/dbconfig/20260630-090530-fceratto.json |
[production] |
| 09:04 |
<federico3> |
Starting s2 codfw failover from db2204 to db2207 - T430624 |
[production] |
| 09:02 |
<marostegui@cumin1003> |
conftool action : set/pooled=no; selector: name=clouddb1014.eqiad.wmnet,service=s2 |
[production] |