|
2026-04-27
ยง
|
| 15:05 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249 (T419961)', diff saved to https://phabricator.wikimedia.org/P91625 and previous config saved to /var/cache/conftool/dbconfig/20260427-150547-fceratto.json |
[production] |
| 15:05 |
<dpogorzelski@deploy1003> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 15:02 |
<mvernon@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
| 15:00 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host ms-be1092 |
[production] |
| 15:00 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-be1092.eqiad.wmnet with OS bullseye |
[production] |
| 14:59 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2157', diff saved to https://phabricator.wikimedia.org/P91624 and previous config saved to /var/cache/conftool/dbconfig/20260427-145957-fceratto.json |
[production] |
| 14:59 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be1091.eqiad.wmnet with OS bullseye |
[production] |
| 14:58 |
<jiji@deploy1003> |
Locking from deployment [ALL REPOSITORIES]: Upgrading mw-mcrouter - effie |
[production] |
| 14:55 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P91623 and previous config saved to /var/cache/conftool/dbconfig/20260427-145539-fceratto.json |
[production] |
| 14:55 |
<XioNoX> |
upgrade gnmic on netflow7002 - T416360 |
[production] |
| 14:55 |
<aokoth@cumin1003> |
START - Cookbook sre.hosts.reimage for host phab2003.codfw.wmnet with OS bullseye |
[production] |
| 14:51 |
<aokoth@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host phab2003.codfw.wmnet with OS bullseye |
[production] |
| 14:50 |
<herron@cumin1003> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling restart_daemons on A:kafka-logging-codfw |
[production] |
| 14:50 |
<XioNoX> |
add gnmic 0.45 to trixie-wikimedia - T416360 |
[production] |
| 14:49 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2157 (T419635)', diff saved to https://phabricator.wikimedia.org/P91622 and previous config saved to /var/cache/conftool/dbconfig/20260427-144949-fceratto.json |
[production] |
| 14:47 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db2157 (T419635)', diff saved to https://phabricator.wikimedia.org/P91621 and previous config saved to /var/cache/conftool/dbconfig/20260427-144718-fceratto.json |
[production] |
| 14:47 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2157.codfw.wmnet with reason: Maintenance |
[production] |
| 14:47 |
<XioNoX> |
add gnmic 0.45 to bookworm-wikimedia - T416360 |
[production] |
| 14:45 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P91620 and previous config saved to /var/cache/conftool/dbconfig/20260427-144531-fceratto.json |
[production] |
| 14:42 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be1091.eqiad.wmnet with reason: host reimage |
[production] |
| 14:37 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be1091.eqiad.wmnet with reason: host reimage |
[production] |
| 14:35 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1249 (T419961)', diff saved to https://phabricator.wikimedia.org/P91619 and previous config saved to /var/cache/conftool/dbconfig/20260427-143523-fceratto.json |
[production] |
| 14:29 |
<Lucas_WMDE> |
UTC afternoon backport+config window done |
[production] |
| 14:29 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wmde: apply |
[production] |
| 14:28 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wmde: apply |
[production] |
| 14:27 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db1249 (T419961)', diff saved to https://phabricator.wikimedia.org/P91618 and previous config saved to /var/cache/conftool/dbconfig/20260427-142714-fceratto.json |
[production] |
| 14:27 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1249.eqiad.wmnet with reason: Maintenance |
[production] |
| 14:26 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1248 (T419961)', diff saved to https://phabricator.wikimedia.org/P91617 and previous config saved to /var/cache/conftool/dbconfig/20260427-142646-fceratto.json |
[production] |
| 14:25 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for alias: dse-k8s-master-codfw@codfw |
[production] |
| 14:25 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs |
[production] |
| 14:24 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be1091 |
[production] |
| 14:24 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be1091 |
[production] |
| 14:24 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-wikidata: apply |
[production] |
| 14:24 |
<btullis@cumin1003> |
START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs |
[production] |
| 14:24 |
<mvernon@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host ms-be1091 |
[production] |
| 14:24 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be1091.eqiad.wmnet 21.48.64.10.in-addr.arpa 1.2.0.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 14:24 |
<mvernon@cumin2002> |
START - Cookbook sre.dns.wipe-cache ms-be1091.eqiad.wmnet 21.48.64.10.in-addr.arpa 1.2.0.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 14:24 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 14:24 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be1091 - mvernon@cumin2002" |
[production] |
| 14:23 |
<kharlan@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1277426|wmgPrivilegedGroups/wmgPrivilegedGlobalGroups: Update to include temporary account IP viewers]], [[gerrit:1277455|hCaptcha: enable for mobile apps account creation on testwiki (T412132)]] (duration: 06m 23s) |
[production] |
| 14:23 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-wikidata: apply |
[production] |
| 14:23 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-sre: apply |
[production] |
| 14:23 |
<mvernon@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be1091 - mvernon@cumin2002" |
[production] |
| 14:22 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-sre: apply |
[production] |
| 14:20 |
<dpogorzelski@deploy1003> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 14:20 |
<kharlan@deploy1003> |
kharlan: Continuing with deployment |
[production] |
| 14:19 |
<fceratto@cumin1003> |
END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1159: Repooling |
[production] |
| 14:19 |
<kharlan@deploy1003> |
kharlan: Backport for [[gerrit:1277426|wmgPrivilegedGroups/wmgPrivilegedGlobalGroups: Update to include temporary account IP viewers]], [[gerrit:1277455|hCaptcha: enable for mobile apps account creation on testwiki (T412132)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 14:18 |
<dpogorzelski@deploy1003> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 14:18 |
<btullis@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-ml: apply |
[production] |