|
2026-04-02
ยง
|
| 09:48 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |
| 09:48 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T419635)', diff saved to https://phabricator.wikimedia.org/P90210 and previous config saved to /var/cache/conftool/dbconfig/20260402-094808-fceratto.json |
[production] |
| 09:48 |
<javiermonton@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-main: sync |
[production] |
| 09:47 |
<javiermonton@deploy1003> |
helmfile [eqiad] START helmfile.d/services/eventgate-main: sync |
[production] |
| 09:45 |
<javiermonton@deploy1003> |
helmfile [staging] DONE helmfile.d/services/eventgate-main: sync |
[production] |
| 09:45 |
<javiermonton@deploy1003> |
helmfile [staging] START helmfile.d/services/eventgate-main: sync |
[production] |
| 09:38 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P90209 and previous config saved to /var/cache/conftool/dbconfig/20260402-093759-fceratto.json |
[production] |
| 09:33 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.misc-clusters.restart-reboot-config-master (exit_code=0) rolling reboot on A:config-master-codfw |
[production] |
| 09:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) config-master.discovery.wmnet. on all recursors |
[production] |
| 09:29 |
<jmm@cumin2002> |
START - Cookbook sre.dns.wipe-cache config-master.discovery.wmnet. on all recursors |
[production] |
| 09:29 |
<jmm@cumin2002> |
START - Cookbook sre.misc-clusters.restart-reboot-config-master rolling reboot on A:config-master-codfw |
[production] |
| 09:27 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P90208 and previous config saved to /var/cache/conftool/dbconfig/20260402-092751-fceratto.json |
[production] |
| 09:27 |
<gkyziridis@deploy1003> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 09:27 |
<gkyziridis@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 09:19 |
<moritzm> |
upgrading Envoy on the config-master servers to 1.35.9 T419637 T410975 |
[production] |
| 09:17 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T419635)', diff saved to https://phabricator.wikimedia.org/P90207 and previous config saved to /var/cache/conftool/dbconfig/20260402-091743-fceratto.json |
[production] |
| 08:55 |
<gkyziridis@deploy1003> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 08:55 |
<gkyziridis@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 08:49 |
<moritzm> |
added Atsuko to the cn=ops LDAP group T421860 |
[production] |
| 08:46 |
<dpogorzelski@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: sync |
[production] |
| 08:45 |
<dpogorzelski@deploy1003> |
helmfile [eqiad] START helmfile.d/services/changeprop: sync |
[production] |
| 08:45 |
<dpogorzelski@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/changeprop: sync |
[production] |
| 08:44 |
<fceratto@cumin1003> |
dbctl commit (dc=all): 'Depooling db2149 (T419635)', diff saved to https://phabricator.wikimedia.org/P90206 and previous config saved to /var/cache/conftool/dbconfig/20260402-084452-fceratto.json |
[production] |
| 08:44 |
<fceratto@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2149.codfw.wmnet with reason: Maintenance |
[production] |
| 08:44 |
<dpogorzelski@deploy1003> |
helmfile [codfw] START helmfile.d/services/changeprop: sync |
[production] |
| 08:42 |
<XioNoX> |
reboot mr1-esams - T416450 |
[production] |
| 08:30 |
<volans> |
briefly disabling puppet on P:installserver::proxy to deploy g/1266885 |
[production] |
| 08:17 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
| 08:10 |
<jnuche@deploy1003> |
rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.22 refs T420480 |
[production] |
| 08:00 |
<mszwarc@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1266866|Disable external link analysis (T419837)]] (duration: 10m 13s) |
[production] |
| 07:56 |
<mszwarc@deploy1003> |
mszwarc: Continuing with sync |
[production] |
| 07:55 |
<jmm@dns1004> |
END - running authdns-update |
[production] |
| 07:54 |
<jmm@dns1004> |
START - running authdns-update |
[production] |
| 07:52 |
<mszwarc@deploy1003> |
mszwarc: Backport for [[gerrit:1266866|Disable external link analysis (T419837)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 07:50 |
<mszwarc@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1266866|Disable external link analysis (T419837)]] |
[production] |
| 07:47 |
<jnuche@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1266861|ApiAuthManagerHelper: Accept fields with undefined label (T422027)]] (duration: 06m 39s) |
[production] |
| 07:47 |
<bking@cumin2002> |
START - Cookbook sre.wdqs.data-transfer (, T421714) xfer wdqs-all from wdqs2016.codfw.wmnet -> wdqs1027.eqiad.wmnet, repooling both afterwards |
[production] |
| 07:43 |
<jnuche@deploy1003> |
jnuche: Continuing with sync |
[production] |
| 07:43 |
<jnuche@deploy1003> |
jnuche: Backport for [[gerrit:1266861|ApiAuthManagerHelper: Accept fields with undefined label (T422027)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 07:41 |
<jnuche@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1266861|ApiAuthManagerHelper: Accept fields with undefined label (T422027)]] |
[production] |
| 07:38 |
<moritzm> |
purge prometheus-nginx-exporter from url downloaders, remnants of early hcapcha rollout |
[production] |
| 07:38 |
<ryankemper@deploy1003> |
Finished deploy [wdqs/wdqs@fea7794]: deploy to freshly reimaged wdqs host (duration: 00m 05s) |
[production] |
| 07:38 |
<ryankemper@deploy1003> |
Started deploy [wdqs/wdqs@fea7794]: deploy to freshly reimaged wdqs host |
[production] |
| 07:32 |
<ayounsi@cumin1003> |
END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 64049 |
[production] |
| 07:30 |
<ayounsi@cumin1003> |
START - Cookbook sre.network.peering with action 'email' for AS: 64049 |
[production] |
| 07:25 |
<gkyziridis@deploy1003> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 07:24 |
<gkyziridis@deploy1003> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
| 07:12 |
<gkyziridis@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1266228|EventStreamConfig: Add rr-multilingual prediction_change stream (T415892)]] (duration: 07m 00s) |
[production] |
| 07:08 |
<gkyziridis@deploy1003> |
gkyziridis: Continuing with sync |
[production] |
| 07:07 |
<gkyziridis@deploy1003> |
gkyziridis: Backport for [[gerrit:1266228|EventStreamConfig: Add rr-multilingual prediction_change stream (T415892)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |