2025-09-01
ยง
|
10:44 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
10:43 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T401906)', diff saved to https://phabricator.wikimedia.org/P82297 and previous config saved to /var/cache/conftool/dbconfig/20250901-104345-fceratto.json |
[production] |
10:28 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P82296 and previous config saved to /var/cache/conftool/dbconfig/20250901-102837-fceratto.json |
[production] |
10:13 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P82295 and previous config saved to /var/cache/conftool/dbconfig/20250901-101330-fceratto.json |
[production] |
10:07 |
<jmm@cumin2002> |
START - Cookbook sre.postgresql.postgres-init |
[production] |
10:06 |
<jmm@cumin2002> |
START - Cookbook sre.postgresql.postgres-init |
[production] |
10:05 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.postgresql.postgres-init (exit_code=99) |
[production] |
10:04 |
<jmm@cumin2002> |
START - Cookbook sre.postgresql.postgres-init |
[production] |
10:01 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance |
[production] |
10:00 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1179 (T403362)', diff saved to https://phabricator.wikimedia.org/P82294 and previous config saved to /var/cache/conftool/dbconfig/20250901-100054-ladsgroup.json |
[production] |
09:58 |
<dcausse@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] (duration: 11m 12s) |
[production] |
09:58 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T401906)', diff saved to https://phabricator.wikimedia.org/P82293 and previous config saved to /var/cache/conftool/dbconfig/20250901-095822-fceratto.json |
[production] |
09:53 |
<dcausse@deploy1003> |
dcausse: Continuing with sync |
[production] |
09:52 |
<dcausse@deploy1003> |
dcausse: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
09:47 |
<dcausse@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] |
[production] |
09:47 |
<hnowlan@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply |
[production] |
09:47 |
<hnowlan@deploy1003> |
helmfile [eqiad] START helmfile.d/services/rest-gateway: apply |
[production] |
09:45 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1179', diff saved to https://phabricator.wikimedia.org/P82292 and previous config saved to /var/cache/conftool/dbconfig/20250901-094547-ladsgroup.json |
[production] |
09:45 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2156 (T401906)', diff saved to https://phabricator.wikimedia.org/P82291 and previous config saved to /var/cache/conftool/dbconfig/20250901-094504-fceratto.json |
[production] |
09:44 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2156.codfw.wmnet with reason: Maintenance |
[production] |
09:44 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T401906)', diff saved to https://phabricator.wikimedia.org/P82290 and previous config saved to /var/cache/conftool/dbconfig/20250901-094442-fceratto.json |
[production] |
09:43 |
<hnowlan@deploy1003> |
helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply |
[production] |
09:43 |
<hnowlan@deploy1003> |
helmfile [codfw] START helmfile.d/services/rest-gateway: apply |
[production] |
09:41 |
<hnowlan@deploy1003> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
09:41 |
<hnowlan@deploy1003> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |
09:38 |
<dcausse@deploy1003> |
dcausse: Continuing with sync |
[production] |
09:33 |
<dcausse@deploy1003> |
dcausse: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
09:30 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1179', diff saved to https://phabricator.wikimedia.org/P82289 and previous config saved to /var/cache/conftool/dbconfig/20250901-093039-ladsgroup.json |
[production] |
09:29 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P82288 and previous config saved to /var/cache/conftool/dbconfig/20250901-092934-fceratto.json |
[production] |
09:27 |
<dcausse@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] |
[production] |
09:25 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-main: apply |
[production] |
09:25 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-main: apply |
[production] |
09:24 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti3005.esams.wmnet with OS bookworm |
[production] |
09:24 |
<dcausse@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1183112|hCaptcha: Provide label/help in authmanagerinfo API calls (T403253)]] (duration: 16m 15s) |
[production] |
09:19 |
<dcausse@deploy1003> |
kharlan, dcausse: Continuing with sync |
[production] |
09:15 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1179 (T403362)', diff saved to https://phabricator.wikimedia.org/P82287 and previous config saved to /var/cache/conftool/dbconfig/20250901-091531-ladsgroup.json |
[production] |
09:14 |
<dcausse@deploy1003> |
kharlan, dcausse: Backport for [[gerrit:1183112|hCaptcha: Provide label/help in authmanagerinfo API calls (T403253)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
09:14 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P82286 and previous config saved to /var/cache/conftool/dbconfig/20250901-091427-fceratto.json |
[production] |
09:08 |
<dcausse@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1183112|hCaptcha: Provide label/help in authmanagerinfo API calls (T403253)]] |
[production] |
09:06 |
<dcausse@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1183279|Lift permission for event-organizer in Chinese Wikipedia (T403350)]] (duration: 14m 20s) |
[production] |
09:01 |
<dcausse@deploy1003> |
hamishz, dcausse: Continuing with sync |
[production] |
08:59 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2149 (T401906)', diff saved to https://phabricator.wikimedia.org/P82285 and previous config saved to /var/cache/conftool/dbconfig/20250901-085920-fceratto.json |
[production] |
08:58 |
<dcausse@deploy1003> |
hamishz, dcausse: Backport for [[gerrit:1183279|Lift permission for event-organizer in Chinese Wikipedia (T403350)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
08:52 |
<dcausse@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1183279|Lift permission for event-organizer in Chinese Wikipedia (T403350)]] |
[production] |
08:51 |
<jmm@cumin2002> |
START - Cookbook sre.postgresql.postgres-init |
[production] |
08:51 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2049.codfw.wmnet with reason: T402859 |
[production] |
08:46 |
<dcausse@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1182692|Revert "wikimaniawiki: update logo to 2025" (T403148)]], [[gerrit:1182798|Remove setting `wgEnablePartialActionBlocks`. (T280532)]] (duration: 12m 05s) |
[production] |
08:45 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depooling db2149 (T401906)', diff saved to https://phabricator.wikimedia.org/P82284 and previous config saved to /var/cache/conftool/dbconfig/20250901-084558-fceratto.json |
[production] |
08:45 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2149.codfw.wmnet with reason: Maintenance |
[production] |
08:42 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db1179 (T403362)', diff saved to https://phabricator.wikimedia.org/P82283 and previous config saved to /var/cache/conftool/dbconfig/20250901-084254-ladsgroup.json |
[production] |