|
2026-01-13
§
|
| 00:35 |
<dani@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
| 00:35 |
<dani@deploy2002> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
| 00:35 |
<dani@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/miscweb: apply |
[production] |
| 00:35 |
<dani@deploy2002> |
helmfile [eqiad] START helmfile.d/services/miscweb: apply |
[production] |
| 00:35 |
<dani@deploy2002> |
helmfile [staging] DONE helmfile.d/services/miscweb: apply |
[production] |
| 00:35 |
<dani@deploy2002> |
helmfile [staging] START helmfile.d/services/miscweb: apply |
[production] |
| 00:31 |
<swfrench@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply |
[production] |
| 00:31 |
<swfrench@deploy2002> |
helmfile [codfw] START helmfile.d/services/shellbox-video: apply |
[production] |
| 00:30 |
<swfrench@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply |
[production] |
| 00:30 |
<swfrench@deploy2002> |
helmfile [codfw] START helmfile.d/services/shellbox-video: apply |
[production] |
| 00:29 |
<swfrench@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply |
[production] |
| 00:28 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1247 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87445 and previous config saved to /var/cache/conftool/dbconfig/20260113-002853-marostegui.json |
[production] |
| 00:09 |
<swfrench@deploy2002> |
helmfile [codfw] START helmfile.d/services/shellbox-video: apply |
[production] |
|
2026-01-12
§
|
| 23:52 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance |
[production] |
| 23:52 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1251 (T413525)', diff saved to https://phabricator.wikimedia.org/P87444 and previous config saved to /var/cache/conftool/dbconfig/20260112-235209-marostegui.json |
[production] |
| 23:42 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P87443 and previous config saved to /var/cache/conftool/dbconfig/20260112-234201-marostegui.json |
[production] |
| 23:38 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2245 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87442 and previous config saved to /var/cache/conftool/dbconfig/20260112-233850-marostegui.json |
[production] |
| 23:38 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2245.codfw.wmnet with reason: Maintenance |
[production] |
| 23:38 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2240 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87441 and previous config saved to /var/cache/conftool/dbconfig/20260112-233825-marostegui.json |
[production] |
| 23:31 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P87440 and previous config saved to /var/cache/conftool/dbconfig/20260112-233152-marostegui.json |
[production] |
| 23:30 |
<cjming@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1225678|Revert^2 "Deploy TestKitchen to Beta Cluster"]] (duration: 06m 14s) |
[production] |
| 23:28 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2240', diff saved to https://phabricator.wikimedia.org/P87439 and previous config saved to /var/cache/conftool/dbconfig/20260112-232817-marostegui.json |
[production] |
| 23:26 |
<cjming@deploy2002> |
cjming: Continuing with sync |
[production] |
| 23:26 |
<cjming@deploy2002> |
cjming: Backport for [[gerrit:1225678|Revert^2 "Deploy TestKitchen to Beta Cluster"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 23:24 |
<cjming@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1225678|Revert^2 "Deploy TestKitchen to Beta Cluster"]] |
[production] |
| 23:21 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1251 (T413525)', diff saved to https://phabricator.wikimedia.org/P87438 and previous config saved to /var/cache/conftool/dbconfig/20260112-232144-marostegui.json |
[production] |
| 23:20 |
<cjming@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1225675|Revert to `product_metrics` schemas and use `default` as the coordinator value (T407901)]] (duration: 06m 13s) |
[production] |
| 23:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2240', diff saved to https://phabricator.wikimedia.org/P87437 and previous config saved to /var/cache/conftool/dbconfig/20260112-231809-marostegui.json |
[production] |
| 23:16 |
<cjming@deploy2002> |
cjming: Continuing with sync |
[production] |
| 23:16 |
<cjming@deploy2002> |
cjming: Backport for [[gerrit:1225675|Revert to `product_metrics` schemas and use `default` as the coordinator value (T407901)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 23:14 |
<cjming@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1225675|Revert to `product_metrics` schemas and use `default` as the coordinator value (T407901)]] |
[production] |
| 23:08 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2240 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87436 and previous config saved to /var/cache/conftool/dbconfig/20260112-230801-marostegui.json |
[production] |
| 22:56 |
<cjming@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1225687|tests: skip test when WebAuthn is not loaded (T407797)]] (duration: 06m 25s) |
[production] |
| 22:52 |
<cjming@deploy2002> |
cjming, zabe: Continuing with sync |
[production] |
| 22:51 |
<cjming@deploy2002> |
cjming, zabe: Backport for [[gerrit:1225687|tests: skip test when WebAuthn is not loaded (T407797)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 22:50 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db1251 (T413525)', diff saved to https://phabricator.wikimedia.org/P87435 and previous config saved to /var/cache/conftool/dbconfig/20260112-225015-marostegui.json |
[production] |
| 22:50 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1251.eqiad.wmnet with reason: Maintenance |
[production] |
| 22:50 |
<cjming@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1225687|tests: skip test when WebAuthn is not loaded (T407797)]] |
[production] |
| 22:50 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster |
[production] |
| 22:47 |
<ryankemper@cumin2002> |
END (PASS) - Cookbook sre.hadoop.reboot-workers (exit_code=0) for Hadoop analytics cluster |
[production] |
| 22:47 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster |
[production] |
| 22:41 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.hadoop.reboot-workers (exit_code=99) for Hadoop analytics cluster |
[production] |
| 22:41 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster |
[production] |
| 22:39 |
<ryankemper@cumin2002> |
END (FAIL) - Cookbook sre.hadoop.reboot-workers (exit_code=99) for Hadoop analytics cluster |
[production] |
| 22:39 |
<ryankemper@cumin2002> |
START - Cookbook sre.hadoop.reboot-workers for Hadoop analytics cluster |
[production] |
| 22:34 |
<cjming@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1225687|tests: skip test when WebAuthn is not loaded (T407797)]] |
[production] |
| 22:24 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1240.eqiad.wmnet with reason: Maintenance |
[production] |
| 22:15 |
<rzl> |
apt1002# reprepro --noskipold --restrict vopsbot update bookworm-wikimedia |
[production] |
| 22:01 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2216 (T413525)', diff saved to https://phabricator.wikimedia.org/P87434 and previous config saved to /var/cache/conftool/dbconfig/20260112-220104-marostegui.json |
[production] |
| 21:57 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1239.eqiad.wmnet with reason: Maintenance |
[production] |