2024-08-19
|
12:38 |
<pfischer@deploy1003> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:38 |
<pfischer@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:37 |
<pfischer@deploy1003> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:37 |
<pfischer@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:37 |
<pfischer@deploy1003> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:33 |
<fnegri@cumin1002> |
START - Cookbook sre.hosts.reimage for host clouddb1015.eqiad.wmnet with OS bookworm |
[production] |
12:31 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2154 (T367856)', diff saved to https://phabricator.wikimedia.org/P67382 and previous config saved to /var/cache/conftool/dbconfig/20240819-123119-marostegui.json |
[production] |
12:28 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
12:27 |
<fnegri@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on clouddb1015.eqiad.wmnet with reason: Reimaging clouddb1015 T365424 |
[production] |
12:27 |
<fnegri@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on clouddb1015.eqiad.wmnet with reason: Reimaging clouddb1015 T365424 |
[production] |
12:26 |
<fnegri@cumin1002> |
conftool action : set/pooled=no; selector: name=clouddb1015.eqiad.wmnet,service=s6 |
[production] |
12:26 |
<fnegri@cumin1002> |
conftool action : set/pooled=no; selector: name=clouddb1015.eqiad.wmnet,service=s4 |
[production] |
12:25 |
<dreamyjazz@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1063776|Enable temporary accounts on test2wiki (T371116)]] (duration: 22m 14s) |
[production] |
12:23 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
12:21 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
12:18 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
12:18 |
<dreamyjazz@deploy1003> |
dreamyjazz: Continuing with sync |
[production] |
12:17 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
12:17 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
12:16 |
<dreamyjazz@deploy1003> |
dreamyjazz: Backport for [[gerrit:1063776|Enable temporary accounts on test2wiki (T371116)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
12:14 |
<pfischer@deploy1003> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:11 |
<pfischer@deploy1003> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
12:03 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
12:03 |
<dreamyjazz@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1063776|Enable temporary accounts on test2wiki (T371116)]] |
[production] |
12:01 |
<kevinbazira@deploy1003> |
helmfile [ml-serve-eqiad] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
11:56 |
<Dreamy_Jazz> |
Started scanning script for ruwiki with timeout of 6h to catchup to monthly request limit |
[production] |
11:49 |
<Dreamy_Jazz> |
Restarted MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration |
[production] |
11:30 |
<kevinbazira@deploy1003> |
helmfile [ml-staging-codfw] 'sync' command on namespace 'recommendation-api-ng' for release 'main' . |
[production] |
11:27 |
<ayounsi@cumin1002> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox |
[production] |
10:49 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply |
[production] |
10:30 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary |
[production] |
10:29 |
<ayounsi@cumin1002> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary |
[production] |
10:14 |
<kevinbazira@deploy1003> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
10:10 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary |
[production] |
10:10 |
<ayounsi@cumin1002> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary |
[production] |
10:08 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P67378 and previous config saved to /var/cache/conftool/dbconfig/20240819-100847-root.json |
[production] |
09:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P67377 and previous config saved to /var/cache/conftool/dbconfig/20240819-095342-root.json |
[production] |
09:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P67376 and previous config saved to /var/cache/conftool/dbconfig/20240819-093836-root.json |
[production] |
09:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P67375 and previous config saved to /var/cache/conftool/dbconfig/20240819-092331-root.json |
[production] |
09:17 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
09:16 |
<brouberol@deploy1003> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
09:08 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P67374 and previous config saved to /var/cache/conftool/dbconfig/20240819-090825-root.json |
[production] |
09:07 |
<ayounsi@cumin1002> |
END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary |
[production] |
09:06 |
<ayounsi@cumin1002> |
START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary |
[production] |
08:53 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 5%: Repooling', diff saved to https://phabricator.wikimedia.org/P67373 and previous config saved to /var/cache/conftool/dbconfig/20240819-085320-root.json |
[production] |
08:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2136 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P67372 and previous config saved to /var/cache/conftool/dbconfig/20240819-083814-root.json |
[production] |
08:35 |
<marostegui> |
Upgrade db2136 to 10.11.9 T372551 |
[production] |
08:35 |
<marostegui@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2136.codfw.wmnet with reason: Upgrade to 10.11.9 |
[production] |
08:35 |
<marostegui@cumin1002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on db2136.codfw.wmnet with reason: Upgrade to 10.11.9 |
[production] |
08:34 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'Depool db2136', diff saved to https://phabricator.wikimedia.org/P67371 and previous config saved to /var/cache/conftool/dbconfig/20240819-083439-root.json |
[production] |