2023-12-16
§
|
22:01 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.grid.cleanup_queue_errors (exit_code=0) |
[tools] |
22:01 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.grid.cleanup_queue_errors |
[tools] |
20:54 |
<bd808> |
Rebuilding all containers to pick up lighttpd config fix and normal package updates (T293552) |
[tools] |
08:14 |
<dhinus> |
restarting toolsdb with jemalloc |
[tools] |
05:32 |
<andrewbogott> |
restarting mariadb on toolsdb-1 because it's just about to go oom (or possibly just did) |
[tools] |
01:21 |
<eevans@deploy2002> |
Finished deploy [cassandra/logstash-logback-encoder@fb10de1]: (no justification provided) (duration: 00m 10s) |
[production] |
01:21 |
<eevans@deploy2002> |
Started deploy [cassandra/logstash-logback-encoder@fb10de1]: (no justification provided) |
[production] |
00:44 |
<htriedman@deploy2002> |
Finished deploy [airflow-dags/platform_eng@63804c4]: (no justification provided) (duration: 00m 25s) |
[production] |
00:44 |
<htriedman@deploy2002> |
Started deploy [airflow-dags/platform_eng@63804c4]: (no justification provided) |
[production] |
00:21 |
<dhinus> |
restarting toolsdb again as it's again low in free mem T353093 |
[tools] |
00:05 |
<jhathaway> |
unbreaking my puppet change with, https://gerrit.wikimedia.org/r/c/operations/puppet/+/983504 |
[production] |
2023-12-15
§
|
23:46 |
<htriedman@deploy2002> |
Finished deploy [airflow-dags/platform_eng@9600237]: (no justification provided) (duration: 00m 27s) |
[production] |
23:46 |
<htriedman@deploy2002> |
Started deploy [airflow-dags/platform_eng@9600237]: (no justification provided) |
[production] |
23:06 |
<milimetric@deploy2002> |
Finished deploy [airflow-dags/platform_eng@160d0f0]: (no justification provided) (duration: 00m 25s) |
[production] |
23:06 |
<milimetric@deploy2002> |
Started deploy [airflow-dags/platform_eng@160d0f0]: (no justification provided) |
[production] |
22:42 |
<pfischer@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
22:42 |
<pfischer@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
22:03 |
<htriedman@deploy2002> |
Finished deploy [airflow-dags/platform_eng@5090fdc]: (no justification provided) (duration: 00m 25s) |
[production] |
22:03 |
<htriedman@deploy2002> |
Started deploy [airflow-dags/platform_eng@5090fdc]: (no justification provided) |
[production] |
21:48 |
<milimetric@deploy2002> |
Finished deploy [analytics/refinery@eeb98ac] (thin): Syncing changes to HDFS (duration: 00m 06s) |
[production] |
21:48 |
<milimetric@deploy2002> |
Started deploy [analytics/refinery@eeb98ac] (thin): Syncing changes to HDFS |
[production] |
21:48 |
<milimetric@deploy2002> |
Finished deploy [analytics/refinery@eeb98ac]: Syncing changes to HDFS (duration: 81m 46s) |
[production] |
21:26 |
<mutante> |
running puppet on all prometheus* |
[production] |
20:43 |
<dancy> |
Rebooting gitlab-runner-1002.devtools. It was overloaded by a quibble job. |
[releng] |
20:26 |
<milimetric@deploy2002> |
Started deploy [analytics/refinery@eeb98ac]: Syncing changes to HDFS |
[production] |
20:26 |
<andrewbogott> |
restarting toolsdb to avoid upcoming oom crash |
[tools] |
16:49 |
<dhinus> |
restarting toolsdb before it's about to go OOM, enabling performance_schema for debugging |
[tools] |
15:44 |
<isaranto@deploy2002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
15:25 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . |
[production] |
15:01 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
15:00 |
<klausman@deploy2002> |
helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. |
[production] |
14:46 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
14:46 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2112 (re)pooling @ 100%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54482 and previous config saved to /var/cache/conftool/dbconfig/20231215-144624-arnaudb.json |
[production] |
14:46 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
14:45 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
14:44 |
<brouberol@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply |
[production] |
14:40 |
<dcausse@deploy2002> |
helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
14:40 |
<dcaro> |
deploy toolforge-builds-cli 0.0.10 (T341067) |
[tools] |
14:39 |
<dcausse@deploy2002> |
helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
14:38 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 100%: candidate master proper repooling', diff saved to https://phabricator.wikimedia.org/P54481 and previous config saved to /var/cache/conftool/dbconfig/20231215-143812-arnaudb.json |
[production] |
14:31 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2112 (re)pooling @ 80%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54480 and previous config saved to /var/cache/conftool/dbconfig/20231215-143118-arnaudb.json |
[production] |
14:27 |
<klausman@deploy2002> |
helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. |
[production] |