| 2023-12-15
      
      § | 
    
  | 21:48 | <milimetric@deploy2002> | Finished deploy [analytics/refinery@eeb98ac]: Syncing changes to HDFS (duration: 81m 46s) | [production] | 
            
  | 21:26 | <mutante> | running puppet on all prometheus* | [production] | 
            
  | 20:26 | <milimetric@deploy2002> | Started deploy [analytics/refinery@eeb98ac]: Syncing changes to HDFS | [production] | 
            
  | 15:44 | <isaranto@deploy2002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . | [production] | 
            
  | 15:25 | <klausman@deploy2002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . | [production] | 
            
  | 15:01 | <klausman@deploy2002> | helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 15:00 | <klausman@deploy2002> | helmfile [ml-serve-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 14:46 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 14:46 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2112 (re)pooling @ 100%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54482 and previous config saved to /var/cache/conftool/dbconfig/20231215-144624-arnaudb.json | [production] | 
            
  | 14:46 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 14:45 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 14:44 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 14:40 | <dcausse@deploy2002> | helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 14:39 | <dcausse@deploy2002> | helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 14:38 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2179 (re)pooling @ 100%: candidate master proper repooling', diff saved to https://phabricator.wikimedia.org/P54481 and previous config saved to /var/cache/conftool/dbconfig/20231215-143812-arnaudb.json | [production] | 
            
  | 14:31 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2112 (re)pooling @ 80%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54480 and previous config saved to /var/cache/conftool/dbconfig/20231215-143118-arnaudb.json | [production] | 
            
  | 14:27 | <klausman@deploy2002> | helmfile [ml-serve-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 14:27 | <arnaudb@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 20 days, 0:00:00 on db2194.codfw.wmnet with reason: production freeze will occur before cookbook is finished | [production] | 
            
  | 14:27 | <klausman@deploy2002> | helmfile [ml-serve-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 14:27 | <arnaudb@cumin1001> | START - Cookbook sre.hosts.downtime for 20 days, 0:00:00 on db2194.codfw.wmnet with reason: production freeze will occur before cookbook is finished | [production] | 
            
  | 14:23 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2179 (re)pooling @ 75%: candidate master proper repooling', diff saved to https://phabricator.wikimedia.org/P54479 and previous config saved to /var/cache/conftool/dbconfig/20231215-142307-arnaudb.json | [production] | 
            
  | 14:16 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2112 (re)pooling @ 40%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54478 and previous config saved to /var/cache/conftool/dbconfig/20231215-141613-arnaudb.json | [production] | 
            
  | 14:08 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2179 (re)pooling @ 50%: candidate master proper repooling', diff saved to https://phabricator.wikimedia.org/P54477 and previous config saved to /var/cache/conftool/dbconfig/20231215-140802-arnaudb.json | [production] | 
            
  | 14:07 | <klausman@deploy2002> | helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 14:07 | <klausman@deploy2002> | helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. | [production] | 
            
  | 14:01 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2112 (re)pooling @ 20%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54476 and previous config saved to /var/cache/conftool/dbconfig/20231215-140108-arnaudb.json | [production] | 
            
  | 13:54 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 13:53 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 13:52 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2179 (re)pooling @ 25%: candidate master proper repooling', diff saved to https://phabricator.wikimedia.org/P54475 and previous config saved to /var/cache/conftool/dbconfig/20231215-135257-arnaudb.json | [production] | 
            
  | 13:52 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'depool db2179 to repool w/ api', diff saved to https://phabricator.wikimedia.org/P54474 and previous config saved to /var/cache/conftool/dbconfig/20231215-135228-arnaudb.json | [production] | 
            
  | 13:46 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'db2112 (re)pooling @ 10%: candidate master repooling', diff saved to https://phabricator.wikimedia.org/P54473 and previous config saved to /var/cache/conftool/dbconfig/20231215-134603-arnaudb.json | [production] | 
            
  | 13:39 | <jelto@cumin1001> | END (FAIL) - Cookbook sre.gitlab.upgrade (exit_code=99) on GitLab host gitlab1004.wikimedia.org with reason: Test upgrade GitLab Replica with insufficient API key | [production] | 
            
  | 13:39 | <jelto@cumin1001> | START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Test upgrade GitLab Replica with insufficient API key | [production] | 
            
  | 12:55 | <btullis@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 12:55 | <btullis@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/spark-history: apply | [production] | 
            
  | 12:25 | <hashar@deploy2002> | Finished deploy [integration/docroot@7f6c112]: doc: add integration/tox-jenkins-override - T353515 (duration: 00m 06s) | [production] | 
            
  | 12:25 | <hashar@deploy2002> | Started deploy [integration/docroot@7f6c112]: doc: add integration/tox-jenkins-override - T353515 | [production] | 
            
  | 11:28 | <hashar@deploy2002> | Finished deploy [gerrit/gerrit@304c63a]: wm-pcc: only act on Puppet repositories - T353181 (duration: 00m 08s) | [production] | 
            
  | 11:28 | <hashar@deploy2002> | Started deploy [gerrit/gerrit@304c63a]: wm-pcc: only act on Puppet repositories - T353181 | [production] | 
            
  | 10:56 | <isaranto@deploy2002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . | [production] | 
            
  | 10:54 | <isaranto@deploy2002> | helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'llm' for release 'main' . | [production] | 
            
  | 10:52 | <isaranto@deploy2002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'llm' for release 'main' . | [production] | 
            
  | 09:05 | <moritzm> | installing Linux 6.1.67 packages on Bookworm hosts | [production] | 
            
  | 08:56 | <XioNoX> | shutdown already down IPv6 BGP session from ulsfo to the office | [production] |