| 2023-09-13
      
      ยง | 
    
  | 15:01 | <akosiaris@deploy1002> | helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply | [production] | 
            
  | 15:01 | <hnowlan> | repooling cp2037 and enabling puppet on A:cp | [production] | 
            
  | 14:56 | <akosiaris@deploy1002> | helmfile [codfw] START helmfile.d/services/machinetranslation: apply | [production] | 
            
  | 14:55 | <akosiaris@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply | [production] | 
            
  | 14:52 | <hnowlan> | disable puppet on A:cp | [production] | 
            
  | 14:51 | <hnowlan> | depooled service=ats-be,name=cp2037.codfw.wmnet | [production] | 
            
  | 14:51 | <jayme> | updated kubernetes-* packages fleet wide to 1.23.14-3 - T329826 | [production] | 
            
  | 14:50 | <akosiaris@deploy1002> | helmfile [eqiad] START helmfile.d/services/machinetranslation: apply | [production] | 
            
  | 14:41 | <akosiaris@deploy1002> | helmfile [staging] DONE helmfile.d/services/machinetranslation: apply | [production] | 
            
  | 14:39 | <akosiaris@deploy1002> | helmfile [staging] START helmfile.d/services/machinetranslation: apply | [production] | 
            
  | 14:36 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on sretest1001.eqiad.wmnet with reason: WIP towards puppetised nftables firewall | [production] | 
            
  | 14:36 | <jmm@cumin2002> | START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on sretest1001.eqiad.wmnet with reason: WIP towards puppetised nftables firewall | [production] | 
            
  | 14:31 | <ayounsi@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS bullseye | [production] | 
            
  | 14:29 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:29 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:26 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:25 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:17 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:17 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:10 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:10 | <bking@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/rdf-streaming-updater: apply | [production] | 
            
  | 14:08 | <hnowlan> | stopping cassandra on restbase1030-c | [production] | 
            
  | 13:52 | <jmm@cumin2002> | START - Cookbook sre.aqs.roll-restart-reboot rolling reboot on A:aqs-codfw | [production] | 
            
  | 13:34 | <Lucas_WMDE> | UTC afternoon backport+config window done | [production] | 
            
  | 13:34 | <lucaswerkmeister-wmde@deploy1002> | Finished scap: Backport for [[gerrit:956818|rdbms: Use `debugSql` instead of `debugDumpSql` which is unuset (T318272)]] (duration: 15m 42s) | [production] | 
            
  | 13:27 | <lucaswerkmeister-wmde@deploy1002> | lucaswerkmeister-wmde and d3r1ck01: Continuing with sync | [production] | 
            
  | 13:20 | <lucaswerkmeister-wmde@deploy1002> | lucaswerkmeister-wmde and d3r1ck01: Backport for [[gerrit:956818|rdbms: Use `debugSql` instead of `debugDumpSql` which is unuset (T318272)]] synced to the testservers mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) | [production] | 
            
  | 13:18 | <lucaswerkmeister-wmde@deploy1002> | Started scap: Backport for [[gerrit:956818|rdbms: Use `debugSql` instead of `debugDumpSql` which is unuset (T318272)]] | [production] | 
            
  | 12:23 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'db1174 (re)pooling @ 100%: Maint over', diff saved to https://phabricator.wikimedia.org/P52499 and previous config saved to /var/cache/conftool/dbconfig/20230913-122323-ladsgroup.json | [production] | 
            
  | 12:17 | <godog> | pool only titan hosts for thanos-web and thanos-query services - T341488 | [production] | 
            
  | 12:08 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'db1174 (re)pooling @ 75%: Maint over', diff saved to https://phabricator.wikimedia.org/P52498 and previous config saved to /var/cache/conftool/dbconfig/20230913-120818-ladsgroup.json | [production] | 
            
  | 11:53 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'db1174 (re)pooling @ 25%: Maint over', diff saved to https://phabricator.wikimedia.org/P52497 and previous config saved to /var/cache/conftool/dbconfig/20230913-115314-ladsgroup.json | [production] | 
            
  | 11:30 | <hnowlan@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:29 | <hnowlan@deploy1002> | helmfile [eqiad] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:27 | <hnowlan@deploy1002> | helmfile [codfw] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:26 | <hnowlan@deploy1002> | helmfile [codfw] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:25 | <hnowlan@deploy1002> | helmfile [staging] DONE helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:24 | <hnowlan@deploy1002> | helmfile [staging] START helmfile.d/services/thumbor: apply | [production] | 
            
  | 11:19 | <hnowlan@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply | [production] | 
            
  | 11:18 | <hnowlan@deploy1002> | helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply | [production] | 
            
  | 11:18 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1174 (T343198)', diff saved to https://phabricator.wikimedia.org/P52495 and previous config saved to /var/cache/conftool/dbconfig/20230913-111834-arnaudb.json | [production] | 
            
  | 11:17 | <hnowlan@deploy1002> | helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply | [production] | 
            
  | 11:16 | <hnowlan@deploy1002> | helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply | [production] | 
            
  | 11:15 | <hnowlan@deploy1002> | helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply | [production] | 
            
  | 11:15 | <hnowlan@deploy1002> | helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply | [production] | 
            
  | 11:11 | <filippo@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host titan2002.codfw.wmnet with OS bookworm | [production] | 
            
  | 10:54 | <filippo@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on titan2002.codfw.wmnet with reason: host reimage | [production] | 
            
  | 10:51 | <filippo@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on titan2002.codfw.wmnet with reason: host reimage | [production] | 
            
  | 10:49 | <jayme> | imported kubernetes_1.23.14-3 to bullseye-wikimedia component/kubernetes123 - T329826 | [production] | 
            
  | 10:46 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2105.codfw.wmnet with reason: Maintenance | [production] |