| 2024-05-28 |
    
  | 16:33 | <sfaci@deploy1002> | helmfile [staging] START helmfile.d/services/device-analytics: apply | [production] | 
            
  | 16:21 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1207 (re)pooling @ 75%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63454 and previous config saved to /var/cache/conftool/dbconfig/20240528-162141-arnaudb.json | [production] | 
            
  | 16:18 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P63453 and previous config saved to /var/cache/conftool/dbconfig/20240528-161845-marostegui.json | [production] | 
            
  | 16:17 | <ladsgroup@deploy1002> | Finished scap: Backport for [[gerrit:1035361|x-wikimedia-debug: add datacenter options for k8s (T365478)]] (duration: 12m 00s) | [production] | 
            
  | 16:14 | <Lucas_WMDE> | lucaswerkmeister-wmde@stat1011:~$ sudo -u analytics-wmde rm -rf /srv/analytics-wmde/wdcm/ # T364965; contained src/ as a clean git clone as of c2b0a324e9 / I024691a148, and nothing else | [production] | 
            
  | 16:12 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1206.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:12 | <hnowlan> | kubectl node uncordon wikikube-worker2002.codfw.wmnet | [production] | 
            
  | 16:11 | <hnowlan@cumin1002> | conftool action : set/pooled=yes:weight=10; selector: name=wikikube-worker2002.codfw.wmnet | [production] | 
            
  | 16:10 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1206.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:10 | <dani@deploy1002> | helmfile [codfw] DONE helmfile.d/services/miscweb: apply | [production] | 
            
  | 16:09 | <dani@deploy1002> | helmfile [codfw] START helmfile.d/services/miscweb: apply | [production] | 
            
  | 16:09 | <dani@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/miscweb: apply | [production] | 
            
  | 16:08 | <dani@deploy1002> | helmfile [eqiad] START helmfile.d/services/miscweb: apply | [production] | 
            
  | 16:08 | <dani@deploy1002> | helmfile [staging] DONE helmfile.d/services/miscweb: apply | [production] | 
            
  | 16:08 | <dani@deploy1002> | helmfile [staging] START helmfile.d/services/miscweb: apply | [production] | 
            
  | 16:08 | <ladsgroup@deploy1002> | ladsgroup and jiji: Continuing with sync | [production] | 
            
  | 16:08 | <ladsgroup@deploy1002> | ladsgroup and jiji: Backport for [[gerrit:1035361|x-wikimedia-debug: add datacenter options for k8s (T365478)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 16:06 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1207 (re)pooling @ 50%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63452 and previous config saved to /var/cache/conftool/dbconfig/20240528-160635-arnaudb.json | [production] | 
            
  | 16:05 | <ladsgroup@deploy1002> | Started scap: Backport for [[gerrit:1035361|x-wikimedia-debug: add datacenter options for k8s (T365478)]] | [production] | 
            
  | 16:03 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1170 (T364299)', diff saved to https://phabricator.wikimedia.org/P63451 and previous config saved to /var/cache/conftool/dbconfig/20240528-160337-marostegui.json | [production] | 
            
  | 16:01 | <ladsgroup@deploy1002> | Finished scap: Backport for [[gerrit:1036633|Create electionadmin group on testwiki (T209892)]] (duration: 17m 48s) | [production] | 
            
  | 16:00 | <cdanis@deploy1002> | helmfile [eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 16:00 | <cdanis@deploy1002> | helmfile [eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 15:55 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host db1206.eqiad.wmnet with OS bookworm | [production] | 
            
  | 15:53 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db1206.eqiad.wmnet with reason: reimage | [production] | 
            
  | 15:53 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 3:00:00 on db1206.eqiad.wmnet with reason: reimage | [production] | 
            
  | 15:53 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depool db1206 T364290', diff saved to https://phabricator.wikimedia.org/P63449 and previous config saved to /var/cache/conftool/dbconfig/20240528-155309-arnaudb.json | [production] | 
            
  | 15:52 | <ejegg> | fundraising civicrm upgraded from 4dd78bcc to 3fee95bc | [production] | 
            
  | 15:51 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1207 (re)pooling @ 25%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63448 and previous config saved to /var/cache/conftool/dbconfig/20240528-155129-arnaudb.json | [production] | 
            
  | 15:50 | <hnowlan> | ran `sudo puppet node deactivate  kubernetes2032.codfw.wmnet` to fix renamed host erroring in scap | [production] | 
            
  | 15:48 | <ladsgroup@deploy1002> | tstarling and ladsgroup: Continuing with sync | [production] | 
            
  | 15:48 | <ladsgroup@deploy1002> | tstarling and ladsgroup: Backport for [[gerrit:1036633|Create electionadmin group on testwiki (T209892)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 15:45 | <sukhe> | sudo cumin -b1 -s120 'A:dnsbox and not P{dns6001*}' 'run-puppet-agent --enable "merging CR 1034476"' | [production] | 
            
  | 15:45 | <brennen@deploy1002> | Finished deploy [phabricator/deployment@e7093e2]: deploy phab1004 for T366075 (duration: 00m 32s) | [production] | 
            
  | 15:44 | <brennen@deploy1002> | Started deploy [phabricator/deployment@e7093e2]: deploy phab1004 for T366075 | [production] | 
            
  | 15:44 | <brennen@deploy1002> | Finished deploy [phabricator/deployment@e7093e2]: deploy phab2002 for T366075 (duration: 00m 33s) | [production] | 
            
  | 15:44 | <ladsgroup@deploy1002> | Started scap: Backport for [[gerrit:1036633|Create electionadmin group on testwiki (T209892)]] | [production] | 
            
  | 15:43 | <brennen@deploy1002> | Started deploy [phabricator/deployment@e7093e2]: deploy phab2002 for T366075 | [production] | 
            
  | 15:41 | <dzahn@cumin1002> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:30:00 on phabricator.wikimedia.org with reason: phabricator deploy | [production] | 
            
  | 15:41 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on phabricator.wikimedia.org with reason: phabricator deploy | [production] | 
            
  | 15:40 | <dzahn@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab.wmfusercontent.org with reason: phabricator deploy | [production] | 
            
  | 15:40 | <jiji@deploy1002> | Unlocked for deployment [ALL REPOSITORIES]: Kubernetes masters trouble - no deployments - serviceops (duration: 114m 39s) | [production] | 
            
  | 15:40 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on phab.wmfusercontent.org with reason: phabricator deploy | [production] | 
            
  | 15:39 | <dzahn@cumin1002> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:30:00 on phabricator.wikimedia.org with reason: phabricator deploy | [production] | 
            
  | 15:39 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on phabricator.wikimedia.org with reason: phabricator deploy | [production] | 
            
  | 15:38 | <dzahn@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab2002.codfw.wmnet with reason: phabricator deploy | [production] | 
            
  | 15:38 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on phab2002.codfw.wmnet with reason: phabricator deploy | [production] | 
            
  | 15:38 | <dzahn@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab1004.eqiad.wmnet with reason: phabricator deploy | [production] | 
            
  | 15:38 | <dzahn@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on phab1004.eqiad.wmnet with reason: phabricator deploy | [production] | 
            
  | 15:36 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1207 (re)pooling @ 10%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63447 and previous config saved to /var/cache/conftool/dbconfig/20240528-153622-arnaudb.json | [production] |