| 2024-05-28 |
    
  | 10:34 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1243 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P63418 and previous config saved to /var/cache/conftool/dbconfig/20240528-103428-root.json | [production] | 
            
  | 10:33 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2139.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 10:32 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 6:00:00 on db2139.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 10:23 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db2216.codfw.wmnet | [production] | 
            
  | 10:22 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1219 (re)pooling @ 25%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63417 and previous config saved to /var/cache/conftool/dbconfig/20240528-102259-arnaudb.json | [production] | 
            
  | 10:21 | <jiji@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc2049.codfw.wmnet with OS bookworm | [production] | 
            
  | 10:18 | <sfaci@deploy1002> | helmfile [staging] DONE helmfile.d/services/device-analytics: apply | [production] | 
            
  | 10:08 | <sfaci@deploy1002> | helmfile [staging] START helmfile.d/services/device-analytics: apply | [production] | 
            
  | 10:07 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1219 (re)pooling @ 10%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63416 and previous config saved to /var/cache/conftool/dbconfig/20240528-100752-arnaudb.json | [production] | 
            
  | 10:05 | <jmm@cumin2002> | START - Cookbook sre.puppet.migrate-host for host db2216.codfw.wmnet | [production] | 
            
  | 10:02 | <moritzm> | installing jinja2 security updates | [production] | 
            
  | 09:57 | <jiji@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2049.codfw.wmnet with reason: host reimage | [production] | 
            
  | 09:54 | <jiji@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on mc2049.codfw.wmnet with reason: host reimage | [production] | 
            
  | 09:50 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1228 (re)pooling @ 100%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63415 and previous config saved to /var/cache/conftool/dbconfig/20240528-095058-arnaudb.json | [production] | 
            
  | 09:49 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db2212.codfw.wmnet | [production] | 
            
  | 09:46 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1219.eqiad.wmnet with OS bookworm | [production] | 
            
  | 09:45 | <stevemunene@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 09:45 | <stevemunene@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:43 | <stevemunene@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 09:43 | <stevemunene@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:39 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db1243.eqiad.wmnet with reason: unknown lag | [production] | 
            
  | 09:39 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 3:00:00 on db1243.eqiad.wmnet with reason: unknown lag | [production] | 
            
  | 09:38 | <jmm@cumin2002> | START - Cookbook sre.puppet.migrate-host for host db2212.codfw.wmnet | [production] | 
            
  | 09:35 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1228 (re)pooling @ 75%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63414 and previous config saved to /var/cache/conftool/dbconfig/20240528-093552-arnaudb.json | [production] | 
            
  | 09:35 | <zabe@deploy1002> | Finished scap: Backport for [[gerrit:1036573|Stop writing to af_user(_text)/afh_user(_text) everywhere (T337920)]], [[gerrit:1036586|Update interwiki cache]] (duration: 17m 49s) | [production] | 
            
  | 09:35 | <jiji@cumin2002> | START - Cookbook sre.hosts.reimage for host mc2049.codfw.wmnet with OS bookworm | [production] | 
            
  | 09:34 | <jiji@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts mc2049.codfw.wmnet | [production] | 
            
  | 09:33 | <jiji@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts mc2049.codfw.wmnet | [production] | 
            
  | 09:33 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2209 (T364299)', diff saved to https://phabricator.wikimedia.org/P63413 and previous config saved to /var/cache/conftool/dbconfig/20240528-093344-marostegui.json | [production] | 
            
  | 09:24 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1219.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 09:21 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on db1219.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 09:21 | <zabe@deploy1002> | zabe: Continuing with sync | [production] | 
            
  | 09:20 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1228 (re)pooling @ 50%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63412 and previous config saved to /var/cache/conftool/dbconfig/20240528-092046-arnaudb.json | [production] | 
            
  | 09:20 | <zabe@deploy1002> | zabe: Backport for [[gerrit:1036573|Stop writing to af_user(_text)/afh_user(_text) everywhere (T337920)]], [[gerrit:1036586|Update interwiki cache]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 09:18 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2209', diff saved to https://phabricator.wikimedia.org/P63411 and previous config saved to /var/cache/conftool/dbconfig/20240528-091836-marostegui.json | [production] | 
            
  | 09:17 | <zabe@deploy1002> | Started scap: Backport for [[gerrit:1036573|Stop writing to af_user(_text)/afh_user(_text) everywhere (T337920)]], [[gerrit:1036586|Update interwiki cache]] | [production] | 
            
  | 09:14 | <jelto@cumin1002> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for gerrit1003.wikimedia.org | [production] | 
            
  | 09:14 | <jelto@cumin1002> | START - Cookbook sre.hosts.remove-downtime for gerrit1003.wikimedia.org | [production] | 
            
  | 09:14 | <jelto@cumin1002> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for gerrit2002.wikimedia.org | [production] | 
            
  | 09:14 | <jelto@cumin1002> | START - Cookbook sre.hosts.remove-downtime for gerrit2002.wikimedia.org | [production] | 
            
  | 09:13 | <zabe> | zabe@mwmaint1002:~$ mwscript extensions/CirrusSearch/maintenance/UpdateSearchIndexConfig.php --wiki=dtpwiki --cluster=all 2>&1 | tee /tmp/dtpwiki.UpdateSearchIndexConfig.log # T365220 | [production] | 
            
  | 09:09 | <zabe@deploy1002> | Finished scap: T365220 (duration: 19m 22s) | [production] | 
            
  | 09:08 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host db1219.eqiad.wmnet with OS bookworm | [production] | 
            
  | 09:07 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depool db1219 T364290', diff saved to https://phabricator.wikimedia.org/P63410 and previous config saved to /var/cache/conftool/dbconfig/20240528-090724-arnaudb.json | [production] | 
            
  | 09:05 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1228 (re)pooling @ 25%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63409 and previous config saved to /var/cache/conftool/dbconfig/20240528-090538-arnaudb.json | [production] | 
            
  | 09:03 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2209', diff saved to https://phabricator.wikimedia.org/P63408 and previous config saved to /var/cache/conftool/dbconfig/20240528-090328-marostegui.json | [production] | 
            
  | 09:03 | <jiji@cumin2002> | END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host mc2049.codfw.wmnet | [production] | 
            
  | 08:58 | <jiji@cumin2002> | START - Cookbook sre.hosts.reboot-single for host mc2049.codfw.wmnet | [production] | 
            
  | 08:55 | <zabe@deploy1002> | zabe: Continuing with sync | [production] | 
            
  | 08:54 | <zabe@deploy1002> | zabe: T365220 synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] |