| 2024-05-28
      
      § | 
    
  | 14:24 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1218 (re)pooling @ 100%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63444 and previous config saved to /var/cache/conftool/dbconfig/20240528-142431-arnaudb.json | [production] | 
            
  | 14:21 | <akosiaris@cumin1002> | conftool action : set/weight=10; selector: service=kubemaster,dc=eqiad,cluster=kubernetes,name=kubemaster1002.eqiad.wmnet | [production] | 
            
  | 14:19 | <akosiaris> | add another 4 vcpus to kubemaster1002 | [production] | 
            
  | 14:17 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) | [admin] | 
            
  | 14:16 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 14:11 | <akosiaris> | restart kube-apiserver on kubemaster1002 | [production] | 
            
  | 14:09 | <akosiaris@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mw-api-int: sync | [production] | 
            
  | 14:09 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1218 (re)pooling @ 75%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63442 and previous config saved to /var/cache/conftool/dbconfig/20240528-140925-arnaudb.json | [production] | 
            
  | 14:08 | <akosiaris@cumin1002> | conftool action : set/weight=1; selector: service=kubemaster,dc=eqiad,cluster=kubernetes,name=kubemaster1002.eqiad.wmnet | [production] | 
            
  | 14:07 | <andrew@cloudcumin1001> | END (FAIL) - Cookbook wmcs.openstack.restart_openstack (exit_code=99) | [admin] | 
            
  | 14:07 | <akosiaris@cumin1002> | conftool action : set/weight=5; selector: service=kubemaster,dc=eqiad,cluster=kubernetes,name=kubemaster1002.eqiad.wmnet | [production] | 
            
  | 14:06 | <andrew@cloudcumin1001> | START - Cookbook wmcs.openstack.restart_openstack | [admin] | 
            
  | 14:04 | <akosiaris> | roll restart mw-api-int pods | [production] | 
            
  | 14:03 | <akosiaris@deploy1002> | helmfile [eqiad] START helmfile.d/services/mw-api-int: sync | [production] | 
            
  | 14:03 | <akosiaris@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply | [production] | 
            
  | 14:03 | <akosiaris@deploy1002> | helmfile [eqiad] START helmfile.d/services/mw-api-int: apply | [production] | 
            
  | 14:01 | <akosiaris> | remove wikikube-ctrl1002 from the rotation to test a theory | [production] | 
            
  | 14:01 | <akosiaris@cumin1002> | conftool action : set/pooled=no; selector: service=kubemaster,dc=eqiad,cluster=kubernetes,name=wikikube-ctrl1001.eqiad.wmnet | [production] | 
            
  | 13:59 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db1170 (T364299)', diff saved to https://phabricator.wikimedia.org/P63440 and previous config saved to /var/cache/conftool/dbconfig/20240528-135912-marostegui.json | [production] | 
            
  | 13:59 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1170.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 13:58 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 6:00:00 on db1170.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 13:58 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1158 (T364299)', diff saved to https://phabricator.wikimedia.org/P63439 and previous config saved to /var/cache/conftool/dbconfig/20240528-135848-marostegui.json | [production] | 
            
  | 13:57 | <ejegg> | fundraising civicrm upgraded from 6c1fdd4f to e2dc8f4e | [production] | 
            
  | 13:55 | <moritzm> | installing pillow security updates | [production] | 
            
  | 13:54 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1218 (re)pooling @ 50%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63438 and previous config saved to /var/cache/conftool/dbconfig/20240528-135419-arnaudb.json | [production] | 
            
  | 13:54 | <akosiaris> | add manually ferm client rule on wikikube-ctrl1002 and disable puppet | [production] | 
            
  | 13:51 | <akosiaris> | run puppet and restart ferm on wikikube-ctrl1001 | [production] | 
            
  | 13:51 | <akosiaris> | run puppet and restart ferm | [production] | 
            
  | 13:46 | <jiji@deploy1002> | Locking from deployment [ALL REPOSITORIES]: Kubernetes masters trouble - no deployments - serviceops | [production] | 
            
  | 13:43 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P63437 and previous config saved to /var/cache/conftool/dbconfig/20240528-134341-marostegui.json | [production] | 
            
  | 13:43 | <logmsgbot> | lucaswerkmeister-wmde@deploy1002 Finished scap: Backport for [[gerrit:1036633|Create electionadmin group on testwiki (T209892)]] (duration: 34m 29s) | [production] | 
            
  | 13:42 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.reimage for host db1207.eqiad.wmnet with OS bookworm | [production] | 
            
  | 13:42 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db1207.eqiad.wmnet with reason: reimage | [production] | 
            
  | 13:42 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 3:00:00 on db1207.eqiad.wmnet with reason: reimage | [production] | 
            
  | 13:41 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depool db1207 T364290', diff saved to https://phabricator.wikimedia.org/P63436 and previous config saved to /var/cache/conftool/dbconfig/20240528-134150-arnaudb.json | [production] | 
            
  | 13:39 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1218 (re)pooling @ 25%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63435 and previous config saved to /var/cache/conftool/dbconfig/20240528-133913-arnaudb.json | [production] | 
            
  | 13:35 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host db1169.eqiad.wmnet | [production] | 
            
  | 13:28 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P63434 and previous config saved to /var/cache/conftool/dbconfig/20240528-132833-marostegui.json | [production] | 
            
  | 13:24 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1218 (re)pooling @ 10%: post reimage repool', diff saved to https://phabricator.wikimedia.org/P63433 and previous config saved to /var/cache/conftool/dbconfig/20240528-132407-arnaudb.json | [production] | 
            
  | 13:20 | <sukhe> | sudo cumin -b1 -s120 'A:dnsbox and not P{dns6001*}' 'run-puppet-agent --enable "merging CR 1036644"' | [production] | 
            
  | 13:17 | <James_F> | Docker: [quibble] Set HOME=/tmp so Firefox etc. can work, for T365871 | [releng] | 
            
  | 13:13 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1158 (T364299)', diff saved to https://phabricator.wikimedia.org/P63432 and previous config saved to /var/cache/conftool/dbconfig/20240528-131325-marostegui.json | [production] | 
            
  | 13:12 | <logmsgbot> | lucaswerkmeister-wmde@deploy1002 lucaswerkmeister-wmde and tstarling: Continuing with sync | [production] | 
            
  | 13:11 | <moritzm> | installing bzip2 bugfix updates | [production] | 
            
  | 13:11 | <logmsgbot> | lucaswerkmeister-wmde@deploy1002 lucaswerkmeister-wmde and tstarling: Backport for [[gerrit:1036633|Create electionadmin group on testwiki (T209892)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 13:09 | <sukhe> | sudo cumin 'A:dnsbox' 'disable-puppet "merging CR 1036644"' | [production] | 
            
  | 13:09 | <logmsgbot> | lucaswerkmeister-wmde@deploy1002 Started scap: Backport for [[gerrit:1036633|Create electionadmin group on testwiki (T209892)]] | [production] | 
            
  | 13:06 | <moritzm> | installing man-db bugfix updates | [production] | 
            
  | 13:04 | <hnowlan@cumin1002> | START - Cookbook sre.hosts.move-vlan for host <spicerack.netbox.NetboxServer object at 0x7f5406426910> | [production] | 
            
  | 13:04 | <hnowlan@cumin1002> | START - Cookbook sre.hosts.reimage for host wikikube-worker2002.codfw.wmnet with OS bullseye | [production] |