| 
      
        2024-07-02
      
      ยง
     | 
  
    
  | 19:21 | 
  <eileen> | 
  civicrm upgraded from 64f23ed0 to 67bcfd72 | 
  [production] | 
            
  | 19:17 | 
  <wmbot~dcaro@urcuchillay> | 
  END (FAIL) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=99) (T309789) | 
  [admin] | 
            
  | 19:09 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2176', diff saved to https://phabricator.wikimedia.org/P65672 and previous config saved to /var/cache/conftool/dbconfig/20240702-190950-marostegui.json | 
  [production] | 
            
  | 18:54 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db2176 (T364069)', diff saved to https://phabricator.wikimedia.org/P65671 and previous config saved to /var/cache/conftool/dbconfig/20240702-185443-marostegui.json | 
  [production] | 
            
  | 17:40 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:40 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:39 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:39 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:36 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:36 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:34 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:34 | 
  <bking@deploy1002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply | 
  [production] | 
            
  | 17:20 | 
  <jforrester@deploy1002> | 
  Finished scap: Backport for [[gerrit:1051416|Update OOUI to v0.50.3]], [[gerrit:1051417|Update OOUI to v0.50.3 (T369010)]] (duration: 10m 06s) | 
  [production] | 
            
  | 17:16 | 
  <andrewbogott> | 
  draining (I hope) tools-elastic-3 and tools-elastic-1 for T311905 | 
  [tools] | 
            
  | 17:15 | 
  <jforrester@deploy1002> | 
  jforrester: Continuing with sync | 
  [production] | 
            
  | 17:14 | 
  <jforrester@deploy1002> | 
  jforrester: Backport for [[gerrit:1051416|Update OOUI to v0.50.3]], [[gerrit:1051417|Update OOUI to v0.50.3 (T369010)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 17:10 | 
  <jforrester@deploy1002> | 
  Started scap sync-world: Backport for [[gerrit:1051416|Update OOUI to v0.50.3]], [[gerrit:1051417|Update OOUI to v0.50.3 (T369010)]] | 
  [production] | 
            
  | 17:07 | 
  <dani@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/miscweb: apply | 
  [production] | 
            
  | 17:07 | 
  <dcaro@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component api-gateway | 
  [tools] | 
            
  | 17:07 | 
  <dani@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/miscweb: apply | 
  [production] | 
            
  | 17:07 | 
  <dani@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/miscweb: apply | 
  [production] | 
            
  | 17:07 | 
  <dcaro@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.k8s.component.deploy for component api-gateway | 
  [tools] | 
            
  | 17:06 | 
  <dani@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/miscweb: apply | 
  [production] | 
            
  | 17:06 | 
  <dani@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/miscweb: apply | 
  [production] | 
            
  | 17:06 | 
  <dani@deploy1002> | 
  helmfile [staging] START helmfile.d/services/miscweb: apply | 
  [production] | 
            
  | 17:06 | 
  <mutante> | 
  lists1004 - sudo systemctl start wmf_auto_restart_exim4 (T369017) | 
  [production] | 
            
  | 17:00 | 
  <dcaro@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component api-gateway | 
  [toolsbeta] | 
            
  | 17:00 | 
  <dcaro@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.k8s.component.deploy for component api-gateway | 
  [toolsbeta] | 
            
  | 16:59 | 
  <fnegri@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.openstack.quota_increase (exit_code=0) (T368669) | 
  [puppet-diffs] | 
            
  | 16:59 | 
  <fnegri@cloudcumin1001> | 
  START - Cookbook wmcs.openstack.quota_increase (T368669) | 
  [puppet-diffs] | 
            
  | 16:55 | 
  <dcaro@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-api | 
  [tools] | 
            
  | 16:55 | 
  <dcaro@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-api | 
  [tools] | 
            
  | 16:54 | 
  <ejegg> | 
  fundraising civicrm upgraded from 41c1bd78 to 64f23ed0 | 
  [production] | 
            
  | 16:53 | 
  <wmbot~bsadowski1@tools-bastion-13> | 
  Restarted StewardBot/StewardBot because of a connection loss | 
  [tools.stewardbots] | 
            
  | 16:52 | 
  <wmbot~bsadowski1@tools-bastion-13> | 
  Restarted StewardBot/SULWatcher because of a connection loss | 
  [tools.stewardbots] | 
            
  | 16:48 | 
  <dcaro@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.k8s.component.deploy (exit_code=0) for component builds-api | 
  [toolsbeta] | 
            
  | 16:48 | 
  <dcaro@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.k8s.component.deploy for component builds-api | 
  [toolsbeta] | 
            
  | 16:16 | 
  <ayounsi@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host testvm2007.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 16:13 | 
  <brett@cumin2002> | 
  START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-upload_drmrs | 
  [production] | 
            
  | 16:02 | 
  <ayounsi@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on testvm2007.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:01 | 
  <brett@cumin2002> | 
  START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-text_drmrs | 
  [production] | 
            
  | 15:58 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-master1004.eqiad.wmnet | 
  [production] | 
            
  | 15:57 | 
  <ayounsi@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on testvm2007.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:51 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reboot-single for host an-master1004.eqiad.wmnet | 
  [production] | 
            
  | 15:50 | 
  <btullis> | 
  failing over hadoop yarn resourcemanager from an-master1004 to an-master1003 | 
  [analytics] | 
            
  | 15:50 | 
  <brouberol@deploy1002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply | 
  [production] | 
            
  | 15:50 | 
  <brouberol@deploy1002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply | 
  [production] | 
            
  | 15:49 | 
  <btullis> | 
  failing over hadoop namenode from an-master1004 to an-master1003 | 
  [analytics] | 
            
  | 15:49 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_esams | 
  [production] | 
            
  | 15:46 | 
  <fabfur@cumin1002> | 
  END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_esams | 
  [production] |