| 
      
        2024-05-14
      
     | 
  
    
  | 08:29 | 
  <dcausse@deploy1002> | 
  dcausse and cscott: Backport for [[gerrit:1031067|Fix the loss of ParserOutput pointer in ContentDOMTransformStages (T364597)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 08:26 | 
  <dcausse@deploy1002> | 
  Started scap: Backport for [[gerrit:1031067|Fix the loss of ParserOutput pointer in ContentDOMTransformStages (T364597)]] | 
  [production] | 
            
  | 08:22 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Bdgreenlee out of all services on: 2208 hosts | 
  [production] | 
            
  | 08:21 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.idm.logout Logging Bdgreenlee out of all services on: 2208 hosts | 
  [production] | 
            
  | 08:15 | 
  <jayme@cumin1002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubestagemaster2005.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 08:09 | 
  <hashar> | 
  Reloaded Zuul for https://gerrit.wikimedia.org/r/1025313 | 
  [releng] | 
            
  | 07:56 | 
  <kartik@deploy1002> | 
  Finished scap: Backport for [[gerrit:1030325|CX: Add mw.cx.UserPermissionChecker (T349959)]] (duration: 17m 52s) | 
  [production] | 
            
  | 07:55 | 
  <klausman@deploy1002> | 
  helmfile [ml-staging-codfw] DONE helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 07:54 | 
  <klausman@deploy1002> | 
  helmfile [ml-staging-codfw] START helmfile.d/admin 'apply'. | 
  [production] | 
            
  | 07:54 | 
  <moritzm> | 
  installing PHP 7.3 security updates | 
  [production] | 
            
  | 07:53 | 
  <ayounsi@cumin1002> | 
  END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 215887 | 
  [production] | 
            
  | 07:53 | 
  <ayounsi@cumin1002> | 
  START - Cookbook sre.network.peering with action 'configure' for AS: 215887 | 
  [production] | 
            
  | 07:48 | 
  <dcaro> | 
  draining tools-k8s-worker-nfs-9 as it's stuck on IO | 
  [tools] | 
            
  | 07:48 | 
  <dcaro@cloudcumin1001> | 
  END (FAIL) - Cookbook wmcs.toolforge.k8s.worker.drain (exit_code=99) for node tools-k8s-worker-nfs-9 | 
  [tools] | 
            
  | 07:48 | 
  <dcaro@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.k8s.worker.drain for node tools-k8s-worker-nfs-9 | 
  [tools] | 
            
  | 07:46 | 
  <moritzm> | 
  installing libgd2 security updates | 
  [production] | 
            
  | 07:44 | 
  <kartik@deploy1002> | 
  kartik: Continuing with sync | 
  [production] | 
            
  | 07:42 | 
  <kartik@deploy1002> | 
  kartik: Backport for [[gerrit:1030325|CX: Add mw.cx.UserPermissionChecker (T349959)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 07:39 | 
  <kartik@deploy1002> | 
  Started scap: Backport for [[gerrit:1030325|CX: Add mw.cx.UserPermissionChecker (T349959)]] | 
  [production] | 
            
  | 07:27 | 
  <kartik@deploy1002> | 
  Finished scap: Backport for [[gerrit:1030978|Set $wgSignatureValidation to 'disallow' on Polish Wikipedia (T364769)]] (duration: 18m 28s) | 
  [production] | 
            
  | 07:17 | 
  <marostegui@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2185.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 07:15 | 
  <kartik@deploy1002> | 
  kartik and msz2001: Continuing with sync | 
  [production] | 
            
  | 07:12 | 
  <kartik@deploy1002> | 
  kartik and msz2001: Backport for [[gerrit:1030978|Set $wgSignatureValidation to 'disallow' on Polish Wikipedia (T364769)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 07:09 | 
  <kartik@deploy1002> | 
  Started scap: Backport for [[gerrit:1030978|Set $wgSignatureValidation to 'disallow' on Polish Wikipedia (T364769)]] | 
  [production] | 
            
  | 07:04 | 
  <moritzm> | 
  installing glib2.0 security updates | 
  [production] | 
            
  | 06:56 | 
  <marostegui@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2185.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 06:54 | 
  <marostegui@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on db2185.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 06:35 | 
  <marostegui@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host db2185.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 06:33 | 
  <marostegui@cumin1002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host db2185.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 06:33 | 
  <marostegui@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host db2185.codfw.wmnet with OS bookworm | 
  [production] | 
            
  | 05:31 | 
  <kart_> | 
  Updated cxserver to 2024-04-23-221507-production (T363263, T333969, T360303, T360310) | 
  [production] | 
            
  | 05:25 | 
  <kartik@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 05:24 | 
  <kartik@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 05:22 | 
  <kartik@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 05:22 | 
  <kartik@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 05:19 | 
  <kartik@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 05:19 | 
  <kartik@deploy1002> | 
  helmfile [staging] START helmfile.d/services/cxserver: apply | 
  [production] | 
            
  | 05:15 | 
  <kart_> | 
  Updated MinT to 2024-03-28-061726-production (T333969) | 
  [production] | 
            
  | 05:08 | 
  <kartik@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply | 
  [production] | 
            
  | 04:59 | 
  <kartik@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/machinetranslation: apply | 
  [production] | 
            
  | 04:33 | 
  <kartik@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/machinetranslation: apply | 
  [production] | 
            
  | 04:25 | 
  <kartik@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/machinetranslation: apply | 
  [production] | 
            
  | 04:18 | 
  <kartik@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/machinetranslation: apply | 
  [production] | 
            
  | 04:14 | 
  <kartik@deploy1002> | 
  helmfile [staging] START helmfile.d/services/machinetranslation: apply | 
  [production] | 
            
  | 04:00 | 
  <mwpresync@deploy1002> | 
  Finished scap: testwikis wikis to 1.43.0-wmf.5  refs T361399 (duration: 57m 45s) | 
  [production] | 
            
  | 03:02 | 
  <mwpresync@deploy1002> | 
  Started scap: testwikis wikis to 1.43.0-wmf.5  refs T361399 | 
  [production] | 
            
  | 02:34 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 02:34 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1171.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 02:33 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1170 (T352010)', diff saved to Unable to send diff to phaste and previous config saved to /var/cache/conftool/dbconfig/20240514-023316-ladsgroup.json | 
  [production] | 
            
  | 02:18 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1170', diff saved to https://phabricator.wikimedia.org/P62375 and previous config saved to /var/cache/conftool/dbconfig/20240514-021809-ladsgroup.json | 
  [production] |