| 2023-11-29
      
      ยง | 
    
  | 14:55 | <lucaswerkmeister-wmde@deploy2002> | Finished scap: Backport for [[gerrit:978096|Configure wiki-highlights experiment stream (T348613)]] (duration: 42m 58s) | [production] | 
            
  | 14:48 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logging-hd2001.codfw.wmnet with reason: host reimage | [production] | 
            
  | 14:45 | <pt1979@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on logging-hd2001.codfw.wmnet with reason: host reimage | [production] | 
            
  | 14:43 | <btullis@cumin1001> | START - Cookbook sre.hosts.reimage for host schema1004.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:39 | <bblack> | cp4052 - depool and disable puppet agent, more pipe debug | [production] | 
            
  | 14:38 | <lucaswerkmeister-wmde@deploy2002> | sbisson and lucaswerkmeister-wmde: Continuing with sync | [production] | 
            
  | 14:38 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host schema1003.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:36 | <lucaswerkmeister-wmde@deploy2002> | sbisson and lucaswerkmeister-wmde: Backport for [[gerrit:978096|Configure wiki-highlights experiment stream (T348613)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:34 | <pt1979@cumin2002> | START - Cookbook sre.hosts.reimage for host logging-hd2002.codfw.wmnet with OS bullseye | [production] | 
            
  | 14:24 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on schema1003.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:21 | <btullis@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on schema1003.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 14:17 | <pt1979@cumin2002> | START - Cookbook sre.hosts.reimage for host logging-hd2001.codfw.wmnet with OS bullseye | [production] | 
            
  | 14:15 | <elukey> | reload thanos-rule on titan[12]001 to pick up new pyrra rec rules | [production] | 
            
  | 14:12 | <lucaswerkmeister-wmde@deploy2002> | Started scap: Backport for [[gerrit:978096|Configure wiki-highlights experiment stream (T348613)]] | [production] | 
            
  | 14:10 | <btullis@cumin1001> | START - Cookbook sre.hosts.reimage for host schema1003.eqiad.wmnet with OS bookworm | [production] | 
            
  | 13:42 | <moritzm> | installing tiff security updates | [production] | 
            
  | 13:33 | <jbond@cumin1001> | END (PASS) - Cookbook sre.swift.audit-labels (exit_code=0) for host ms-be[2044-2073].codfw.wmnet,ms-be[1044-1075].eqiad.wmnet | [production] | 
            
  | 13:33 | <jbond@cumin1001> | START - Cookbook sre.swift.audit-labels for host ms-be[2044-2073].codfw.wmnet,ms-be[1044-1075].eqiad.wmnet | [production] | 
            
  | 13:30 | <jbond@cumin1001> | END (FAIL) - Cookbook sre.swift.audit-labels (exit_code=99) for host ms-be[2044-2073].codfw.wmnet,ms-be[1044-1075].eqiad.wmnet | [production] | 
            
  | 13:30 | <jbond@cumin1001> | START - Cookbook sre.swift.audit-labels for host ms-be[2044-2073].codfw.wmnet,ms-be[1044-1075].eqiad.wmnet | [production] | 
            
  | 13:09 | <cmooney@cumin1001> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 13:09 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 13:09 | <cmooney@cumin1001> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 13:05 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM netflow4002.ulsfo.wmnet | [production] | 
            
  | 13:05 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on netbox1002.eqiad.wmnet with reason: Restoring DB from backup on netboxdb1002 | [production] | 
            
  | 13:05 | <cmooney@cumin1001> | START - Cookbook sre.hosts.downtime for 0:20:00 on netbox1002.eqiad.wmnet with reason: Restoring DB from backup on netboxdb1002 | [production] | 
            
  | 13:01 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 13:01 | <cmooney@cumin1001> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 13:00 | <jmm@cumin2002> | START - Cookbook sre.ganeti.reboot-vm for VM netflow4002.ulsfo.wmnet | [production] | 
            
  | 12:58 | <topranks> | restoring DB snapshot from 11:37 UTC to netboxdb1002 | [production] | 
            
  | 12:52 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:52 | <cmooney@cumin1001> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 12:46 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:44 | <hashar@deploy2002> | Finished deploy [gerrit/gerrit@6b23c27]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 (duration: 00m 07s) | [production] | 
            
  | 12:43 | <hashar@deploy2002> | Started deploy [gerrit/gerrit@6b23c27]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 | [production] | 
            
  | 12:36 | <hashar@deploy2002> | Finished deploy [gerrit/gerrit@6b23c27]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 (duration: 00m 06s) | [production] | 
            
  | 12:35 | <hashar@deploy2002> | Started deploy [gerrit/gerrit@6b23c27]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 | [production] | 
            
  | 12:35 | <hashar@deploy2002> | Finished deploy [gervert/deploy@ca6bba0]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 (duration: 00m 12s) | [production] | 
            
  | 12:35 | <hashar@deploy2002> | Started deploy [gervert/deploy@ca6bba0]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 | [production] | 
            
  | 12:25 | <vgutierrez> | rolling restart of pybal on lvs4008 and lvs4010, effectively enabling IPIP encapsulation for ncredir@ulsfo - T351069 | [production] | 
            
  | 12:22 | <hashar@deploy2002> | Finished deploy [gerrit/gerrit@a087269]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 (duration: 00m 15s) | [production] | 
            
  | 12:22 | <hashar@deploy2002> | Started deploy [gerrit/gerrit@a087269]: Verify scap deployment after changing the scap user from gerrit2 to gerrit-deploy - T317412 | [production] | 
            
  | 12:06 | <fabfur@cumin1001> | END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cp[1075-1090].eqiad.wmnet | [production] | 
            
  | 12:06 | <fabfur@cumin1001> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 12:05 | <fabfur@cumin1001> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[1075-1090].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - fabfur@cumin1001" | [production] | 
            
  | 12:05 | <klausman@deploy2002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . | [production] | 
            
  | 12:04 | <fabfur@cumin1001> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cp[1075-1090].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - fabfur@cumin1001" | [production] | 
            
  | 12:02 | <hashar> | Disabled Puppet agent on gerrit1003 and gerrit2002 to roll https://gerrit.wikimedia.org/r/844998 which requires some manual steps | T317412 | [production] | 
            
  | 11:26 | <jiji@deploy2002> | helmfile [codfw] DONE helmfile.d/services/tegola-vector-tiles: apply | [production] | 
            
  | 11:26 | <jiji@deploy2002> | helmfile [codfw] START helmfile.d/services/tegola-vector-tiles: apply | [production] |