| 2025-09-03
      
      ยง | 
    
  | 12:52 | <sukhe@cumin1003> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:52 | <ayounsi@cumin1003> | END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host atlas3001.wikimedia.org | [production] | 
            
  | 12:52 | <ayounsi@cumin1003> | END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) atlas3001.wikimedia.org on all recursors | [production] | 
            
  | 12:52 | <ayounsi@cumin1003> | START - Cookbook sre.dns.wipe-cache atlas3001.wikimedia.org on all recursors | [production] | 
            
  | 12:52 | <ayounsi@cumin1003> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 12:52 | <ayounsi@cumin1003> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM atlas3001.wikimedia.org - ayounsi@cumin1003" | [production] | 
            
  | 12:52 | <ayounsi@cumin1003> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove records for VM atlas3001.wikimedia.org - ayounsi@cumin1003" | [production] | 
            
  | 12:47 | <ayounsi@cumin1003> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:47 | <ayounsi@cumin1003> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 12:45 | <sukhe@cumin1003> | START - Cookbook sre.hosts.decommission for hosts doh3003.wikimedia.org | [production] | 
            
  | 12:44 | <ayounsi@cumin1003> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:44 | <ayounsi@cumin1003> | END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) atlas3001.wikimedia.org on all recursors | [production] | 
            
  | 12:44 | <ayounsi@cumin1003> | START - Cookbook sre.dns.wipe-cache atlas3001.wikimedia.org on all recursors | [production] | 
            
  | 12:44 | <ayounsi@cumin1003> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 12:44 | <ayounsi@cumin1003> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM atlas3001.wikimedia.org - ayounsi@cumin1003" | [production] | 
            
  | 12:43 | <ayounsi@cumin1003> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM atlas3001.wikimedia.org - ayounsi@cumin1003" | [production] | 
            
  | 12:43 | <sukhe@cumin1003> | START - Cookbook sre.hosts.decommission for hosts durum3003.esams.wmnet | [production] | 
            
  | 12:40 | <ayounsi@cumin1003> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 12:40 | <ayounsi@cumin1003> | START - Cookbook sre.ganeti.makevm for new host atlas3001.wikimedia.org | [production] | 
            
  | 12:16 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2165 (T401906)', diff saved to https://phabricator.wikimedia.org/P82476 and previous config saved to /var/cache/conftool/dbconfig/20250903-121648-fceratto.json | [production] | 
            
  | 12:15 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Depooling db2165 (T401906)', diff saved to https://phabricator.wikimedia.org/P82475 and previous config saved to /var/cache/conftool/dbconfig/20250903-121538-fceratto.json | [production] | 
            
  | 12:15 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2165.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 12:15 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2164 (T401906)', diff saved to https://phabricator.wikimedia.org/P82474 and previous config saved to /var/cache/conftool/dbconfig/20250903-121514-fceratto.json | [production] | 
            
  | 12:04 | <Amir1> | dropping objectcache table in group0 (T397367) | [production] | 
            
  | 12:00 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2164', diff saved to https://phabricator.wikimedia.org/P82473 and previous config saved to /var/cache/conftool/dbconfig/20250903-120007-fceratto.json | [production] | 
            
  | 11:46 | <btullis@cumin1003> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1234.eqiad.wmnet with OS bullseye | [production] | 
            
  | 11:45 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2164', diff saved to https://phabricator.wikimedia.org/P82472 and previous config saved to /var/cache/conftool/dbconfig/20250903-114500-fceratto.json | [production] | 
            
  | 11:29 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2164 (T401906)', diff saved to https://phabricator.wikimedia.org/P82471 and previous config saved to /var/cache/conftool/dbconfig/20250903-112952-fceratto.json | [production] | 
            
  | 11:29 | <btullis@cumin1003> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1234.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 11:28 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Depooling db2164 (T401906)', diff saved to https://phabricator.wikimedia.org/P82470 and previous config saved to /var/cache/conftool/dbconfig/20250903-112842-fceratto.json | [production] | 
            
  | 11:28 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2164.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 11:28 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2163 (T401906)', diff saved to https://phabricator.wikimedia.org/P82469 and previous config saved to /var/cache/conftool/dbconfig/20250903-112820-fceratto.json | [production] | 
            
  | 11:25 | <mvolz@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/citoid: apply | [production] | 
            
  | 11:25 | <btullis@cumin1003> | START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1234.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 11:25 | <mvolz@deploy1003> | helmfile [eqiad] START helmfile.d/services/citoid: apply | [production] | 
            
  | 11:24 | <mvolz@deploy1003> | helmfile [codfw] DONE helmfile.d/services/citoid: apply | [production] | 
            
  | 11:24 | <mvolz@deploy1003> | helmfile [codfw] START helmfile.d/services/citoid: apply | [production] | 
            
  | 11:21 | <mvolz@deploy1003> | helmfile [staging] DONE helmfile.d/services/citoid: apply | [production] | 
            
  | 11:21 | <mvolz@deploy1003> | helmfile [staging] START helmfile.d/services/citoid: apply | [production] | 
            
  | 11:13 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2163', diff saved to https://phabricator.wikimedia.org/P82468 and previous config saved to /var/cache/conftool/dbconfig/20250903-111313-fceratto.json | [production] | 
            
  | 11:12 | <mvolz@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/citoid: apply | [production] | 
            
  | 11:12 | <mvolz@deploy1003> | helmfile [eqiad] START helmfile.d/services/citoid: apply | [production] | 
            
  | 11:10 | <mvolz@deploy1003> | helmfile [codfw] DONE helmfile.d/services/citoid: apply | [production] | 
            
  | 11:10 | <mvolz@deploy1003> | helmfile [codfw] START helmfile.d/services/citoid: apply | [production] | 
            
  | 11:08 | <mvolz@deploy1003> | helmfile [staging] DONE helmfile.d/services/citoid: apply | [production] | 
            
  | 11:08 | <mvolz@deploy1003> | helmfile [staging] START helmfile.d/services/citoid: apply | [production] | 
            
  | 11:05 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 11:05 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 11:01 | <btullis@cumin1003> | START - Cookbook sre.hosts.reimage for host an-worker1234.eqiad.wmnet with OS bullseye | [production] | 
            
  | 11:01 | <btullis@cumin1003> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1234.eqiad.wmnet with OS bullseye | [production] |