| 
      
        2025-01-27
      
      ยง
     | 
  
    
  | 10:56 | 
  <cmooney@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on asw1-b[3-4]-magru.mgmt with reason: upgrading JunOS on magru core routers | 
  [production] | 
            
  | 10:47 | 
  <topranks> | 
  rebooting cr2-magru to complete upgrade T384774 | 
  [production] | 
            
  | 10:44 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P72459 and previous config saved to /var/cache/conftool/dbconfig/20250127-104415-marostegui.json | 
  [production] | 
            
  | 10:43 | 
  <vgutierrez> | 
  testing pybal 1.15.15 in lvs4010 | 
  [production] | 
            
  | 10:43 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2025.codfw.wmnet | 
  [production] | 
            
  | 10:43 | 
  <root@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host db1171.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 10:42 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2025.codfw.wmnet | 
  [production] | 
            
  | 10:42 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host idp-test1004.wikimedia.org | 
  [production] | 
            
  | 10:38 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reboot-single for host idp-test1004.wikimedia.org | 
  [production] | 
            
  | 10:37 | 
  <jynus@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1171.eqiad.wmnet with reason: reimage | 
  [production] | 
            
  | 10:36 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host rpki1001.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 10:36 | 
  <jmm@cumin2002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host rpki1001.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 10:34 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host rpki1001.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 10:34 | 
  <fabfur> | 
  installing haproxykafka on esams (https://gerrit.wikimedia.org/r/c/operations/puppet/+/1114329) (T378578) | 
  [production] | 
            
  | 10:33 | 
  <jmm@cumin2002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host rpki1001.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 10:29 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1185', diff saved to https://phabricator.wikimedia.org/P72458 and previous config saved to /var/cache/conftool/dbconfig/20250127-102908-marostegui.json | 
  [production] | 
            
  | 10:26 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Depool es1024 T384820', diff saved to https://phabricator.wikimedia.org/P72457 and previous config saved to /var/cache/conftool/dbconfig/20250127-102657-marostegui.json | 
  [production] | 
            
  | 10:24 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host idp-test2004.wikimedia.org | 
  [production] | 
            
  | 10:20 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reboot-single for host idp-test2004.wikimedia.org | 
  [production] | 
            
  | 10:20 | 
  <topranks> | 
  installing updated JunOS image on cr2-magru T384774 | 
  [production] | 
            
  | 10:16 | 
  <marostegui@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore[1007,1009].eqiad.wmnet with reason: Index rebuild + upgrade | 
  [production] | 
            
  | 10:14 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1185 (T384592)', diff saved to https://phabricator.wikimedia.org/P72456 and previous config saved to /var/cache/conftool/dbconfig/20250127-101401-marostegui.json | 
  [production] | 
            
  | 10:13 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host idp-test2005.wikimedia.org | 
  [production] | 
            
  | 10:11 | 
  <marostegui@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore[1007,1009].eqiad.wmnet with reason: Index rebuild + upgrade | 
  [production] | 
            
  | 10:09 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reboot-single for host idp-test2005.wikimedia.org | 
  [production] | 
            
  | 10:07 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2020.codfw.wmnet | 
  [production] | 
            
  | 10:04 | 
  <cmooney@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on cr[1-2]-magru,cr[1-2]-magru IPv6 with reason: upgrading JunOS on magru core routers | 
  [production] | 
            
  | 09:54 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1185 (T384592)', diff saved to https://phabricator.wikimedia.org/P72455 and previous config saved to /var/cache/conftool/dbconfig/20250127-095416-marostegui.json | 
  [production] | 
            
  | 09:54 | 
  <marostegui@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on db1185.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 09:53 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1161 (T384592)', diff saved to https://phabricator.wikimedia.org/P72454 and previous config saved to /var/cache/conftool/dbconfig/20250127-095354-marostegui.json | 
  [production] | 
            
  | 09:47 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host rpki1001.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 09:47 | 
  <moritzm> | 
  reimaging rpki1001 to bookworm | 
  [production] | 
            
  | 09:42 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'db1241 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72453 and previous config saved to /var/cache/conftool/dbconfig/20250127-094206-root.json | 
  [production] | 
            
  | 09:38 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P72452 and previous config saved to /var/cache/conftool/dbconfig/20250127-093847-marostegui.json | 
  [production] | 
            
  | 09:32 | 
  <jmm@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-test-presto1001.eqiad.wmnet | 
  [production] | 
            
  | 09:27 | 
  <jmm@cumin2002> | 
  START - Cookbook sre.hosts.reboot-single for host an-test-presto1001.eqiad.wmnet | 
  [production] | 
            
  | 09:27 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'db1241 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72451 and previous config saved to /var/cache/conftool/dbconfig/20250127-092701-root.json | 
  [production] | 
            
  | 09:23 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1161', diff saved to https://phabricator.wikimedia.org/P72450 and previous config saved to /var/cache/conftool/dbconfig/20250127-092340-marostegui.json | 
  [production] | 
            
  | 09:11 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'db1241 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72449 and previous config saved to /var/cache/conftool/dbconfig/20250127-091155-root.json | 
  [production] | 
            
  | 09:08 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1161 (T384592)', diff saved to https://phabricator.wikimedia.org/P72448 and previous config saved to /var/cache/conftool/dbconfig/20250127-090833-marostegui.json | 
  [production] | 
            
  | 09:02 | 
  <jmm@dns1004> | 
  END - running authdns-update | 
  [production] | 
            
  | 09:00 | 
  <jmm@dns1004> | 
  START - running authdns-update | 
  [production] | 
            
  | 08:56 | 
  <moritzm> | 
  installing net-tools bugfix updates on bullseye | 
  [production] | 
            
  | 08:56 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'db1241 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72447 and previous config saved to /var/cache/conftool/dbconfig/20250127-085650-root.json | 
  [production] | 
            
  | 08:56 | 
  <root@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1223.eqiad.wmnet with reason: Index rebuild | 
  [production] | 
            
  | 08:54 | 
  <root@cumin1002> | 
  END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1223.eqiad.wmnet | 
  [production] | 
            
  | 08:50 | 
  <root@cumin1002> | 
  DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1223.eqiad.wmnet with reason: Index rebuild + upgrade | 
  [production] | 
            
  | 08:49 | 
  <marostegui> | 
  Upgrade db1223 T384807 | 
  [production] | 
            
  | 08:49 | 
  <root@cumin1002> | 
  START - Cookbook sre.mysql.upgrade for db1223.eqiad.wmnet | 
  [production] | 
            
  | 08:48 | 
  <marostegui@cumin1002> | 
  dbctl commit (dc=all): 'Depool db1223', diff saved to https://phabricator.wikimedia.org/P72446 and previous config saved to /var/cache/conftool/dbconfig/20250127-084857-marostegui.json | 
  [production] |