| 2024-08-02
      
      ยง | 
    
  | 23:19 | <jclark@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker1260.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 22:48 | <jclark@cumin1002> | START - Cookbook sre.hosts.provision for host wikikube-worker1260.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 22:44 | <jclark@cumin1002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wikikube-worker1260.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 22:44 | <jclark@cumin1002> | START - Cookbook sre.hosts.provision for host wikikube-worker1260.mgmt.eqiad.wmnet with reboot policy FORCED | [production] | 
            
  | 22:06 | <wmbot~anticomposite@tools-bastion-13> | ./stewardbots/StewardBot/manage.sh restart # IRC connection failed, try again | [tools.stewardbots] | 
            
  | 21:55 | <ejegg> | standalone (IPN listener) SmashPig upgraded from 1b2d9a6e to 5e784691 | [production] | 
            
  | 21:31 | <wmbot~bsadowski1@tools-bastion-13> | Restarted StewardBot/StewardBot because of a connection loss | [tools.stewardbots] | 
            
  | 16:01 | <xcollazo@deploy1003> | Finished deploy [airflow-dags/analytics@d573c40]: Deploy latest DAGs for analytics Airflow instance. T368756 (duration: 01m 02s) | [production] | 
            
  | 16:00 | <xcollazo> | Deploy latest DAGs for analytics Airflow instance | [analytics] | 
            
  | 16:00 | <xcollazo@deploy1003> | Started deploy [airflow-dags/analytics@d573c40]: Deploy latest DAGs for analytics Airflow instance. T368756 | [production] | 
            
  | 15:10 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2235.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 15:05 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host db2235.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 15:00 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2234.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:53 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host db2234.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:52 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2233.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:49 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host db2233.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:41 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2232.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:34 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host db2232.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:34 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db2231.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:27 | <elukey@cumin1002> | START - Cookbook sre.hosts.provision for host db2231.mgmt.codfw.wmnet with reboot policy GRACEFUL | [production] | 
            
  | 14:11 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host prometheus2008.codfw.wmnet with OS bookworm | [production] | 
            
  | 13:56 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on prometheus2008.codfw.wmnet with reason: host reimage | [production] | 
            
  | 13:52 | <pt1979@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on prometheus2008.codfw.wmnet with reason: host reimage | [production] | 
            
  | 13:50 | <pt1979@cumin2002> | START - Cookbook sre.hosts.reimage for host prometheus2008.codfw.wmnet with OS bookworm | [production] | 
            
  | 13:48 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host prometheus2007.codfw.wmnet with OS bookworm | [production] | 
            
  | 13:44 | <sukhe> | running authdns-update for CR: 1059362 T371304 | [production] | 
            
  | 13:44 | <sukhe> | running authdns-update for CR: T3713041059362 | [production] | 
            
  | 13:38 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on prometheus2007.codfw.wmnet with reason: host reimage | [production] | 
            
  | 13:37 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host alert2002.wikimedia.org with OS bookworm | [production] | 
            
  | 13:35 | <pt1979@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on prometheus2007.codfw.wmnet with reason: host reimage | [production] | 
            
  | 13:33 | <pt1979@cumin2002> | START - Cookbook sre.hosts.reimage for host prometheus2007.codfw.wmnet with OS bookworm | [production] | 
            
  | 13:27 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on alert2002.wikimedia.org with reason: host reimage | [production] | 
            
  | 13:24 | <pt1979@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on alert2002.wikimedia.org with reason: host reimage | [production] | 
            
  | 13:21 | <pt1979@cumin2002> | START - Cookbook sre.hosts.reimage for host alert2002.wikimedia.org with OS bookworm | [production] | 
            
  | 13:11 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "prometheus - ayounsi@cumin1002" | [production] | 
            
  | 13:10 | <ayounsi@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "prometheus - ayounsi@cumin1002" | [production] | 
            
  | 11:03 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 11:03 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 10:55 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 10:23 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "alert2002 - ayounsi@cumin1002" | [production] | 
            
  | 10:18 | <ayounsi@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "alert2002 - ayounsi@cumin1002" | [production] | 
            
  | 10:18 | <elukey> | manually start dump_cloud_ip_ranges.service on puppetmaster1001 as test | [production] | 
            
  | 10:11 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 10:11 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:23 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:14 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:09 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 09:06 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db1195 (T367856)', diff saved to https://phabricator.wikimedia.org/P67203 and previous config saved to /var/cache/conftool/dbconfig/20240802-090649-marostegui.json | [production] | 
            
  | 09:06 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1195.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 09:06 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1195.eqiad.wmnet with reason: Maintenance | [production] |