| 2025-08-07
      
      § | 
    
  | 23:38 | <vriley@cumin1002> | START - Cookbook sre.hosts.provision for host cloudcephosd1047.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED | [production] | 
            
  | 23:38 | <vriley@cumin1002> | END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudcephosd1047 | [production] | 
            
  | 23:37 | <vriley@cumin1002> | START - Cookbook sre.network.configure-switch-interfaces for host cloudcephosd1047 | [production] | 
            
  | 23:37 | <vriley@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 23:37 | <vriley@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  cloudcephosd1047 - vriley@cumin1002" | [production] | 
            
  | 23:37 | <vriley@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  cloudcephosd1047 - vriley@cumin1002" | [production] | 
            
  | 23:33 | <vriley@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 22:59 | <vriley@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1042.eqiad.wmnet with OS bookworm | [production] | 
            
  | 22:59 | <vriley@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1002" | [production] | 
            
  | 22:58 | <vriley@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1002" | [production] | 
            
  | 22:38 | <vriley@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1042.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 22:34 | <vriley@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1042.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 22:15 | <bking@cumin2002> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs1022.eqiad.wmnet -> wdqs1016.eqiad.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 22:15 | <vriley@cumin1002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1042.eqiad.wmnet with OS bookworm | [production] | 
            
  | 22:09 | <bking@cumin2002> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs2007.codfw.wmnet -> wdqs2018.codfw.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 21:16 | <jgleeson> | payments-wiki upgraded from 0ab5bab9 to 0a1084a8 | [production] | 
            
  | 21:15 | <bking@cumin2002> | START - Cookbook sre.wdqs.data-transfer (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs2007.codfw.wmnet -> wdqs2018.codfw.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 21:15 | <bking@cumin2002> | START - Cookbook sre.wdqs.data-transfer (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs1022.eqiad.wmnet -> wdqs1016.eqiad.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 21:14 | <vriley@cumin1002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1042.eqiad.wmnet with OS bullseye | [production] | 
            
  | 21:01 | <swfrench@deploy1003> | Finished scap sync-world: No-op deployment to clear chart version diffs from https://gerrit.wikimedia.org/r/1176543 (duration: 02m 45s) | [production] | 
            
  | 20:58 | <swfrench@deploy1003> | Started scap sync-world: No-op deployment to clear chart version diffs from https://gerrit.wikimedia.org/r/1176543 | [production] | 
            
  | 20:30 | <cjming@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1176532|Update PageVisit instruments for a logged-in synth experiment (T397140)]] (duration: 07m 34s) | [production] | 
            
  | 20:25 | <cjming@deploy1003> | cjming: Continuing with sync | [production] | 
            
  | 20:24 | <cjming@deploy1003> | cjming: Backport for [[gerrit:1176532|Update PageVisit instruments for a logged-in synth experiment (T397140)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 20:23 | <cjming@deploy1003> | Started scap sync-world: Backport for [[gerrit:1176532|Update PageVisit instruments for a logged-in synth experiment (T397140)]] | [production] | 
            
  | 20:10 | <cjming@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1176528|XLab/Hooks: Only fetch experiment configs when user is registered]] (duration: 08m 05s) | [production] | 
            
  | 20:05 | <cjming@deploy1003> | cjming: Continuing with sync | [production] | 
            
  | 20:04 | <cjming@deploy1003> | cjming: Backport for [[gerrit:1176528|XLab/Hooks: Only fetch experiment configs when user is registered]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 20:02 | <cjming@deploy1003> | Started scap sync-world: Backport for [[gerrit:1176528|XLab/Hooks: Only fetch experiment configs when user is registered]] | [production] | 
            
  | 20:01 | <jhancock@cumin1003> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-fe2020.codfw.wmnet with OS bullseye | [production] | 
            
  | 19:55 | <vriley@cumin1002> | START - Cookbook sre.hosts.reimage for host cloudcephosd1042.eqiad.wmnet with OS bookworm | [production] | 
            
  | 19:28 | <jhancock@cumin1003> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2019.codfw.wmnet with OS bullseye | [production] | 
            
  | 19:28 | <jhancock@cumin1003> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" | [production] | 
            
  | 19:28 | <jhancock@cumin1003> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" | [production] | 
            
  | 19:07 | <rzl@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply | [production] | 
            
  | 19:07 | <rzl@deploy1003> | helmfile [eqiad] START helmfile.d/services/mw-cron: apply | [production] | 
            
  | 19:03 | <jhancock@cumin1003> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-fe2019.codfw.wmnet with reason: host reimage | [production] | 
            
  | 18:57 | <jhancock@cumin1003> | START - Cookbook sre.hosts.downtime for 2:00:00 on ms-fe2019.codfw.wmnet with reason: host reimage | [production] | 
            
  | 18:50 | <cjming@deploy1003> | mwscript-k8s job started: extensions/MetricsPlatform/maintenance/UpdateConfigs.php --wiki aawiki  # Test run for T398422 | [production] | 
            
  | 18:41 | <jhancock@cumin1003> | START - Cookbook sre.hosts.reimage for host ms-fe2020.codfw.wmnet with OS bullseye | [production] | 
            
  | 18:41 | <jhancock@cumin1003> | START - Cookbook sre.hosts.reimage for host ms-fe2019.codfw.wmnet with OS bullseye | [production] | 
            
  | 18:24 | <bking@cumin2002> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs1022.eqiad.wmnet -> wdqs1015.eqiad.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 18:23 | <bking@cumin2002> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T386098, transfer newly-reloaded data) xfer wikidata_main from wdqs2007.codfw.wmnet -> wdqs2022.codfw.wmnet w/ force delete existing files, repooling both afterwards | [production] | 
            
  | 18:16 | <vriley@cumin1002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1042.eqiad.wmnet with OS bookworm | [production] |