| 2024-08-28
      
      ยง | 
    
  | 10:41 | <godog> | start prometheus2005 bookworm upgrade - T326657 | [production] | 
            
  | 10:40 | <cmooney@cumin1002> | START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin1002.eqiad.wmnet with reason: Relase v0.7.0 with updated plugin - cmooney@cumin1002 | [production] | 
            
  | 10:38 | <ladsgroup@deploy1003> | Started scap sync-world: Backport for [[gerrit:1067930|Set ruwiki to non simple UI (T372694)]] | [production] | 
            
  | 10:38 | <filippo@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host prometheus2005.codfw.wmnet | [production] | 
            
  | 10:27 | <filippo@cumin1002> | START - Cookbook sre.hosts.reboot-single for host prometheus2005.codfw.wmnet | [production] | 
            
  | 10:27 | <mvernon@cumin2002> | END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw | [production] | 
            
  | 10:27 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P68026 and previous config saved to /var/cache/conftool/dbconfig/20240828-102721-ladsgroup.json | [production] | 
            
  | 10:24 | <mvernon@cumin2002> | START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-codfw | [production] | 
            
  | 10:12 | <arnaudb@cumin1002> | END (FAIL) - Cookbook sre.switchdc.databases.prepare (exit_code=99) for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:12 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:12 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1157 (T370903)', diff saved to https://phabricator.wikimedia.org/P68025 and previous config saved to /var/cache/conftool/dbconfig/20240828-101214-ladsgroup.json | [production] | 
            
  | 10:11 | <arnaudb@cumin1002> | END (ERROR) - Cookbook sre.switchdc.databases.prepare (exit_code=97) for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:11 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:08 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db1157 (T370903)', diff saved to https://phabricator.wikimedia.org/P68024 and previous config saved to /var/cache/conftool/dbconfig/20240828-100803-ladsgroup.json | [production] | 
            
  | 10:07 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1157.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:07 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 8:00:00 on db1157.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:07 | <arnaudb@cumin1002> | END (ERROR) - Cookbook sre.switchdc.databases.prepare (exit_code=97) for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:07 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:05 | <arnaudb@cumin1002> | END (ERROR) - Cookbook sre.switchdc.databases.prepare (exit_code=97) for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:05 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the switch from test-s1 to test-s1 | [production] | 
            
  | 10:01 | <filippo@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host prometheus1005.eqiad.wmnet | [production] | 
            
  | 09:58 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1150.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 09:58 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 8:00:00 on db1150.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 09:57 | <arnaudb@cumin1002> | END (ERROR) - Cookbook sre.switchdc.databases.prepare (exit_code=97) for the (test) switch | [production] | 
            
  | 09:57 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the (test) switch | [production] | 
            
  | 09:57 | <arnaudb@cumin1002> | END (FAIL) - Cookbook sre.switchdc.databases.prepare (exit_code=99) for the (test) switch | [production] | 
            
  | 09:54 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the (test) switch | [production] | 
            
  | 09:49 | <filippo@cumin1002> | START - Cookbook sre.hosts.reboot-single for host prometheus1005.eqiad.wmnet | [production] | 
            
  | 09:49 | <arnaudb@cumin1002> | END (FAIL) - Cookbook sre.switchdc.databases.prepare (exit_code=99) for the (test) switch | [production] | 
            
  | 09:48 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the (test) switch | [production] | 
            
  | 09:40 | <godog> | start prometheus1005 bookworm upgrade - T326657 | [production] | 
            
  | 09:36 | <claime> | homer 'cr*codfw*' commit 'T372878' | [production] | 
            
  | 09:35 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2043.codfw.wmnet | [production] | 
            
  | 09:35 | <cgoubert@cumin1002> | START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2043.codfw.wmnet | [production] | 
            
  | 09:35 | <claime> | pooling wikikube-worker2043.codfw.wmnet - T372878 | [production] | 
            
  | 09:33 | <claime> | homer 'lsw1-a3-codfw*' commit T372878 | [production] | 
            
  | 09:10 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.switchdc.databases.prepare (exit_code=0) for the (test) switch | [production] | 
            
  | 09:02 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the (test) switch | [production] | 
            
  | 08:52 | <jayme> | running homer commit on on cr*codfw* - T372878 | [production] | 
            
  | 08:50 | <jayme@cumin1002> | END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2047.codfw.wmnet | [production] | 
            
  | 08:50 | <jayme@cumin1002> | START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2047.codfw.wmnet | [production] | 
            
  | 08:48 | <jayme> | running homer commit on on lsw1-a6-codfw* - T372878 | [production] | 
            
  | 08:46 | <arnaudb@cumin1002> | END (FAIL) - Cookbook sre.switchdc.databases.prepare (exit_code=99) for the (test) switch | [production] | 
            
  | 08:45 | <arnaudb@cumin1002> | START - Cookbook sre.switchdc.databases.prepare for the (test) switch | [production] | 
            
  | 08:45 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2047.codfw.wmnet with OS bullseye | [production] | 
            
  | 08:40 | <ladsgroup@cumin1002> | dbctl commit (dc=all): 'Depooling db2147 (T371742)', diff saved to https://phabricator.wikimedia.org/P68023 and previous config saved to /var/cache/conftool/dbconfig/20240828-084045-ladsgroup.json | [production] | 
            
  | 08:40 | <ladsgroup@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2147.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 08:40 | <ladsgroup@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db2147.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 08:37 | <hashar@deploy1003> | rebuilt and synchronized wikiversions files: group1 to 1.43.0-wmf.20  refs T366965 | [production] | 
            
  | 08:26 | <jayme@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2047.codfw.wmnet with reason: host reimage | [production] |