| 2022-08-08 |
    
  | 20:11 | <mwdebug-deploy@deploy1002> | helmfile [codfw] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 20:11 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 20:11 | <cjming@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:817785|Disable sticky header edit A/B test for pilot wikis (T312296)]] (duration: 03m 35s) | [production] | 
            
  | 20:08 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 17:34 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1088.eqiad.wmnet with OS bullseye | [production] | 
            
  | 17:15 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1088.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 17:12 | <bking@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1088.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 17:00 | <bking@cumin1001> | START - Cookbook sre.hosts.reimage for host elastic1088.eqiad.wmnet with OS bullseye | [production] | 
            
  | 16:54 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1085.eqiad.wmnet with OS bullseye | [production] | 
            
  | 16:49 | <ryankemper@cumin1001> | END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reimage (bullseye upgrade) - ryankemper@cumin1001 - T289135 | [production] | 
            
  | 16:43 | <pt1979@cumin2002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 16:41 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1085.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:39 | <pt1979@cumin2002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 16:38 | <bking@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1085.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:26 | <bking@cumin1001> | START - Cookbook sre.hosts.reimage for host elastic1085.eqiad.wmnet with OS bullseye | [production] | 
            
  | 16:24 | <bking@cumin1001> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host elastic1085.eqiad.wmnet with OS bullseye | [production] | 
            
  | 16:19 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1085.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:16 | <bking@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1085.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 16:16 | <ryankemper@cumin1001> | START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reimage (bullseye upgrade) - ryankemper@cumin1001 - T289135 | [production] | 
            
  | 16:14 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . | [production] | 
            
  | 16:12 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . | [production] | 
            
  | 16:10 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . | [production] | 
            
  | 16:09 | <ryankemper@cumin1001> | END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reimage (bullseye upgrade) - ryankemper@cumin1001 - T289135 | [production] | 
            
  | 16:04 | <bking@cumin1001> | START - Cookbook sre.hosts.reimage for host elastic1085.eqiad.wmnet with OS bullseye | [production] | 
            
  | 16:00 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1084.eqiad.wmnet with OS bullseye | [production] | 
            
  | 15:58 | <ryankemper@cumin1001> | START - Cookbook sre.elasticsearch.rolling-operation Operation.REIMAGE (1 nodes at a time) for ElasticSearch cluster search_eqiad: eqiad cluster reimage (bullseye upgrade) - ryankemper@cumin1001 - T289135 | [production] | 
            
  | 15:47 | <bking@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1084.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 15:46 | <sukhe> | upload reprepro -C main include bullseye-wikimedia python-pynetbox_6.6.0-1+wmf11u1_amd64.changes | [production] | 
            
  | 15:45 | <bking@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1084.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 15:37 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2021.codfw.wmnet with reason: Maint | [production] | 
            
  | 15:37 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es2021.codfw.wmnet with reason: Maint | [production] | 
            
  | 15:32 | <bking@cumin1001> | START - Cookbook sre.hosts.reimage for host elastic1084.eqiad.wmnet with OS bullseye | [production] | 
            
  | 14:59 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . | [production] | 
            
  | 14:55 | <elukey@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . | [production] | 
            
  | 14:47 | <sukhe@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on cp5001.eqsin.wmnet with reason: depooled: faulty DIMM: T314256 | [production] | 
            
  | 14:46 | <sukhe@cumin2002> | START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on cp5001.eqsin.wmnet with reason: depooled: faulty DIMM: T314256 | [production] | 
            
  | 14:34 | <elukey@deploy1002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . | [production] | 
            
  | 14:11 | <kevinbazira@deploy1002> | helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . | [production] | 
            
  | 13:03 | <mwdebug-deploy@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 13:01 | <mwdebug-deploy@deploy1002> | helmfile [codfw] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 13:01 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 12:58 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 12:56 | <urbanecm@deploy1002> | Synchronized wmf-config/CommonSettings.php: 77fd5abdd7d9462869259e1511bbcf2d7ce62246: Growth: Add new rights to wgAvailableRights (duration: 03m 24s) | [production] | 
            
  | 12:30 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-worker1102.eqiad.wmnet | [production] | 
            
  | 12:09 | <mwdebug-deploy@deploy1002> | helmfile [codfw] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 12:09 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply | [production] | 
            
  | 12:06 | <urbanecm@deploy1002> | Synchronized php-1.39.0-wmf.23/extensions/GrowthExperiments/: 3eaf155678b7313c55dcca0cd39ab29f73eead37: MentorTools: Do not use MentorWeightManager (T314362) (duration: 03m 31s) | [production] | 
            
  | 12:04 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] START helmfile.d/services/mwdebug: apply | [production] | 
            
  | 11:43 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-worker1102.eqiad.wmnet | [production] | 
            
  | 11:21 | <jelto@cumin1001> | conftool action : set/pooled=yes; selector: name=kubernetes2022.codfw.wmnet | [production] |