| 
      
        2023-06-29
      
     | 
  
    
  | 21:28 | 
  <bking@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2021.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 21:25 | 
  <bking@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2021.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 21:22 | 
  <ryankemper@cumin1001> | 
  END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) | 
  [production] | 
            
  | 21:18 | 
  <samtar@deploy1002> | 
  Finished scap: Backport for [[gerrit:934391|IS: Phonos, reorder and enable for mediawikiwiki (T336763)]] (duration: 08m 26s) | 
  [production] | 
            
  | 21:11 | 
  <samtar@deploy1002> | 
  samtar: Backport for [[gerrit:934391|IS: Phonos, reorder and enable for mediawikiwiki (T336763)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet | 
  [production] | 
            
  | 21:10 | 
  <samtar@deploy1002> | 
  Started scap: Backport for [[gerrit:934391|IS: Phonos, reorder and enable for mediawikiwiki (T336763)]] | 
  [production] | 
            
  | 20:13 | 
  <bking@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host wdqs2021.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:01 | 
  <mutante> | 
  contint* servers: restarted apache after deploying gerrit:932435 | 
  [production] | 
            
  | 19:50 | 
  <bking@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs2021.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 19:48 | 
  <ryankemper@cumin1001> | 
  START - Cookbook sre.wdqs.data-transfer | 
  [production] | 
            
  | 19:30 | 
  <urbanecm@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/linkrecommendation: apply | 
  [production] | 
            
  | 19:30 | 
  <urbanecm@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/linkrecommendation: apply | 
  [production] | 
            
  | 19:29 | 
  <urbanecm@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/linkrecommendation: apply | 
  [production] | 
            
  | 19:29 | 
  <urbanecm@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/linkrecommendation: apply | 
  [production] | 
            
  | 19:29 | 
  <urbanecm@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/linkrecommendation: apply | 
  [production] | 
            
  | 19:28 | 
  <urbanecm@deploy1002> | 
  helmfile [staging] START helmfile.d/services/linkrecommendation: apply | 
  [production] | 
            
  | 19:17 | 
  <rzl@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/opentelemetry-collector: apply | 
  [production] | 
            
  | 19:16 | 
  <rzl@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/opentelemetry-collector: apply | 
  [production] | 
            
  | 19:10 | 
  <rzl@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/opentelemetry-collector: apply | 
  [production] | 
            
  | 19:10 | 
  <rzl@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/opentelemetry-collector: apply | 
  [production] | 
            
  | 18:37 | 
  <eevans@cumin1001> | 
  END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:cassandra-dev: Restarting to upgraded JVM - eevans@cumin1001 | 
  [production] | 
            
  | 18:33 | 
  <eevans@cumin1001> | 
  END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore200[1-3]*: Restarting to upgraded JVM - eevans@cumin1001 | 
  [production] | 
            
  | 18:29 | 
  <bking@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host wdqs2021.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 18:17 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.cassandra.roll-restart for nodes matching A:cassandra-dev: Restarting to upgraded JVM - eevans@cumin1001 | 
  [production] | 
            
  | 18:16 | 
  <brennen@deploy1002> | 
  rebuilt and synchronized wikiversions files: group2 wikis to 1.41.0-wmf.15 refs T340243 | 
  [production] | 
            
  | 18:15 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore200[1-3]*: Restarting to upgraded JVM - eevans@cumin1001 | 
  [production] | 
            
  | 18:06 | 
  <brennen> | 
  train 1.41.0-wmf.15 (T340243): no current blockers, logs calm, rolling to all wikis | 
  [production] | 
            
  | 17:46 | 
  <taavi@deploy1002> | 
  Finished scap: Backport for [[gerrit:934350|Revert "Add extends warning to reference dialog" (T247922 T340757)]] (duration: 11m 06s) | 
  [production] | 
            
  | 17:38 | 
  <taavi@deploy1002> | 
  matmarex and taavi: Backport for [[gerrit:934350|Revert "Add extends warning to reference dialog" (T247922 T340757)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet | 
  [production] | 
            
  | 17:35 | 
  <taavi@deploy1002> | 
  Started scap: Backport for [[gerrit:934350|Revert "Add extends warning to reference dialog" (T247922 T340757)]] | 
  [production] | 
            
  | 17:10 | 
  <btullis@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/datahub: sync on main | 
  [production] | 
            
  | 17:09 | 
  <jiji@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster1002.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 17:07 | 
  <bd808@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply | 
  [production] | 
            
  | 17:06 | 
  <bd808@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/developer-portal: apply | 
  [production] | 
            
  | 17:06 | 
  <bd808@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/developer-portal: apply | 
  [production] | 
            
  | 17:05 | 
  <bd808@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/developer-portal: apply | 
  [production] | 
            
  | 17:05 | 
  <bd808@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/developer-portal: apply | 
  [production] | 
            
  | 17:04 | 
  <bd808@deploy1002> | 
  helmfile [staging] START helmfile.d/services/developer-portal: apply | 
  [production] | 
            
  | 16:59 | 
  <btullis@deploy1002> | 
  helmfile [staging] START helmfile.d/services/datahub: apply on main | 
  [production] | 
            
  | 16:55 | 
  <jiji@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster1002.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:51 | 
  <jiji@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster1002.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:50 | 
  <jiji@cumin1001> | 
  END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host kubestagemaster2002.codfw.wmnet | 
  [production] | 
            
  | 16:50 | 
  <jiji@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster2002.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 16:41 | 
  <jiji@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host kubestagemaster1002.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 16:38 | 
  <jiji@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster2002.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:35 | 
  <jiji@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster2002.codfw.wmnet with reason: host reimage | 
  [production] | 
            
  | 16:22 | 
  <klausman@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/changeprop: apply | 
  [production] | 
            
  | 16:21 | 
  <klausman@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/changeprop: apply | 
  [production] | 
            
  | 16:18 | 
  <mutante> | 
  releases1003 - re-enabling puppet after recent webserver debugging | 
  [production] | 
            
  | 16:18 | 
  <jiji@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host kubestagemaster2002.codfw.wmnet with OS bullseye | 
  [production] |