| 
      
        2024-12-24
      
      §
     | 
  
    
  | 10:53 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10 days, 0:00:00 on db2123.codfw.wmnet with reason: Broken T382743 T382743 | 
  [production] | 
            
  | 10:53 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 10 days, 0:00:00 on db2123.codfw.wmnet with reason: Broken T382743 T382743 | 
  [production] | 
            
  | 10:52 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Depool db2123 T382743', diff saved to https://phabricator.wikimedia.org/P71746 and previous config saved to /var/cache/conftool/dbconfig/20241224-105203-ladsgroup.json | 
  [production] | 
            
  | 10:33 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Promote db2213 to s5 primary and set section read-write T382743', diff saved to https://phabricator.wikimedia.org/P71745 and previous config saved to /var/cache/conftool/dbconfig/20241224-103304-ladsgroup.json | 
  [production] | 
            
  | 10:22 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Set db2213 with weight 0 T382743', diff saved to https://phabricator.wikimedia.org/P71744 and previous config saved to /var/cache/conftool/dbconfig/20241224-102200-ladsgroup.json | 
  [production] | 
            
  | 10:21 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s5 T382743 | 
  [production] | 
            
  | 10:21 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 1:00:00 on 25 hosts with reason: Primary switchover s5 T382743 | 
  [production] | 
            
  | 10:21 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Set s5 codfw as read-only for maintenance - T382743', diff saved to https://phabricator.wikimedia.org/P71743 and previous config saved to /var/cache/conftool/dbconfig/20241224-102102-ladsgroup.json | 
  [production] | 
            
  | 09:39 | 
  <akosiaris@cumin1002> | 
  conftool action : set/weight=10; selector: dc=eqiad,cluster=kubernetes,service=kubemaster,name=wikikube-ctrl1004.eqiad.wmnet | 
  [production] | 
            
  | 09:39 | 
  <akosiaris@cumin1002> | 
  conftool action : set/pooled=yes; selector: dc=eqiad,cluster=kubernetes,service=kubemaster,name=wikikube-ctrl1004.eqiad.wmnet | 
  [production] | 
            
  | 05:01 | 
  <mwpresync@deploy2002> | 
  Pruned MediaWiki: 1.44.0-wmf.5 (duration: 01m 21s) | 
  [production] | 
            
  
    | 
      
        2024-12-23
      
      §
     | 
  
    
  | 23:54 | 
  <wfan> | 
  payments-wiki upgraded from 65775042 to 8294c9ec | 
  [production] | 
            
  | 22:18 | 
  <zabe@deploy2002> | 
  Finished scap sync-world: T382717 (duration: 15m 07s) | 
  [production] | 
            
  | 22:06 | 
  <zabe@deploy2002> | 
  zabe: Continuing with sync | 
  [production] | 
            
  | 22:05 | 
  <zabe@deploy2002> | 
  zabe: T382717 synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 22:03 | 
  <zabe@deploy2002> | 
  Started scap sync-world: T382717 | 
  [production] | 
            
  | 22:03 | 
  <zabe@deploy2002> | 
  Sync cancelled. | 
  [production] | 
            
  | 22:03 | 
  <zabe@deploy2002> | 
  zabe: T382717 synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | 
  [production] | 
            
  | 21:55 | 
  <zabe@deploy2002> | 
  Started scap sync-world: T382717 | 
  [production] | 
            
  | 21:54 | 
  <zabe@deploy2002> | 
  scap failed: <CalledProcessError> Command 'sudo -u mwbuilder /srv/mwbuilder/release/make-container-image/build-images.py /srv/mediawiki-staging/scap/image-build --staging-dir /srv/mediawiki-staging --mediawiki-versions 1.44.0-wmf.8 --multiversion-image-name docker-registry.discovery.wmnet/restricted/mediawiki-multiversion --multiversion-debug-image-name docker-registry.discovery.wmnet/restricted/media | 
  [production] | 
            
  | 21:53 | 
  <zabe@deploy2002> | 
  Started scap sync-world: T382717 | 
  [production] | 
            
  | 21:24 | 
  <zabe@deploy2002> | 
  Started scap sync-world: Backport for [[gerrit:1106340|Fix Azeri alias lang code (T382717 T381048)]] | 
  [production] | 
            
  | 21:04 | 
  <Emperor> | 
  depool/restart/repoo ms-fe1013 | 
  [production] | 
            
  | 20:59 | 
  <mvernon@cumin2002> | 
  conftool action : set/pooled=true; selector: dnsdisc=swift,name=codfw | 
  [production] | 
            
  | 20:37 | 
  <Emperor> | 
  cumin run on swift nodes | 
  [production] | 
            
  | 20:16 | 
  <Emperor> | 
  weighted ms-be2075 to zero T382705 T382707 | 
  [production] | 
            
  | 19:33 | 
  <Emperor> | 
  restart swift-container-reconciler on ms-be1075 | 
  [production] | 
            
  | 17:22 | 
  <Emperor> | 
  swift delete wikipedia-commons-local-public.88 8/88/Model_4000-First_of_Odakyu_Electric_Railway_2.JPG T382694 | 
  [production] | 
            
  | 16:20 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1031-1033].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) | 
  [production] | 
            
  | 16:20 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1033.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 16:01 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1033.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:56 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1033.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:39 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wikikube-worker1033.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 15:37 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1032.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 15:14 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1032.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 15:11 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1032.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 14:58 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.k8s.roll-reimage-nodes (exit_code=0) rolling reimage on P{wikikube-worker[1028-1030].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) | 
  [production] | 
            
  | 14:58 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1030.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:52 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wikikube-worker1032.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:51 | 
  <akosiaris@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl1004.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:50 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1031.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:39 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1030.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 14:35 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1030.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 14:32 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1031.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 14:27 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1031.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 14:19 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wikikube-worker1030.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:17 | 
  <jayme@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1029.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:11 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host wikikube-worker1031.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 14:10 | 
  <mvernon@cumin2002> | 
  conftool action : set/pooled=false; selector: dnsdisc=swift,name=codfw | 
  [production] | 
            
  | 14:10 | 
  <jayme@cumin1002> | 
  START - Cookbook sre.k8s.roll-reimage-nodes rolling reimage on P{wikikube-worker[1031-1033].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) | 
  [production] |