| 
      
        2023-02-02
      
      §
     | 
  
    
  | 14:39 | 
  <cgoubert@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 14:31 | 
  <btullis@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2005.codfw.wmnet | 
  [production] | 
            
  | 14:29 | 
  <cgoubert@deploy1002> | 
  helmfile [staging] START helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 14:25 | 
  <moritzm> | 
  installing containerd security updates on codfw k8s nodes | 
  [production] | 
            
  | 14:24 | 
  <btullis@cumin1001> | 
  START - Cookbook sre.hosts.reboot-single for host aqs2005.codfw.wmnet | 
  [production] | 
            
  | 13:34 | 
  <sukhe@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=cp1076.eqiad.wmnet,service=ats-be | 
  [production] | 
            
  | 13:34 | 
  <sukhe@puppetmaster1001> | 
  conftool action : set/pooled=yes; selector: name=cp1076.eqiad.wmnet,service=cdn | 
  [production] | 
            
  | 13:10 | 
  <kharlan:> | 
  Deployed security patch for T328643 | 
  [production] | 
            
  | 13:09 | 
  <sukhe@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1076.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 13:04 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . | 
  [production] | 
            
  | 13:03 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . | 
  [production] | 
            
  | 13:03 | 
  <kharlan:> | 
  Deployed security patch for T328643 | 
  [production] | 
            
  | 13:02 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . | 
  [production] | 
            
  | 13:01 | 
  <btullis@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2004.codfw.wmnet | 
  [production] | 
            
  | 13:00 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . | 
  [production] | 
            
  | 12:55 | 
  <btullis@cumin1001> | 
  START - Cookbook sre.hosts.reboot-single for host aqs2004.codfw.wmnet | 
  [production] | 
            
  | 12:47 | 
  <sukhe@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1076.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 12:47 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . | 
  [production] | 
            
  | 12:46 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-editquality-damaging' for release 'main' . | 
  [production] | 
            
  | 12:44 | 
  <sukhe@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on cp1076.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 12:42 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . | 
  [production] | 
            
  | 12:42 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . | 
  [production] | 
            
  | 12:39 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . | 
  [production] | 
            
  | 12:39 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . | 
  [production] | 
            
  | 12:29 | 
  <btullis@deploy1002> | 
  Finished deploy [analytics/superset/deploy@5175ad7]: Production deployment for numpy downgrade (duration: 00m 42s) | 
  [production] | 
            
  | 12:29 | 
  <claime> | 
  Work ongoing on m2 and m3 | 
  [production] | 
            
  | 12:29 | 
  <btullis@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2003.codfw.wmnet | 
  [production] | 
            
  | 12:29 | 
  <btullis@deploy1002> | 
  Started deploy [analytics/superset/deploy@5175ad7]: Production deployment for numpy downgrade | 
  [production] | 
            
  | 12:23 | 
  <sukhe@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host cp1076.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 12:21 | 
  <btullis@cumin1001> | 
  START - Cookbook sre.hosts.reboot-single for host aqs2003.codfw.wmnet | 
  [production] | 
            
  | 12:08 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . | 
  [production] | 
            
  | 12:08 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . | 
  [production] | 
            
  | 11:46 | 
  <isaranto@deploy1002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . | 
  [production] | 
            
  | 11:42 | 
  <mvolz@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:42 | 
  <mvolz@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:41 | 
  <mvolz@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:41 | 
  <mvolz@deploy1002> | 
  helmfile [eqiad] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:40 | 
  <mvolz@deploy1002> | 
  helmfile [codfw] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:39 | 
  <mvolz@deploy1002> | 
  helmfile [codfw] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:38 | 
  <mvolz@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:37 | 
  <mvolz@deploy1002> | 
  helmfile [staging] START helmfile.d/services/citoid: apply | 
  [production] | 
            
  | 11:37 | 
  <Lucas_WMDE> | 
  lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix | tee T328634-namespaceDupes-4.out # T328634 – made some progress then errored out again | 
  [production] | 
            
  | 11:32 | 
  <Lucas_WMDE> | 
  lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix --add-prefix=T328634/ | tee T328634-namespaceDupes-3.out # T328634 – seemed to finish the first 20 pages and then go into an infinite loop, I Ctrl+Ced it | 
  [production] | 
            
  | 11:28 | 
  <Lucas_WMDE> | 
  lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix --add-prefix=T328634/ | tee T328634-namespaceDupes-2.out # T328634 – another error but made more progress | 
  [production] | 
            
  | 11:23 | 
  <Lucas_WMDE> | 
  lucaswerkmeister-wmde@mwmaint1002:~$ mwscript namespaceDupes.php shnwikibooks --fix | tee T328634-namespaceDupes.out # T328634 – failed quickly, details in task | 
  [production] | 
            
  | 11:22 | 
  <elukey@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/changeprop: sync | 
  [production] | 
            
  | 11:22 | 
  <elukey@deploy1002> | 
  helmfile [staging] START helmfile.d/services/changeprop: sync | 
  [production] | 
            
  | 11:12 | 
  <mvolz@deploy1002> | 
  helmfile [staging] DONE helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 11:02 | 
  <mvolz@deploy1002> | 
  helmfile [staging] START helmfile.d/services/zotero: apply | 
  [production] | 
            
  | 10:27 | 
  <btullis@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host aqs2002.codfw.wmnet | 
  [production] |