| 2025-09-01
      
      ยง | 
    
  | 11:25 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.changedisk (exit_code=0) for changing disk type of install3003.wikimedia.org to plain | [production] | 
            
  | 11:24 | <jmm@cumin2002> | START - Cookbook sre.ganeti.changedisk for changing disk type of install3003.wikimedia.org to plain | [production] | 
            
  | 11:22 | <ayounsi@cumin1003> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 11:22 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3006.esams.wmnet | [production] | 
            
  | 11:21 | <ladsgroup@cumin1003> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1223.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 11:20 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3006.esams.wmnet | [production] | 
            
  | 11:16 | <ladsgroup@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1183632|ParserTestRunner: Update category counts for articles (T365303)]], [[gerrit:1183633|CategoryCacheTest: Update category count]], [[gerrit:1183269|Drop support for categorylinks read old (T299951 T403147 T403337)]] (duration: 12m 28s) | [production] | 
            
  | 11:12 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P82300 and previous config saved to /var/cache/conftool/dbconfig/20250901-111232-fceratto.json | [production] | 
            
  | 11:11 | <ladsgroup@deploy1003> | ladsgroup: Continuing with sync | [production] | 
            
  | 11:09 | <ladsgroup@deploy1003> | ladsgroup: Backport for [[gerrit:1183632|ParserTestRunner: Update category counts for articles (T365303)]], [[gerrit:1183633|CategoryCacheTest: Update category count]], [[gerrit:1183269|Drop support for categorylinks read old (T299951 T403147 T403337)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 11:03 | <ladsgroup@deploy1003> | Started scap sync-world: Backport for [[gerrit:1183632|ParserTestRunner: Update category counts for articles (T365303)]], [[gerrit:1183633|CategoryCacheTest: Update category count]], [[gerrit:1183269|Drop support for categorylinks read old (T299951 T403147 T403337)]] | [production] | 
            
  | 10:57 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2177 (T401906)', diff saved to https://phabricator.wikimedia.org/P82299 and previous config saved to /var/cache/conftool/dbconfig/20250901-105724-fceratto.json | [production] | 
            
  | 10:45 | <moritzm> | installing luajit security updates | [production] | 
            
  | 10:44 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Depooling db2177 (T401906)', diff saved to https://phabricator.wikimedia.org/P82298 and previous config saved to /var/cache/conftool/dbconfig/20250901-104407-fceratto.json | [production] | 
            
  | 10:44 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2177.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 10:43 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2156 (T401906)', diff saved to https://phabricator.wikimedia.org/P82297 and previous config saved to /var/cache/conftool/dbconfig/20250901-104345-fceratto.json | [production] | 
            
  | 10:28 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P82296 and previous config saved to /var/cache/conftool/dbconfig/20250901-102837-fceratto.json | [production] | 
            
  | 10:13 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P82295 and previous config saved to /var/cache/conftool/dbconfig/20250901-101330-fceratto.json | [production] | 
            
  | 10:07 | <jmm@cumin2002> | START - Cookbook sre.postgresql.postgres-init | [production] | 
            
  | 10:06 | <jmm@cumin2002> | START - Cookbook sre.postgresql.postgres-init | [production] | 
            
  | 10:05 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.postgresql.postgres-init (exit_code=99) | [production] | 
            
  | 10:04 | <jmm@cumin2002> | START - Cookbook sre.postgresql.postgres-init | [production] | 
            
  | 10:01 | <ladsgroup@cumin1003> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 10:00 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Repooling after maintenance db1179 (T403362)', diff saved to https://phabricator.wikimedia.org/P82294 and previous config saved to /var/cache/conftool/dbconfig/20250901-100054-ladsgroup.json | [production] | 
            
  | 09:58 | <dcausse@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] (duration: 11m 12s) | [production] | 
            
  | 09:58 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2156 (T401906)', diff saved to https://phabricator.wikimedia.org/P82293 and previous config saved to /var/cache/conftool/dbconfig/20250901-095822-fceratto.json | [production] | 
            
  | 09:53 | <dcausse@deploy1003> | dcausse: Continuing with sync | [production] | 
            
  | 09:52 | <dcausse@deploy1003> | dcausse: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 09:47 | <dcausse@deploy1003> | Started scap sync-world: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] | [production] | 
            
  | 09:47 | <hnowlan@deploy1003> | helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 09:47 | <hnowlan@deploy1003> | helmfile [eqiad] START helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 09:45 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Repooling after maintenance db1179', diff saved to https://phabricator.wikimedia.org/P82292 and previous config saved to /var/cache/conftool/dbconfig/20250901-094547-ladsgroup.json | [production] | 
            
  | 09:45 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Depooling db2156 (T401906)', diff saved to https://phabricator.wikimedia.org/P82291 and previous config saved to /var/cache/conftool/dbconfig/20250901-094504-fceratto.json | [production] | 
            
  | 09:44 | <fceratto@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2156.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 09:44 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2149 (T401906)', diff saved to https://phabricator.wikimedia.org/P82290 and previous config saved to /var/cache/conftool/dbconfig/20250901-094442-fceratto.json | [production] | 
            
  | 09:43 | <hnowlan@deploy1003> | helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 09:43 | <hnowlan@deploy1003> | helmfile [codfw] START helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 09:41 | <hnowlan@deploy1003> | helmfile [staging] DONE helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 09:41 | <hnowlan@deploy1003> | helmfile [staging] START helmfile.d/services/rest-gateway: apply | [production] | 
            
  | 09:38 | <dcausse@deploy1003> | dcausse: Continuing with sync | [production] | 
            
  | 09:33 | <dcausse@deploy1003> | dcausse: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 09:30 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Repooling after maintenance db1179', diff saved to https://phabricator.wikimedia.org/P82289 and previous config saved to /var/cache/conftool/dbconfig/20250901-093039-ladsgroup.json | [production] | 
            
  | 09:29 | <fceratto@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2149', diff saved to https://phabricator.wikimedia.org/P82288 and previous config saved to /var/cache/conftool/dbconfig/20250901-092934-fceratto.json | [production] | 
            
  | 09:27 | <dcausse@deploy1003> | Started scap sync-world: Backport for [[gerrit:1183454|SECURITY: declare PoolCounter settings for cirrusbuilddoc (T401220)]] | [production] | 
            
  | 09:25 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/postgresql-airflow-main: apply | [production] | 
            
  | 09:25 | <brouberol@deploy1003> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/postgresql-airflow-main: apply | [production] | 
            
  | 09:24 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti3005.esams.wmnet with OS bookworm | [production] | 
            
  | 09:24 | <dcausse@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1183112|hCaptcha: Provide label/help in authmanagerinfo API calls (T403253)]] (duration: 16m 15s) | [production] | 
            
  | 09:19 | <dcausse@deploy1003> | kharlan, dcausse: Continuing with sync | [production] | 
            
  | 09:15 | <ladsgroup@cumin1003> | dbctl commit (dc=all): 'Repooling after maintenance db1179 (T403362)', diff saved to https://phabricator.wikimedia.org/P82287 and previous config saved to /var/cache/conftool/dbconfig/20250901-091531-ladsgroup.json | [production] |