| 2025-01-28 |
  | 14:21 | <jelto> | Imported helm311 | 3.11.3-3 to bookworm-wikimedia - T341984 | [production] | 
            
  | 14:18 | <lucaswerkmeister-wmde@deploy2002> | lucaswerkmeister-wmde, daimona: Continuing with sync | [production] | 
            
  | 14:18 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. | [production] | 
            
  | 14:18 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. | [production] | 
            
  | 14:11 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti2026.codfw.wmnet to cluster codfw and group D | [production] | 
            
  | 14:09 | <jmm@cumin2002> | START - Cookbook sre.ganeti.addnode for new host ganeti2026.codfw.wmnet to cluster codfw and group D | [production] | 
            
  | 14:09 | <lucaswerkmeister-wmde@deploy2002> | lucaswerkmeister-wmde, daimona: Backport for [[gerrit:1114440|prod: Enable $wgCampaignEventsEnableEventTopics (T380818)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 14:07 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P72626 and previous config saved to /var/cache/conftool/dbconfig/20250128-140715-marostegui.json | [production] | 
            
  | 14:06 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti2026.codfw.wmnet | [production] | 
            
  | 14:04 | <lucaswerkmeister-wmde@deploy2002> | Started scap sync-world: Backport for [[gerrit:1114440|prod: Enable $wgCampaignEventsEnableEventTopics (T380818)]] | [production] | 
            
  | 13:58 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti2026.codfw.wmnet | [production] | 
            
  | 13:52 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P72625 and previous config saved to /var/cache/conftool/dbconfig/20250128-135208-marostegui.json | [production] | 
            
  | 13:51 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply | [production] | 
            
  | 13:50 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply | [production] | 
            
  | 13:49 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti2026.codfw.wmnet with OS bookworm | [production] | 
            
  | 13:39 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply | [production] | 
            
  | 13:39 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply | [production] | 
            
  | 13:37 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1194 (T384592)', diff saved to https://phabricator.wikimedia.org/P72624 and previous config saved to /var/cache/conftool/dbconfig/20250128-133701-marostegui.json | [production] | 
            
  | 13:33 | <fabfur> | installing/enabling haproxykafka on eqiad (https://gerrit.wikimedia.org/r/c/operations/puppet/+/1114417) (T378578) | [production] | 
            
  | 13:27 | <root@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1166.eqiad.wmnet with reason: Index rebuild | [production] | 
            
  | 13:27 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti2026.codfw.wmnet with reason: host reimage | [production] | 
            
  | 13:26 | <root@cumin1002> | END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1166.eqiad.wmnet | [production] | 
            
  | 13:23 | <jmm@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti2026.codfw.wmnet with reason: host reimage | [production] | 
            
  | 13:22 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db1194 (T384592)', diff saved to https://phabricator.wikimedia.org/P72623 and previous config saved to /var/cache/conftool/dbconfig/20250128-132238-marostegui.json | [production] | 
            
  | 13:22 | <marostegui@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1194.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 13:22 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1191 (T384592)', diff saved to https://phabricator.wikimedia.org/P72622 and previous config saved to /var/cache/conftool/dbconfig/20250128-132227-marostegui.json | [production] | 
            
  | 13:22 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply | [production] | 
            
  | 13:20 | <brouberol@deploy2002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply | [production] | 
            
  | 13:19 | <fceratto@dns1004> | END - running authdns-update | [production] | 
            
  | 13:19 | <root@cumin1002> | START - Cookbook sre.mysql.upgrade for db1166.eqiad.wmnet | [production] | 
            
  | 13:18 | <root@cumin1002> | END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db2190 gradually with 4 steps - Repooling after rebuild index | [production] | 
            
  | 13:17 | <fceratto@dns1004> | START - running authdns-update | [production] | 
            
  | 13:15 | <dbrant@deploy2002> | helmfile [codfw] DONE helmfile.d/services/mobileapps: apply | [production] | 
            
  | 13:15 | <dbrant@deploy2002> | helmfile [codfw] START helmfile.d/services/mobileapps: apply | [production] | 
            
  | 13:14 | <dbrant@deploy2002> | helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply | [production] | 
            
  | 13:13 | <dbrant@deploy2002> | helmfile [eqiad] START helmfile.d/services/mobileapps: apply | [production] | 
            
  | 13:13 | <dbrant@deploy2002> | helmfile [staging] DONE helmfile.d/services/mobileapps: apply | [production] | 
            
  | 13:12 | <dbrant@deploy2002> | helmfile [staging] START helmfile.d/services/mobileapps: apply | [production] | 
            
  | 13:07 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P72619 and previous config saved to /var/cache/conftool/dbconfig/20250128-130720-marostegui.json | [production] | 
            
  | 13:07 | <dbrant@deploy2002> | helmfile [codfw] DONE helmfile.d/services/wikifeeds: apply | [production] | 
            
  | 13:06 | <dbrant@deploy2002> | helmfile [codfw] START helmfile.d/services/wikifeeds: apply | [production] | 
            
  | 13:06 | <dbrant@deploy2002> | helmfile [eqiad] DONE helmfile.d/services/wikifeeds: apply | [production] | 
            
  | 13:05 | <dbrant@deploy2002> | helmfile [eqiad] START helmfile.d/services/wikifeeds: apply | [production] | 
            
  | 13:04 | <dbrant@deploy2002> | helmfile [staging] DONE helmfile.d/services/wikifeeds: apply | [production] | 
            
  | 13:03 | <dbrant@deploy2002> | helmfile [staging] START helmfile.d/services/wikifeeds: apply | [production] | 
            
  | 13:03 | <jmm@cumin2002> | START - Cookbook sre.hosts.reimage for host ganeti2026.codfw.wmnet with OS bookworm | [production] | 
            
  | 13:02 | <jmm@cumin2002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ganeti2026.codfw.wmnet with OS bookworm | [production] | 
            
  | 12:52 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P72617 and previous config saved to /var/cache/conftool/dbconfig/20250128-125213-marostegui.json | [production] | 
            
  | 12:51 | <cmooney@cumin1002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on netflow3003.esams.wmnet with reason: disabling alerts as I'm running gnmic manually rather than with systemd | [production] | 
            
  | 12:50 | <andrew@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudgw1003.eqiad.wmnet with OS bookworm | [production] |