| 2024-07-22
      
      ยง | 
    
  | 13:07 | <claime> | power cycling rdb1014.eqiad.wmnet | [production] | 
            
  | 12:22 | <godog> | restore retention.ms=172800000 for mediawiki.httpd.accesslog | [production] | 
            
  | 11:54 | <hnowlan@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 11:53 | <hnowlan@deploy1002> | helmfile [eqiad] START helmfile.d/services/shellbox-video: apply | [production] | 
            
  | 11:17 | <ladsgroup@deploy1002> | Finished scap: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] (duration: 08m 02s) | [production] | 
            
  | 11:12 | <ladsgroup@deploy1002> | ebrahim, ladsgroup: Continuing with sync | [production] | 
            
  | 11:11 | <ladsgroup@deploy1002> | ebrahim, ladsgroup: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 11:09 | <ladsgroup@deploy1002> | Started scap sync-world: Backport for [[gerrit:1054641|Enable ICU provided alphabetical order in the Kurdish wikis categories (T48235)]] | [production] | 
            
  | 10:33 | <volans> | upgraded manually prometheus-ipmi-exporter to v 1.8.0-1~wmf12+1 on db1179 (leftover because was down) T368088 | [production] | 
            
  | 10:32 | <Dreamy_Jazz> | Running `mwscript extensions/MediaModeration/maintenance/updateMetrics.php --wiki=commonswiki --verbose` | [production] | 
            
  | 10:28 | <Dreamy_Jazz> | Restarting MediaModeration scanning script - https://wikitech.wikimedia.org/wiki/MediaModeration | [production] | 
            
  | 10:24 | <elukey> | kafka preferred-replica-election on kafka-main - T370574 | [production] | 
            
  | 09:51 | <godog> | set mediawiki.httpd.accesslog topic retention to 26h temporarily | [production] | 
            
  | 09:50 | <mlitn@deploy1002> | Finished scap: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] (duration: 08m 19s) | [production] | 
            
  | 09:45 | <mlitn@deploy1002> | cparle, mlitn: Continuing with sync | [production] | 
            
  | 09:44 | <mlitn@deploy1002> | cparle, mlitn: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 09:42 | <mlitn@deploy1002> | Started scap sync-world: Backport for [[gerrit:1055258|Reduce weight of 'main subject' as it's used inconsistently (T367774)]] | [production] | 
            
  | 09:40 | <claime> | homer 'cr*codfw*' commit 'T351074' | [production] | 
            
  | 09:30 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 | [production] | 
            
  | 09:21 | <ayounsi@cumin1002> | START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 | [production] | 
            
  | 09:03 | <ayounsi@cumin1002> | END (FAIL) - Cookbook sre.deploy.python-code (exit_code=99) netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 | [production] | 
            
  | 09:00 | <ayounsi@cumin1002> | START - Cookbook sre.deploy.python-code netbox to netbox2003.codfw.wmnet,netbox1003.eqiad.wmnet with reason: Release v4.0.7 to future netbox prod - ayounsi@cumin1002 - T336275 | [production] | 
            
  | 08:56 | <godog> | rebalance mediawiki.httpd.accesslog partitions across brokers - T370129 | [production] | 
            
  | 08:55 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) | [production] | 
            
  | 08:50 | <ayounsi@cumin1002> | START - Cookbook sre.postgresql.postgres-init | [production] | 
            
  | 08:32 | <elukey> | restart kafka on kafka-main2005 - T370574 | [production] | 
            
  | 08:31 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt | [production] | 
            
  | 08:30 | <elukey@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2005.codfw.wmnet with reason: restart attempt | [production] | 
            
  | 08:24 | <brouberol@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply | [production] | 
            
  | 08:23 | <brouberol@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply | [production] | 
            
  | 08:07 | <elukey> | restart kafka on kafka-main2001 - T370574 | [production] | 
            
  | 08:06 | <elukey> | restart kafka on kafka-main2001 - sre.hosts.downtime | [production] | 
            
  | 08:06 | <elukey@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt | [production] | 
            
  | 08:05 | <elukey@cumin1002> | START - Cookbook sre.hosts.downtime for 0:30:00 on kafka-main2001.codfw.wmnet with reason: restart attempt | [production] | 
            
  | 08:03 | <brouberol@cumin1002> | END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts karapace1002.eqiad.wmnet | [production] | 
            
  | 08:00 | <brouberol@cumin1002> | START - Cookbook sre.hosts.decommission for hosts karapace1002.eqiad.wmnet | [production] | 
            
  | 07:39 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work | [production] | 
            
  | 07:39 | <ayounsi@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work | [production] | 
            
  | 07:35 | <stran@deploy1002> | Finished scap: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] (duration: 12m 18s) | [production] | 
            
  | 07:30 | <stran@deploy1002> | stran: Continuing with sync | [production] | 
            
  | 07:25 | <stran@deploy1002> | stran: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 07:23 | <stran@deploy1002> | Started scap sync-world: Backport for [[gerrit:1055771|IPInfoHandler: Move token param definition to getBodyParamSettings (T370500)]] | [production] | 
            
  | 07:12 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work | [production] | 
            
  | 07:12 | <ayounsi@cumin1002> | START - Cookbook sre.hosts.downtime for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work | [production] | 
            
  | 02:55 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depooling db2170 (T367856)', diff saved to https://phabricator.wikimedia.org/P66880 and previous config saved to /var/cache/conftool/dbconfig/20240722-025552-marostegui.json | [production] | 
            
  | 02:55 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2170.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 02:55 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db2170.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 02:55 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2153 (T367856)', diff saved to https://phabricator.wikimedia.org/P66879 and previous config saved to /var/cache/conftool/dbconfig/20240722-025530-marostegui.json | [production] | 
            
  | 02:40 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P66878 and previous config saved to /var/cache/conftool/dbconfig/20240722-024023-marostegui.json | [production] | 
            
  | 02:25 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P66877 and previous config saved to /var/cache/conftool/dbconfig/20240722-022516-marostegui.json | [production] |