| 2024-07-16
      
      ยง | 
    
  | 08:13 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1157.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 08:13 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db1157.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 08:11 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1157 (re)pooling @ 10%: Repooling', diff saved to https://phabricator.wikimedia.org/P66586 and previous config saved to /var/cache/conftool/dbconfig/20240716-081129-root.json | [production] | 
            
  | 08:09 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1150.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 08:09 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 4:00:00 on db1150.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 08:07 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1157 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P66585 and previous config saved to /var/cache/conftool/dbconfig/20240716-080720-root.json | [production] | 
            
  | 08:07 | <marostegui@cumin1002> | dbctl commit (dc=all): 'db1174 (re)pooling @ 1%: Repooling', diff saved to https://phabricator.wikimedia.org/P66584 and previous config saved to /var/cache/conftool/dbconfig/20240716-080707-root.json | [production] | 
            
  | 07:46 | <klausman@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1006.eqiad.wmnet | [production] | 
            
  | 07:40 | <Dreamy_Jazz> | Morning UTC backport window done | [production] | 
            
  | 07:38 | <klausman@cumin1002> | START - Cookbook sre.hosts.reboot-single for host ml-serve1006.eqiad.wmnet | [production] | 
            
  | 07:38 | <klausman@cumin1002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve1002.eqiad.wmnet | [production] | 
            
  | 07:29 | <Dreamy_Jazz> | Restarted MediaModeration scanning scrpt | [production] | 
            
  | 07:28 | <klausman@cumin1002> | START - Cookbook sre.hosts.reboot-single for host ml-serve1002.eqiad.wmnet | [production] | 
            
  | 07:19 | <dreamyjazz@deploy1002> | Finished scap: Backport for [[gerrit:1053297|[CheckUser] Remove wgCheckUserEventTablesMigrationStage config (T366546)]] (duration: 12m 09s) | [production] | 
            
  | 07:14 | <dreamyjazz@deploy1002> | dreamyjazz: Continuing with sync | [production] | 
            
  | 07:14 | <dreamyjazz@deploy1002> | dreamyjazz: Backport for [[gerrit:1053297|[CheckUser] Remove wgCheckUserEventTablesMigrationStage config (T366546)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 07:13 | <volans@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 07:13 | <volans@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Merging pending changes for frack hosts as per IRC discussion - volans@cumin1002" | [production] | 
            
  | 07:10 | <volans@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Merging pending changes for frack hosts as per IRC discussion - volans@cumin1002" | [production] | 
            
  | 07:07 | <dreamyjazz@deploy1002> | Started scap sync-world: Backport for [[gerrit:1053297|[CheckUser] Remove wgCheckUserEventTablesMigrationStage config (T366546)]] | [production] | 
            
  | 07:07 | <volans@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 06:59 | <ayounsi@cumin1002> | END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 52999 | [production] | 
            
  | 06:59 | <ayounsi@cumin1002> | START - Cookbook sre.network.peering with action 'configure' for AS: 52999 | [production] | 
            
  | 06:18 | <kart_> | Updated cxserver to 2024-07-15-100650-production (T354666) | [production] | 
            
  | 06:16 | <kartik@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/cxserver: apply | [production] | 
            
  | 06:16 | <kartik@deploy1002> | helmfile [eqiad] START helmfile.d/services/cxserver: apply | [production] | 
            
  | 06:12 | <kevinbazira@deploy1002> | helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'readability' for release 'main' . | [production] | 
            
  | 06:12 | <kevinbazira@deploy1002> | helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'readability' for release 'main' . | [production] | 
            
  | 06:11 | <kartik@deploy1002> | helmfile [codfw] DONE helmfile.d/services/cxserver: apply | [production] | 
            
  | 06:11 | <kartik@deploy1002> | helmfile [codfw] START helmfile.d/services/cxserver: apply | [production] | 
            
  | 06:06 | <kartik@deploy1002> | helmfile [staging] DONE helmfile.d/services/cxserver: apply | [production] | 
            
  | 06:05 | <kartik@deploy1002> | helmfile [staging] START helmfile.d/services/cxserver: apply | [production] | 
            
  | 05:43 | <marostegui> | Deploy schema change on s7 eqiad db1174 dbmaint T367856 | [production] | 
            
  | 05:43 | <marostegui> | Deploy schema change on s3 eqiad db1157 dbmaint T367856 | [production] | 
            
  | 05:25 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2116.codfw.wmnet with reason: Long schema change | [production] | 
            
  | 05:25 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2116.codfw.wmnet with reason: Long schema change | [production] | 
            
  | 05:17 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1157.eqiad.wmnet with reason: Long schema change | [production] | 
            
  | 05:17 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1157.eqiad.wmnet with reason: Long schema change | [production] | 
            
  | 05:17 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depool db1157 T370019', diff saved to https://phabricator.wikimedia.org/P66581 and previous config saved to /var/cache/conftool/dbconfig/20240716-051718-root.json | [production] | 
            
  | 05:15 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Promote db1223 to s3 primary and set section read-write T370019', diff saved to https://phabricator.wikimedia.org/P66580 and previous config saved to /var/cache/conftool/dbconfig/20240716-051538-root.json | [production] | 
            
  | 05:15 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Set s3 eqiad as read-only for maintenance - T370019', diff saved to https://phabricator.wikimedia.org/P66579 and previous config saved to /var/cache/conftool/dbconfig/20240716-051516-root.json | [production] | 
            
  | 05:15 | <marostegui> | Starting s3 eqiad failover from db1157 to db1223 - T370019 | [production] | 
            
  | 04:58 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Set db1223 with weight 0 T370019', diff saved to https://phabricator.wikimedia.org/P66578 and previous config saved to /var/cache/conftool/dbconfig/20240716-045839-root.json | [production] | 
            
  | 04:58 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1174.eqiad.wmnet with reason: Long schema change | [production] | 
            
  | 04:58 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1174.eqiad.wmnet with reason: Long schema change | [production] | 
            
  | 04:58 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Depool db1174', diff saved to https://phabricator.wikimedia.org/P66577 and previous config saved to /var/cache/conftool/dbconfig/20240716-045807-marostegui.json | [production] | 
            
  | 04:57 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 25 hosts with reason: Primary switchover s3 T370019 | [production] | 
            
  | 04:57 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1:00:00 on 25 hosts with reason: Primary switchover s3 T370019 | [production] | 
            
  | 04:01 | <mwpresync@deploy1002> | Pruned MediaWiki: 1.43.0-wmf.11 (duration: 00m 58s) | [production] | 
            
  | 03:53 | <mwpresync@deploy1002> | Finished scap: testwikis wikis to 1.43.0-wmf.14  refs T366959 (duration: 50m 56s) | [production] |