| 2021-08-31
      
      § | 
    
  | 07:44 | <marostegui> | Optimize ruwiki.flaggedtemplates T290057 | [production] | 
            
  | 07:01 | <XioNoX> | drain eqsin-codfw link | [production] | 
            
  | 06:56 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2110 (re)pooling @ 100%: Slowly repool after reimage T288803', diff saved to https://phabricator.wikimedia.org/P17113 and previous config saved to /var/cache/conftool/dbconfig/20210831-065600-root.json | [production] | 
            
  | 06:40 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2110 (re)pooling @ 75%: Slowly repool after reimage T288803', diff saved to https://phabricator.wikimedia.org/P17112 and previous config saved to /var/cache/conftool/dbconfig/20210831-064056-root.json | [production] | 
            
  | 06:25 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2110 (re)pooling @ 50%: Slowly repool after reimage T288803', diff saved to https://phabricator.wikimedia.org/P17111 and previous config saved to /var/cache/conftool/dbconfig/20210831-062553-root.json | [production] | 
            
  | 06:10 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2110 (re)pooling @ 25%: Slowly repool after reimage T288803', diff saved to https://phabricator.wikimedia.org/P17110 and previous config saved to /var/cache/conftool/dbconfig/20210831-061049-root.json | [production] | 
            
  | 06:06 | <marostegui> | Rename flaggedrevs_stats2 and flaggedrevs_stats on dewiki codfw T289050 | [production] | 
            
  | 05:55 | <marostegui@cumin1001> | dbctl commit (dc=all): 'db2110 (re)pooling @ 10%: Slowly repool after reimage T288803', diff saved to https://phabricator.wikimedia.org/P17109 and previous config saved to /var/cache/conftool/dbconfig/20210831-055546-root.json | [production] | 
            
  | 03:39 | <eileen> | civicrm revision changed from e89504652a to 718aa9cad3, config revision is cb0a008cad | [production] | 
            
  | 02:33 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 02:31 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 02:09 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 02:08 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 02:04 | <eileen> | tools revision changed from 14e4125f73 to 1d67c52c12 | [production] | 
            
  
    | 2021-08-30
      
      § | 
    
  | 23:14 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 23:13 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 23:11 | <urbanecm> | Evening B&C done | [production] | 
            
  | 23:11 | <urbanecm@deploy1002> | Synchronized php-1.37.0-wmf.20/extensions/GrowthExperiments/includes/Specials/SpecialMentorDashboard.php: 9e2264a0c9a48548da4795b2a5b9d7275d254ac7: Instrument Special:MentorDashboard (T289369) (duration: 00m 55s) | [production] | 
            
  | 23:08 | <urbanecm@deploy1002> | Synchronized php-1.37.0-wmf.20/extensions/GrowthExperiments/includes/Specials/SpecialHomepage.php: 9e2264a0c9a48548da4795b2a5b9d7275d254ac7: Instrument Special:MentorDashboard (T289369) (duration: 00m 57s) | [production] | 
            
  | 21:56 | <eileen> | civicrm revision changed from 13bf3a02df to e89504652a, config revision is cb0a008cad | [production] | 
            
  | 19:59 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 19:57 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 19:52 | <urbanecm@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: 9a92e2ae7526717a0a42b825a34b4595e75a544b: Fix mediawiki.mentor_dashboard.visits definition (duration: 00m 56s) | [production] | 
            
  | 19:08 | <tgr> | morning deploys done for real | [production] | 
            
  | 19:06 | <tgr@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:715579|Fix schema definition for mediawiki.mentor_dashboard.visit (T289369)]] (duration: 00m 56s) | [production] | 
            
  | 19:05 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 19:03 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:49 | <tgr@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: Config: Revert: [[gerrit:715529|Add mediawiki.mentor_dashboard.visit schema (T289369)]] (duration: 00m 26s) | [production] | 
            
  | 18:48 | <tgr@deploy1002> | Scap failed!: 5/6 canaries failed their endpoint checks(https://en.wikipedia.org) | [production] | 
            
  | 18:43 | <tgr> | morning deploys done | [production] | 
            
  | 18:43 | <tgr@deploy1002> | scap failed: average error rate on 3/6 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/83629bcb5560d11e61d3085c89dd9ed6 for details) | [production] | 
            
  | 18:41 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:38 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:26 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:24 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:22 | <tgr@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:715568|GrowthExperiments: Enable link recommendation for dewiki and nlwiki (T288420 T285254)]] (duration: 00m 56s) | [production] | 
            
  | 18:18 | <mwdebug-deploy@deploy1002> | helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:16 | <mwdebug-deploy@deploy1002> | helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | [production] | 
            
  | 18:14 | <tgr@deploy1002> | Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:714548|GrowthExperiments: Switch image recommendations flag off (T288797)]] (duration: 00m 57s) | [production] | 
            
  | 17:44 | <ryankemper> | [WDQS Deploy] Test query passing on `query.wikidata.org` and icinga looks good. This deploy is done. | [production] | 
            
  | 17:12 | <ryankemper> | [WDQS Deploy] Restarting `wdqs-categories` across lvs-managed hosts, one node at a time: `sudo -E cumin -b 1 'A:wdqs-all and not A:wdqs-test' 'depool && sleep 45 && systemctl restart wdqs-categories && sleep 45 && pool'` | [production] | 
            
  | 17:12 | <ryankemper> | [WDQS Deploy] Restarted `wdqs-categories` across both test hosts simultaneously: `sudo -E cumin 'A:wdqs-test' 'systemctl restart wdqs-categories'` | [production] | 
            
  | 17:12 | <ryankemper> | [WDQS Deploy] Restarted `wdqs-updater` across all hosts, 4 hosts at a time: `sudo -E cumin -b 4 'A:wdqs-all' 'systemctl restart wdqs-updater'` | [production] | 
            
  | 17:10 | <ryankemper@deploy1002> | Finished deploy [wdqs/wdqs@a17833c]: 0.3.84 (duration: 08m 16s) | [production] | 
            
  | 17:04 | <ryankemper> | [WDQS Deploy] Tests passing following deploy of `0.3.84` on canary `wdqs1003`; proceeding to rest of fleet | [production] | 
            
  | 17:02 | <ryankemper@deploy1002> | Started deploy [wdqs/wdqs@a17833c]: 0.3.84 | [production] | 
            
  | 17:02 | <ryankemper> | [WDQS Deploy] Gearing up for deploy of wdqs `0.3.84`. Pre-deploy tests passing on canary `wdqs1003` | [production] | 
            
  | 17:00 | <ryankemper> | T289483 Pooled `wdqs1013` | [production] | 
            
  | 16:36 | <dzahn@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ganeti1024.eqiad.wmnet with reason: REIMAGE | [production] | 
            
  | 16:34 | <dzahn@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1024.eqiad.wmnet with reason: REIMAGE | [production] |