| 
      
        2021-09-20
      
     | 
  
    
  | 10:41 | 
  <hnowlan> | 
  roll restarting kartotherian and tilerator on maps1* | 
  [production] | 
            
  | 10:36 | 
  <jynus> | 
  rolling restart bacula & minio daemons on backup hosts | 
  [production] | 
            
  | 09:59 | 
  <moritzm> | 
  restarting apache2 on thorium | 
  [production] | 
            
  | 09:48 | 
  <hnowlan@cumin1001> | 
  START - Cookbook sre.postgresql.postgres-init | 
  [production] | 
            
  | 09:47 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Remove s10 from eqiad T167973', diff saved to https://phabricator.wikimedia.org/P17300 and previous config saved to /var/cache/conftool/dbconfig/20210920-094739-marostegui.json | 
  [production] | 
            
  | 09:10 | 
  <moritzm> | 
  installing openssl1.0 updates for stretch with backport for forthcoming Let's Encrypt issuance chain update (T283165) | 
  [production] | 
            
  | 08:35 | 
  <moritzm> | 
  updating clamav on ticket.wikimedia.org/otrs1001 to 0.103.3 | 
  [production] | 
            
  | 08:02 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:58 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:58 | 
  <oblivian@deploy1002> | 
  helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:49 | 
  <moritzm> | 
  uploaded maps-deduped-tilelist 0.0.3~deb10u1 to buster-wikimedia/main T290982 | 
  [production] | 
            
  | 07:48 | 
  <moritzm> | 
  uploaded maps-deduped-tilelist 0.0.3~deb10u1 to buster-wikimedia/main | 
  [production] | 
            
  | 07:48 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:43 | 
  <oblivian@deploy1002> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:43 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:35 | 
  <marostegui> | 
  Stop db1168 and db2129 in sync T167973 | 
  [production] | 
            
  | 07:34 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:34 | 
  <urbanecm@deploy1002> | 
  Synchronized wmf-config/throttle.php: af9d6e4e29e5f53ad8cf5aa2c235d54500c433bd: Revert "Add throttle rule for Czech wiki course" (duration: 00m 56s) | 
  [production] | 
            
  | 07:32 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Depool db1168 T167973', diff saved to https://phabricator.wikimedia.org/P17299 and previous config saved to /var/cache/conftool/dbconfig/20210920-073256-marostegui.json | 
  [production] | 
            
  | 07:32 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Repool db1096:3316 T167973', diff saved to https://phabricator.wikimedia.org/P17298 and previous config saved to /var/cache/conftool/dbconfig/20210920-073206-marostegui.json | 
  [production] | 
            
  | 07:31 | 
  <marostegui@cumin1001> | 
  dbctl commit (dc=all): 'Depool db1096:3316 T167973', diff saved to https://phabricator.wikimedia.org/P17297 and previous config saved to /var/cache/conftool/dbconfig/20210920-073141-marostegui.json | 
  [production] | 
            
  | 07:31 | 
  <moritzm> | 
  uploaded PHP 7.2.34-18+0~20210223.60+debian10~1.gbpb21322+wmf2 to apt.wikimedia.org (component/php7.2 for buster-wikimedia) T291052 | 
  [production] | 
            
  | 07:29 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 07:28 | 
  <urbanecm@deploy1002> | 
  Synchronized wmf-config/InitialiseSettings.php: 8c1d665b5e83f6b1dd1cc4a9c367cb6881473bba: enwiki: Bump Growth features to 25% (mentorship limited to 20% of those users) (T290927) (duration: 00m 57s) | 
  [production] | 
            
  | 07:20 | 
  <urbanecm> | 
  Revert undeployed config patch (https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/721959); not even pulled to deployment, so assuming it never hit prod (T289771) | 
  [production] | 
            
  | 06:00 | 
  <marostegui> | 
  Upgrade db2071, db2072, db2094 | 
  [production] | 
            
  
    | 
      
        2021-09-17
      
     | 
  
    
  | 21:28 | 
  <legoktm@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 21:19 | 
  <legoktm@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:00 | 
  <hnowlan@cumin1001> | 
  END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) | 
  [production] | 
            
  | 17:02 | 
  <cmjohnson@cumin1001> | 
  END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cloudcephosd1022.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 17:02 | 
  <hnowlan@cumin1001> | 
  START - Cookbook sre.postgresql.postgres-init | 
  [production] | 
            
  | 17:00 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1022.eqiad.wmnet with reason: REIMAGE | 
  [production] | 
            
  | 16:48 | 
  <hnowlan@cumin1001> | 
  END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) | 
  [production] | 
            
  | 16:27 | 
  <cmjohnson@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 16:25 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 16:11 | 
  <cmjohnson@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 16:04 | 
  <cmjohnson@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 14:49 | 
  <hnowlan@cumin1001> | 
  START - Cookbook sre.postgresql.postgres-init | 
  [production] | 
            
  | 14:29 | 
  <hnowlan@cumin1001> | 
  END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) | 
  [production] | 
            
  | 13:06 | 
  <moritzm> | 
  installing 4.9.272 kernels on stretch hosts (no reboots yet) | 
  [production] | 
            
  | 11:28 | 
  <hnowlan@cumin1001> | 
  START - Cookbook sre.postgresql.postgres-init | 
  [production] | 
            
  | 11:14 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 11:09 | 
  <mwdebug-deploy@deploy1002> | 
  helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . | 
  [production] | 
            
  | 09:37 | 
  <milimetric@deploy1002> | 
  Finished deploy [analytics/refinery@37e904a] (thin): Only syncing sanitize allowlist, deploying THIN for consistency (duration: 00m 07s) | 
  [production] | 
            
  | 09:37 | 
  <milimetric@deploy1002> | 
  Started deploy [analytics/refinery@37e904a] (thin): Only syncing sanitize allowlist, deploying THIN for consistency | 
  [production] | 
            
  | 09:36 | 
  <milimetric@deploy1002> | 
  Finished deploy [analytics/refinery@37e904a]: Only syncing sanitize allowlist (duration: 17m 43s) | 
  [production] | 
            
  | 09:19 | 
  <milimetric@deploy1002> | 
  Started deploy [analytics/refinery@37e904a]: Only syncing sanitize allowlist | 
  [production] | 
            
  | 08:00 | 
  <jayme> | 
  restarting php-fpm on wtp1037 and wtp1030 | 
  [production] |