  | 2019-04-18 | 
    
  | 18:41 | <mutante> | mw2150 - reimaging, not in confctl | [production] | 
            
  | 18:02 | <dzahn@puppetmaster1001> | conftool action : set/pooled=yes; selector: name=mw2151.codfw.wmnet,cluster=jobrunner,service=nginx | [production] | 
            
  | 17:49 | <mutante> | mw2151 - scap pull | [production] | 
            
  | 17:46 | <mobrovac@deploy1001> | Synchronized php-1.34.0-wmf.1/extensions/EventBus/includes/JobExecutor.php: Default to a dummy title for invalid titles - T221368 (duration: 01m 01s) | [production] | 
            
  | 17:20 | <twentyafterfour@deploy1001> | Synchronized php-1.34.0-wmf.1/extensions/AbuseFilter/includes/: sync https://gerrit.wikimedia.org/r/c/mediawiki/extensions/AbuseFilter/+/504863 (duration: 01m 00s) | [production] | 
            
  | 16:20 | <bblack> | Experimental DNS-level changes deploying for wikipedia.org domain - if wikipedia.org DNS problems appear, revert https://gerrit.wikimedia.org/r/c/operations/dns/+/504588 - T208263 | [production] | 
            
  | 16:17 | <XioNoX> | remove peering to 63199 in eqsin (down for 1 month, no reply to emails) | [production] | 
            
  | 16:13 | <XioNoX> | rollback dhcp option 82 test from asw2-b-eqiad | [production] | 
            
  | 14:55 | <fsero> | synchronizing docker_registry_codfw swift container from docker_registry | [production] | 
            
  | 14:40 | <XioNoX> | push firewall change to pfw3-eqiad - T221278 | [production] | 
            
  | 13:30 | <jbond42> | rolling updates of ruby2.1 on jessie | [production] | 
            
  | 13:08 | <elukey> | roll restart of cassandra on aqs* to pick up new openjdk upgrades | [production] | 
            
  | 13:05 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 13:05 | <jmm@cumin2001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 12:58 | <reedy@deploy1001> | rebuilt and synchronized wikiversions files: group1 back to .25 | [production] | 
            
  | 12:36 | <anomie> | Ran `php7adm /opcache-free` on mw1274 to test a theory related to T221347. The log entries related to that task stopped immediately. | [production] | 
            
  | 12:30 | <gehel> | restarting blazegraph + updater on wdqs* for jvm upgrade | [production] | 
            
  | 12:22 | <moritzm> | installing Java security updates on restbase-dev hosts (along with Cassandra restarts) | [production] | 
            
  | 12:21 | <gehel> | restarting blazegraph + updater on wdqs1009 / wdqs1010 for jvm upgrade | [production] | 
            
  | 12:19 | <moritzm> | installing Java security updates on WDQS autodeploy/test hosts | [production] | 
            
  | 10:40 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 10:40 | <jmm@cumin2001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 10:35 | <moritzm> | installing rails security updates on jessie hosts | [production] | 
            
  | 10:21 | <moritzm> | installing jasper updates on jessie hosts | [production] | 
            
  | 09:44 | <akosiaris> | update grafana service/ dashboard to have user, system, throttled CPU metrics under the CPU saturation row | [production] | 
            
  | 09:41 | <gilles@deploy1001> | Synchronized wmf-config/InitialiseSettings.php: T216597 Run CPU benchmark for all samples on eswiki/ruwiki (duration: 01m 06s) | [production] | 
            
  | 09:11 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 09:10 | <jmm@cumin2001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 09:00 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 09:00 | <jmm@cumin2001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 08:54 | <elukey@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 08:54 | <elukey@cumin1001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 08:54 | <elukey@cumin1001> | END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) | [production] | 
            
  | 08:54 | <elukey@cumin1001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 08:53 | <elukey> | reboot kafka10[12-23] (old Analytics cluster) for kernel + openjdk upgrades | [production] | 
            
  | 08:23 | <gehel@cumin1001> | END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) | [production] | 
            
  | 08:14 | <moritzm> | installing libssh2 security updates on jessie | [production] | 
            
  | 08:01 | <moritzm> | restarting mw1261-mw1265 to pick up new libssh2 | [production] | 
            
  | 07:55 | <jmm@cumin2001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) | [production] | 
            
  | 07:55 | <jmm@cumin2001> | START - Cookbook sre.hosts.downtime | [production] | 
            
  | 07:53 | <filippo@puppetmaster1001> | conftool action : set/pooled=yes; selector: name=prometheus2004.codfw.wmnet | [production] | 
            
  | 07:28 | <moritzm> | installing libssh2 security updates | [production] | 
            
  | 07:19 | <gehel@cumin1001> | START - Cookbook sre.wdqs.data-transfer | [production] | 
            
  | 06:58 | <moritzm> | restarting icinga on icinga1001 (T196336) | [production] | 
            
  | 06:37 | <moritzm> | rolling reboots of Swift backends in eqiad for combined kernel/glibc/OpenSSL update | [production] |