801-850 of 10000 results (44ms)
2019-04-18 ยง
20:56 <twentyafterfour@deploy1001> rebuilt and synchronized wikiversions files: all wikis to 1.34.0-wmf.1 refs T220726 [production]
20:55 <mutante> - coverting instance pk8s back to labs-puppetmaster, fixed puppet runs [planet]
20:52 <cdanis> root@icinga1001.wikimedia.org /var/lib/icinga # for DOWNTIME in $(fgrep -B12 'comment=mobrovac: temp stop JQ for T221368 - cdanis@cumin1001' retention.dat | grep -A13 servicedowntime | grep downtime_id | cut -d= -f2); do printf "[%lu] DEL_SVC_DOWNTIME;%u\n" $(date +%s) $DOWNTIME ; done > rw/icinga.cmd [production]
20:49 <andrewbogott> deleting vagrant logfiles on rec-wiki because the drive is 100% full [recommendation-api]
20:40 <mobrovac@deploy1001> Synchronized php-1.34.0-wmf.1/extensions/Translate/utils/MessageUpdateJob.php: Translate jobs: Remove problematic Job::$params assignments, dir 2/2 - T221368 (duration: 01m 00s) [production]
20:38 <mobrovac@deploy1001> Synchronized php-1.34.0-wmf.1/extensions/Translate/tag: Translate jobs: Remove problematic Job::$params assignments, dir 1/2 - T221368 (duration: 01m 01s) [production]
20:32 <cdanis> cdanis@cumin1001.eqiad.wmnet ~ % sudo cumin 'scb*' 'enable-puppet "mobrovac: temp stop JQ for T221368"' [production]
20:31 <mobrovac@deploy1001> Finished deploy [cpjobqueue/deploy@71941b1]: Ignore Kafka disconnect errors (duration: 00m 51s) [production]
20:30 <mobrovac@deploy1001> Started deploy [cpjobqueue/deploy@71941b1]: Ignore Kafka disconnect errors [production]
20:28 <andrewbogott> deleting old log files on mcr-full because the drive is full [mcr-dev]
20:18 <bd808> Manually fixed broken /etc/puppet/puppet.conf on redirects-nginx01 [redirects]
20:13 <andrewbogott> rebooting dumps-1 to try to workaround nfs issues [dumps]
19:56 <lucaswerkmeister> deployed 1d4e4fb16b [tools.quickcategories]
19:39 <bd808> Restarting webservice, seems to have lost connectivity to the proxy [tools.admin]
19:36 <cdanis> cdanis@cumin1001.eqiad.wmnet ~ % sudo cookbook sre.hosts.downtime -r "mobrovac: temp stop JQ for T221368" 'scb*' [production]
19:36 <cdanis@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
19:36 <cdanis@cumin1001> START - Cookbook sre.hosts.downtime [production]
19:29 <cdanis> cdanis@cumin1001.eqiad.wmnet ~ % sudo cumin 'scb*' 'disable-puppet "mobrovac: temp stop JQ for T221368" && systemctl stop cpjobqueue' [production]
19:25 <Reedy> reloading zuul to deploy https://gerrit.wikimedia.org/r/504824 [releng]
19:17 <mobrovac@deploy1001> Started restart [cpjobqueue/deploy@922cbc0]: Bounce CP4JQ, lots of transport broken failures - T221368 [production]
19:11 <mobrovac@deploy1001> Synchronized php-1.34.0-wmf.1/extensions/EventBus/includes/EventFactory.php: Remove the use of page titles in JobExecutor, file 2/2 - T221368 (duration: 00m 59s) [production]
19:10 <mobrovac@deploy1001> Synchronized php-1.34.0-wmf.1/extensions/EventBus/includes/JobExecutor.php: Remove the use of page titles in JobExecutor, file 1/2 - T221368 (duration: 01m 01s) [production]
19:05 <bd808> deleting some project- and prefix-wide puppet config to try to get puppet running again on VMs (re-logged for andrewbogott) [wikistats]
18:55 <fdans> updated jars [analytics]
18:53 <fdans> Release of v0.0.86 in maven succeeded [analytics]
18:47 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
18:47 <robh@cumin1001> START - Cookbook sre.hosts.decommission [production]
18:47 <robh@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
18:47 <robh@cumin1001> START - Cookbook sre.hosts.decommission [production]
18:41 <mutante> mw2150 - reimaging, not in confctl [production]
18:02 <dzahn@puppetmaster1001> conftool action : set/pooled=yes; selector: name=mw2151.codfw.wmnet,cluster=jobrunner,service=nginx [production]
17:49 <mutante> mw2151 - scap pull [production]
17:46 <mobrovac@deploy1001> Synchronized php-1.34.0-wmf.1/extensions/EventBus/includes/JobExecutor.php: Default to a dummy title for invalid titles - T221368 (duration: 01m 01s) [production]
17:20 <twentyafterfour@deploy1001> Synchronized php-1.34.0-wmf.1/extensions/AbuseFilter/includes/: sync https://gerrit.wikimedia.org/r/c/mediawiki/extensions/AbuseFilter/+/504863 (duration: 01m 00s) [production]
16:57 <bd808> Added milimetric as a co-maintainer [tools.meetbot]
16:26 <paladox> rebuilding gerrit-test3 as gerrit-test5 due to removal of a kernel during a upgrade to stretch. [git]
16:20 <bblack> Experimental DNS-level changes deploying for wikipedia.org domain - if wikipedia.org DNS problems appear, revert https://gerrit.wikimedia.org/r/c/operations/dns/+/504588 - T208263 [production]
16:17 <XioNoX> remove peering to 63199 in eqsin (down for 1 month, no reply to emails) [production]
16:13 <XioNoX> rollback dhcp option 82 test from asw2-b-eqiad [production]
15:27 <paladox> upgrading gerrit-test3 to debian 9 (stretch) [git]
15:22 <fdans> restarting release of version 0.0.86 of refinery source to maven [analytics]
14:55 <fsero> synchronizing docker_registry_codfw swift container from docker_registry [production]
14:40 <XioNoX> push firewall change to pfw3-eqiad - T221278 [production]
14:29 <fdans> releasing version 0.0.86 of refinery source to maven [analytics]
13:30 <jbond42> rolling updates of ruby2.1 on jessie [production]
13:08 <elukey> roll restart of cassandra on aqs* to pick up new openjdk upgrades [production]
13:05 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:05 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
12:58 <reedy@deploy1001> rebuilt and synchronized wikiversions files: group1 back to .25 [production]
12:36 <anomie> Ran `php7adm /opcache-free` on mw1274 to test a theory related to T221347. The log entries related to that task stopped immediately. [production]