2019-10-03
19:30 <marxarelli> 1.34.0-wmf.25 promoted to all wikis, cc: T220750. no rise in relevant error rates. no new errors [production]
19:21 <dduvall@deploy1001> rebuilt and synchronized wikiversions files: all wikis to 1.34.0-wmf.25 [production]
19:19 <mutante> puppetmaster1001 - revoke cert for parsoid.discovery.wmnet - creating new ones for each DC and a unified one with both (T233654) [production]
19:12 <James_F> [wikimedia/fundraising/crm] Add experimental php70 job T230446 [releng]
19:11 <@> helmfile [STAGING] Ran 'apply' command on namespace 'sessionstore' for release 'staging' . [production]
18:52 <krinkle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: no-op / config cached? (duration: 00m 59s) [production]
18:43 <krinkle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: c2b3d7ce57e9c422 (duration: 00m 59s) [production]
18:25 <James_F> Zuul: [mediawiki/tools/codesniffer] Move to PHP7.2+ [releng]
18:14 <krinkle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: no-op / config cache issue? (duration: 01m 00s) [production]
18:03 <krinkle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: 5389d0243ee9c (duration: 01m 01s) [production]
17:13 <mholloway-shell@deploy1001> Finished deploy [mobileapps/deploy@31b2703]: Update mobileapps to 1db84a7 (duration: 06m 06s) [production]
17:07 <mholloway-shell@deploy1001> Started deploy [mobileapps/deploy@31b2703]: Update mobileapps to 1db84a7 [production]
14:17 <elukey> stop the Hadoop test cluster to migrate it to the new kerberos cluster [analytics]
13:49 <elukey> roll restart hadoop yarn resource managers for openssl updates on Hadoop workers [production]
13:44 <marostegui> Stop MySQL and shutdown es1019 for on-site maintenance - T233698 [production]
13:40 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Depool es1019 for on-site maintenance T233698 (duration: 01m 01s) [production]
13:29 <hashar> Gerrit should be back [production]
13:26 <hashar> restarting Gerrit due to a deadlock in SendEmail task and AccountCacheImpl [production]
13:26 <elukey> re-run refinery-download-project-namespace-map (modified with recent fixes for encoding and python3) [analytics]
13:22 <hashar> Gerrit might be dead again; taking traces [production]
13:05 <arturo> delete servers tools-sssd-sgeexec-test-[1,2], no longer required [tools]
13:04 <_joe_> restarting php7 on mw1275 [production]
12:54 <onimisionipe> force shard allocation on eqiad chi cluster [production]
11:18 <arturo> project created, added Yarl and Local Profil as project admins (T233656) [finding-glams]
11:09 <hashar> integration-castor03: pruning HHVM jobs caches: rm -fR /srv/jenkins-workspace/caches/*/*/*hhvm* # T234384 [releng]
11:07 <hashar> jenkins: removing global environment variable: HHVM_REPO_CENTRAL_PATH=$WORKSPACE/central.hhbc | https://integration.wikimedia.org/ci/configure | T234384 [releng]
10:27 <elukey> killed rsync processes in "D" state on stat1007, force umount/mount of /mnt/hdfs [production]
10:25 <jbond42> rolling upgrade of openssl packages [production]
10:21 <Urbanecm> Manually cleared signup throttle for IP 80.188.128.54 at cswiki, issue with introduced throttle rule [production]
10:20 <Urbanecm> Manually cleared signup throttle for IP 88.100.221.84 at cswiki, issue with introduced throttle rule [production]
10:18 <Urbanecm> Manually cleared signup throttle for IP 90.176.155.12 at cswiki, issue with introduced throttle rule [production]
09:48 <elukey> ran apt-get autoremove -y on all Hadoop workers to remove old Python 2 deps [analytics]
09:32 <elukey> run apt-get autoremove incrementally on all the hadoop prod workers to remove python2 deps (and verify that they are not used anymore by Hadoop) [production]
08:43 <elukey> apply 5% threshold to the HDFS balancer - T231828 [analytics]
08:33 <marostegui> Deploy schema change on db2087:3316 T233135 T234066 [production]
08:28 <marostegui> Deploy schema change on db1096:3316 - T233625 [production]
08:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1096:3316 for schema change T233135 T234066', diff saved to https://phabricator.wikimedia.org/P9236 and previous config saved to /var/cache/conftool/dbconfig/20191003-082651-marostegui.json [production]
08:15 <akosiaris> slowly rolling restart all pods in eqiad, codfw, staging for log rollover before merging https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/539912 [production]
07:49 <marostegui> Set notes on the sanitarium masters - T234039 [production]
07:48 <elukey> restart druid-broker on druid1003 (used by superset) [analytics]
07:47 <elukey> restart superset to test if a stale status might cause data not to be shown [analytics]
07:19 <marostegui> Remove unused labspuppet database from m5 - T233281 [production]
07:03 <@> helmfile [CODFW] Ran 'apply' command on namespace 'zotero' for release 'production' . [production]
07:00 <@> helmfile [STAGING] Ran 'apply' command on namespace 'zotero' for release 'staging' . [production]
06:59 <eileen> tools revision changed from e1b81688c6 to b3c7453be2 [production]
06:59 <@> helmfile [EQIAD] Ran 'apply' command on namespace 'zotero' for release 'production' . [production]
06:48 <marostegui> Drop database grants on m5 for labspuppet - T233281 [production]
06:37 <marostegui> Rename tables on m5 master on designate_pool_manager - T233978 [production]
06:16 <marostegui> Deploy schema change on db2089:3316 T233135 T234066 [production]
05:46 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]