751-800 of 10000 results (64ms)
2019-10-03 §
14:17 <elukey> stop the Hadoop test cluster to migrate it to the new kerberos cluster [analytics]
13:49 <elukey> roll restart hadoop yarn resource managers for openssl updates on Hadoop workers [production]
13:44 <marostegui> Stop MySQL and shutdown es1019 for on-site maintenance - T233698 [production]
13:40 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Depool es1019 for on-site maintenance T233698 (duration: 01m 01s) [production]
13:29 <hashar> Gerrit should be back [production]
13:26 <hashar> restarting Gerrit due to a deadlock in SendEmail task and AccountCacheImpl [production]
13:26 <elukey> re-run refinery-download-project-namespace-map (modified with recent fixes for encoding and python3) [analytics]
13:22 <hashar> Gerrit might be dead again; taking traces [production]
13:05 <arturo> delete servers tools-sssd-sgeexec-test-[1,2], no longer required [tools]
13:04 <_joe_> restarting php7 on mw1275 [production]
12:54 <onimisionipe> force shard allocation on eqiad chi cluster [production]
11:18 <arturo> project created, added Yarl and Local Profil as project admings (T233656) [finding-glams]
11:09 <hashar> integration-castor03: pruning HHVM jobs caches: rm -fR /srv/jenkins-workspace/caches/*/*/*hhvm* # T234384 [releng]
11:07 <hashar> jenkins: removing global environment variable: HHVM_REPO_CENTRAL_PATH=$WORKSPACE/central.hhbc | https://integration.wikimedia.org/ci/configure | T234384 [releng]
10:27 <elukey> killed rsync processes in "D" state on stat1007, force umount/mount of /mnt/hdfs [production]
10:25 <jbond42> rolling upgrade of openssl packages [production]
10:21 <Urbanecm> Manually cleared signup throttle for IP 80.188.128.54 at cswiki, issue with introduced throttle rule [production]
10:20 <Urbanecm> Manually cleared signup throttle for IP 88.100.221.84 at cswiki, issue with introduced throttle rule [production]
10:18 <Urbanecm> Manually cleared signup throttle for IP 90.176.155.12 at cswiki, issue with introduced throttle rule [production]
09:48 <elukey> ran apt-get autoremove -y on all Hadoop workers to remove old Python 2 deps [analytics]
09:32 <elukey> run apt-get autoremove incrementally on all the hadoop prod workers to remove python2 deps (and verify that they are not used anymore by Hadoop) [production]
08:43 <elukey> apply 5% threshold to the HDFS balancer - T231828 [analytics]
08:33 <marostegui> Deploy schema change on db2087:3316 T233135 T234066 [production]
08:28 <marostegui> Deploy schema change on db1096:3316 - T233625 [production]
08:26 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1096:3316 for schema change T233135 T234066', diff saved to https://phabricator.wikimedia.org/P9236 and previous config saved to /var/cache/conftool/dbconfig/20191003-082651-marostegui.json [production]
08:15 <akosiaris> slowly rolling restart all pods in eqiad, codfw, staging for log rollover before merging https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/539912 [production]
07:49 <marostegui> Set notes on the sanitarium masters - T234039 [production]
07:48 <elukey> restart druid-broker on druid1003 (used by superset) [analytics]
07:47 <elukey> restart superset to test if a stale status might cause data not to be shown [analytics]
07:19 <marostegui> Remove unused labspuppet database from m5 - T233281 [production]
07:03 <@> helmfile [CODFW] Ran 'apply' command on namespace 'zotero' for release 'production' . [production]
07:00 <@> helmfile [STAGING] Ran 'apply' command on namespace 'zotero' for release 'staging' . [production]
06:59 <eileen> tools revision changed from e1b81688c6 to b3c7453be2 [production]
06:59 <@> helmfile [EQIAD] Ran 'apply' command on namespace 'zotero' for release 'production' . [production]
06:48 <marostegui> Drop database grants on m5 for labspuppet - T233281 [production]
06:37 <marostegui> Rename tables on m5 master on designate_pool_manager - T233978 [production]
06:16 <marostegui> Deploy schema change on db2089:3316 T233135 T234066 [production]
05:46 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
05:45 <marostegui@cumin1001> START - Cookbook sre.hosts.decommission [production]
05:28 <eileen> civicrm revision changed from 12c5727a23 to c12f7bb51f, config revision is 422a0f7d48 [production]
02:07 <krinkle@deploy1001> Synchronized wmf-config/InitialiseSettings.php: 1c599baea51f9 (duration: 01m 03s) [production]
01:05 <mutante> gerrit1001 - shutdown - scheduled downtime [production]
00:51 <mutante> gerrit1001 - removing wrong IPv6 address from interface, running puppet [production]
2019-10-02 §
23:42 <XioNoX> enable cr2-eqiad:xe-4/0/0 - T234416 [production]
23:38 <XioNoX> disable cr2-eqiad:xe-4/0/0 - T234416 [production]
23:22 <ebernhardson@deploy1001> Synchronized php-1.34.0-wmf.24/extensions/CirrusSearch/: T234445: CirrusSearch: Fix Precondition failed: Must have a resultset set (duration: 01m 00s) [production]
23:21 <ebernhardson@deploy1001> Synchronized php-1.34.0-wmf.25/extensions/CirrusSearch/: T234445: CirrusSearch: Fix Precondition failed: Must have a resultset set (duration: 01m 02s) [production]
22:29 <godog> remove queued messages from mx1001 for fr-tech-ops@, triggering sender rate limit from gmail [production]
22:12 <jforrester@deploy1001> Synchronized php-1.34.0-wmf.24/extensions/VisualEditor/includes/ApiVisualEditorEdit.php: VE unstructured logging, part II (duration: 00m 58s) [production]
22:11 <jforrester@deploy1001> Synchronized php-1.34.0-wmf.24/extensions/VisualEditor/includes/ApiVisualEditor.php: VE unstructured logging, part I (duration: 00m 59s) [production]