2019-12-16
15:20 <mforns@deploy1001> Started deploy [analytics/refinery@1c72a71]: deploying analytics refinery for kerberos migration [production]
15:20 <mforns> deploying analytics refinery for kerberos migration [analytics]
15:15 <elukey@cumin1001> START - Cookbook sre.druid.roll-restart-workers [production]
14:58 <cdanis> cdanis@mwdebug2001.codfw.wmnet: scap pull [production]
14:55 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1084 schema change', diff saved to https://phabricator.wikimedia.org/P9877 and previous config saved to /var/cache/conftool/dbconfig/20191216-145520-marostegui.json [production]
14:49 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1121 after schema change', diff saved to https://phabricator.wikimedia.org/P9876 and previous config saved to /var/cache/conftool/dbconfig/20191216-144902-marostegui.json [production]
14:46 <cdanis@deploy1001> Synchronized wmf-config/db-eqiad.php: db-eqiad: remove dbctl-obsoleted externalLoads section 5413a6d73 T229686 (duration: 00m 54s) [production]
14:45 <cdanis@deploy1001> Synchronized wmf-config/db-codfw.php: db-codfw: remove dbctl-obsoleted externalLoads section 519e37461 T229686 (duration: 00m 54s) [production]
14:39 <oblivian@deploy1001> helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' . [production]
14:39 <cdanis@deploy1001> Synchronized wmf-config/etcd.php: db-codfw: remove dbctl-obsoleted externalLoads section 519e37461 T229686 (duration: 00m 53s) [production]
14:38 <oblivian@deploy1001> helmfile [CODFW] Ran 'apply' command on namespace 'blubberoid' for release 'production' . [production]
14:36 <oblivian@deploy1001> helmfile [STAGING] Ran 'apply' command on namespace 'blubberoid' for release 'staging' . [production]
14:35 <XioNoX> delete virtual chassis ID on asw-a-codfw [production]
14:34 <XioNoX> delete virtual chassis ID on asw-b-codfw [production]
14:32 <XioNoX> delete virtual chassis ID on asw-c-codfw [production]
14:30 <cdanis> manual testing of I219711eb on mwdebug2001 [production]
14:11 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1127 after testing', diff saved to https://phabricator.wikimedia.org/P9875 and previous config saved to /var/cache/conftool/dbconfig/20191216-141141-marostegui.json [production]
14:09 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1127 from x1 for testing', diff saved to https://phabricator.wikimedia.org/P9874 and previous config saved to /var/cache/conftool/dbconfig/20191216-140951-marostegui.json [production]
14:03 <cdanis@deploy1001> Synchronized wmf-config/etcd.php: enable dbctl for externalLoads 6dfb30c76 T229686 (duration: 00m 53s) [production]
13:59 <arturo> powering down `puppet-stretch-test` VM to test stuff related to T240851 [testlabs]
13:50 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:50 <elukey@cumin1001> START - Cookbook sre.hosts.downtime [production]
13:33 <ema> cp-ats: rolling ats-backend-restart to apply ram cache size changes T238494 [production]
13:33 <moritzm> restarting systemd-timesyncd on stat1005 [production]
12:56 <joal> Kill all oozie jobs after having dumped their statuses [analytics]
12:52 <elukey> shutdown of the Analytics Hadoop cluster to enable Kerberos [production]
12:26 <joal> Reference for killed backfilling mediarequest-per-file job: https://hue.wikimedia.org/oozie/list_oozie_coordinator/0003296-191212123816836-oozie-oozi-C/ [analytics]
12:26 <joal> Reference for killed backfillin jo [analytics]
12:23 <joal> Kill backfilling job for mediarequest-per-file with 2017-07-0[2345] days not done [analytics]
12:22 <joal> Rerun cassandra-daily-wf-local_group_default_T_pageviews_per_article_flat-2019-12-15 [analytics]
12:17 <elukey> kill netflow realtime druid supervisor as prep step for kerberos [analytics]
12:16 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:15 <elukey@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:12 <Urbanecm> EU SWAT done [production]
12:11 <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: 026913d: Add no=>nb in $wgInterlanguageLinkCodeMap (T174160) (duration: 00m 53s) [production]
11:58 <jynus@cumin1001> dbctl commit (dc=all): 'Depool db1130', diff saved to https://phabricator.wikimedia.org/P9873 and previous config saved to /var/cache/conftool/dbconfig/20191216-115841-jynus.json [production]
11:55 <hashar> Restarting Jenkins completely to flush out stalled Gearman functions in Zuul [production]
11:41 <jdrewniak@deploy1001> Synchronized portals: Wikimedia Portals Update: [[gerrit:558017| Bumping portals to master (T128546)]] (duration: 00m 52s) [production]
11:40 <jdrewniak@deploy1001> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:558017| Bumping portals to master (T128546)]] (duration: 00m 56s) [production]
11:14 <joal> Clean spark-shell drivers on cluster before kerberos [analytics]
10:57 <elukey> disable puppet on labstore100[6,7] and stop analytics-related systemd timers - prep step for Kerberos [production]
10:46 <elukey> stop airflow-* on an-airflow1001 [analytics]
10:41 <XioNoX> delete virtual chassis ID on asw-d-codfw [production]
10:41 <elukey> stop jupyterhub on notebook100[3,4] as prep step for kerberos [analytics]
10:38 <elukey> kill Nuria's spark shell application masters in Yarn [analytics]
10:17 <elukey> stop hadoop-related timers on stat1007 [analytics]
10:14 <hashar> Restarting CI Jenkins due to out of sync state between Zuul Gearman and what is actually running (some jobs got lost) [production]
10:04 <joal> Killing user-app eating all cluster (application_1573208467349_190044) [analytics]
09:50 <marostegui> Stop replication in the same position in labsdb1010 and labsdb1012 - T238399 [production]
09:35 <hashar> doc1001: sudo -u doc-uploader rm -fR /srv/docroot/org/wikimedia/doc/DOCKER-mediawiki-core [releng]