6001-6050 of 10000 results (28ms)
2021-09-24 §
22:33 <razzi> restart an-test-coord presto coordinator service to experiment withweb-ui.authentication.type=fixed [analytics]
20:00 <volker-e@deploy1002> Finished deploy [design/style-guide@362c6b1]: Deploy design/style-guide: 362c6b1 “Components”: Fix index link (#489) (duration: 00m 06s) [production]
20:00 <volker-e@deploy1002> Started deploy [design/style-guide@362c6b1]: Deploy design/style-guide: 362c6b1 “Components”: Fix index link (#489) [production]
19:33 <volker-e@deploy1002> Finished deploy [design/style-guide@6585e79]: Deploy design/style-guide: 6585e79 “Apps”: Add Apps x Design System section (#487) (duration: 00m 07s) [production]
19:33 <volker-e@deploy1002> Started deploy [design/style-guide@6585e79]: Deploy design/style-guide: 6585e79 “Apps”: Add Apps x Design System section (#487) [production]
19:07 <mwdebug-deploy@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
19:04 <mwdebug-deploy@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'mwdebug' for release 'pinkunicorn' . [production]
18:58 <wm-bot> <lucaswerkmeister> tried to deploy new pygments-server with support for latest SyntaxHighlight; not quite working yet, but as pages with SyntaxHighlight still work (just without highlighting), I’m okay to leave it at this for now, and hopefully resume tomorrow [tools.notwikilambda]
18:57 <legoktm@deploy1002> Synchronized php-1.38.0-wmf.1/includes/MovePage.php: MovePage: don't create a recent change for a redirect (T291677) (duration: 00m 57s) [production]
18:54 <legoktm@deploy1002> Synchronized php-1.38.0-wmf.1/extensions/PageTriage/: Revert "Remove deprecated date.js library" (T291675) (duration: 00m 59s) [production]
18:53 <legoktm@deploy1002> sync-file aborted: (no justification provided) (duration: 00m 00s) [production]
18:13 <legoktm@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'zotero' for release 'production' . [production]
18:12 <legoktm@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'zotero' for release 'production' . [production]
17:20 <elukey@cumin1001> END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
17:02 <elukey@cumin1001> START - Cookbook sre.kafka.roll-restart-mirror-maker restart MirrorMaker for Kafka A:kafka-mirror-maker-jumbo-eqiad cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
16:35 <elukey@cumin1001> END (PASS) - Cookbook sre.kafka.roll-restart-brokers (exit_code=0) for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons for openjdk upgrade. - elukey@cumin1001 [production]
15:59 <elukey@cumin1001> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
15:53 <elukey@cumin1001> START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
15:52 <elukey@cumin1001> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
15:46 <elukey@cumin1001> START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
15:41 <dpifke> Cherry-picking https://gerrit.wikimedia.org/r/c/performance/coal/+/722948 and latest https://gerrit.wikimedia.org/r/c/operations/puppet/+/721047 in deployment-prep. Should only affect deployment-webperf11. [releng]
15:23 <elukey@cumin1001> END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
15:17 <elukey@cumin1001> START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons. - elukey@cumin1001 [production]
15:09 <elukey> sudo cumin -m async -b2 "c:profile::analytics::cluster::hdfs_mount" "umount /mnt/hdfs" "mount /mnt/hdfs" - T288625 [production]
15:06 <btullis> btullis@cumin1001:~$ sudo cumin --mode async 'aqs100[4,7].eqiad.wmnet' 'nodetool-a snapshot -t T291469' 'nodetool-b snapshot -t T291469' [analytics]
14:47 <btullis> btullis@aqs1007:~$ sudo nodetool-a repair --full local_group_default_T_mediarequest_per_file data [analytics]
14:46 <dcaro> Created new project (T290768) [wikiwho]
14:32 <bd808@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'toolhub' for release 'main' . [production]
14:21 <dcaro> Created new project (T290098) [fr-tech-dev]
14:07 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:03 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
13:31 <Amir1> start of rebuilding metadata of images in commons to make them use json [production]
13:24 <elukey@cumin1001> START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons for openjdk upgrade. - elukey@cumin1001 [production]
13:02 <arturo> [codfw1dev] create VM manila-share-controller-01 on cloudinfra-codfw1dev [admin]
13:00 <arturo> [codfw1dev] rebase labs/private.git on cloudinfra-puppetmaster-01, had merge conflict [admin]
11:58 <effie> upgrading scap on canaries - T291095 [production]
11:39 <jiji@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=tegola-vector-tiles [production]
11:32 <effie> uploading scap-4.0.0 to buster-wikimedia and stretch-wikimedia [production]
11:17 <effie> restart pybal in low traffic load balancers [production]
11:02 <btullis> btullis@an-master1001:~$ sudo systemctl restart hadoop-mapreduce-historyserver [analytics]
10:47 <btullis> btullis@an-master1002:~$ sudo systemctl restart hadoop-hdfs-namenode [analytics]
10:47 <btullis> btullis@an-master1002:~$ sudo systemctl restart hadoop-hdfs-zkfc [analytics]
10:44 <jynus> corrupting and fixing image metadata on testwiki before running script on commons T290462 [production]
10:35 <btullis> btullis@an-master1001:~$ sudo -u hdfs kerberos-run-command hdfs /usr/bin/hdfs haadmin -failover an-master1002-eqiad-wmnet an-master1001-eqiad-wmnet [analytics]
10:16 <elukey@cumin1001> END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid jvm daemons. - elukey@cumin1001 [production]
10:11 <btullis@cumin1001> END (FAIL) - Cookbook sre.hadoop.roll-restart-masters (exit_code=99) restart masters for Hadoop analytics cluster: Restart of jvm daemons. - btullis@cumin1001 [production]
10:07 <btullis> btullis@an-launcher1002:~$ sudo -u analytics kerberos-run-command analytics /usr/local/bin/refine_eventlogging_legacy --ignore_failure_flag=true --table_include_regex='centralnoticeimpression' --since='2021-09-23T04:00:00.000Z' --until='2021-09-24T05:00:00.000Z' [analytics]
09:39 <jynus> upgrade and restart db2099 [production]
09:32 <btullis@cumin1001> START - Cookbook sre.hadoop.roll-restart-masters restart masters for Hadoop analytics cluster: Restart of jvm daemons. - btullis@cumin1001 [production]
09:29 <btullis@cumin1001> END (PASS) - Cookbook sre.hadoop.roll-restart-masters (exit_code=0) restart masters for Hadoop test cluster: Restart of jvm daemons. - btullis@cumin1001 [production]