2020-01-10 §
19:40 <ebernhardson> restart mjolnir-kafka-bulk-daemon across eqiad and codfw search clusters [production]
19:40 <ebernhardson@deploy1001> Finished deploy [search/mjolnir/deploy@e141941]: repair model upload in bulk daemon (duration: 05m 02s) [production]
19:35 <ebernhardson@deploy1001> Started deploy [search/mjolnir/deploy@e141941]: repair model upload in bulk daemon [production]
19:13 <otto@deploy1001> helmfile [STAGING] Ran 'apply' command on namespace 'eventgate-logging-external' for release 'logging-external' . [production]
18:53 <mutante> welcome new (restbase) service deployer Clara Andrew-Wani (T242152) [production]
18:35 <bd808> Restarted zuul on contint1001 at 18:29Z; no logs since 2020-01-10 17:55:28,452 [releng]
18:29 <bd808> Restarted zuul on contint1001; no logs since 2020-01-10 17:55:28,452 [production]
16:15 <bstorm_> restarted the tool to clean up loads of uninterruptible and zombie perl processes [tools.ftl]
15:45 <bstorm_> depooled tools-paws-worker-1013 to reboot because I think it is the last tools server with that mount issue (I hope) [tools]
15:35 <bstorm_> depooling and rebooting tools-worker-1016 because it still had the leftover mount problems [tools]
15:30 <bstorm_> git stash-ing local puppet changes in hopes that arturo has that material locally, and it doesn't break anything to do so [tools]
14:30 <elukey> restart oozie with new settings to instruct it to pick up spark-defaults.conf settings from /etc/spark2/conf [analytics]
13:27 <arturo> cloudvirt1009: virsh undefine i-000069b6. This is tools-elastic-01 which is running on cloudvirt1008 (so, leaked on cloudvirt1009) [admin]
13:04 <arturo> cyberbot-db-01 is now on cloudvirt1029 [cyberbot]
11:48 <moritzm> stop/mask nginx on hassium/hassaleh T224567 [production]
10:56 <akosiaris> repool mathoid codfw for testing canary support in the mathoid helm chart [production]
10:56 <akosiaris@cumin1001> conftool action : set/pooled=true; selector: name=codfw,dnsdisc=mathoid [production]
10:51 <akosiaris@deploy1001> helmfile [CODFW] Ran 'apply' command on namespace 'mathoid' for release 'canary' . [production]
10:51 <akosiaris@deploy1001> helmfile [CODFW] Ran 'apply' command on namespace 'mathoid' for release 'production' . [production]
10:40 <akosiaris@deploy1001> helmfile [STAGING] Ran 'sync' command on namespace 'mathoid' for release 'staging' . [production]
10:38 <akosiaris> depool mathoid codfw in preparation for testing canary support in the mathoid helm chart [production]
10:37 <akosiaris@cumin1001> conftool action : set/pooled=false; selector: name=codfw,dnsdisc=mathoid [production]
10:24 <moritzm> rename Ganeti group for esams from "default" to "row_OE" T236216 [production]
10:21 <moritzm> rename Ganeti group for eqsin from "default" to "row_1" T228099 [production]
09:46 <arturo> moving cyberbot-db-01 from cloudvirt1009 to cloudvirt1029 to try a faster hypervisor [cyberbot]
09:34 <legoktm> shutdown upgrader-05 instance [library-upgrader]
09:02 <marostegui> Remove revision partitions from db2091:3312 [production]
09:01 <marostegui@cumin1001> dbctl commit (dc=all): 'Depoool db2091:3312', diff saved to https://phabricator.wikimedia.org/P10113 and previous config saved to /var/cache/conftool/dbconfig/20200110-090143-marostegui.json [production]
08:59 <marostegui@cumin1001> dbctl commit (dc=all): 'Pool db2088:3312', diff saved to https://phabricator.wikimedia.org/P10112 and previous config saved to /var/cache/conftool/dbconfig/20200110-085921-marostegui.json [production]
08:55 <vgutierrez> restarting pybal on lvs3005 (high-traffic1) - T242321 [production]
08:51 <vgutierrez> restarting pybal on lvs3007 - T242321 [production]
08:48 <vgutierrez@puppetmaster1001> conftool action : set/pooled=yes; selector: service=nginx,name=ncredir3002.esams.wmnet [production]
08:48 <vgutierrez@puppetmaster1001> conftool action : set/pooled=yes; selector: service=nginx,name=ncredir3001.esams.wmnet [production]
08:24 <ema> cp3062: varnish-frontend-restart to clear things up after child crash the past days [production]
07:52 <legoktm> switched over libraryupgrader2.wmflabs.org to point to upgrader06 [library-upgrader]
07:38 <elukey> re-run virtualpageviews-druid-daily 09/01/2020 via Hue [analytics]
07:37 <elukey> systemctl restart drop-el-unsanitized-events on an-coord1001 [analytics]
06:59 <legoktm> upgrader06: adduser libup && adduser libup docker [library-upgrader]
06:59 <legoktm> upgrader06: enabling buster-wikimedia/thirdparty/kubeadm-k8s apt repo and installing docker-ce [library-upgrader]
06:53 <legoktm> disabling all services on upgrader-05 [library-upgrader]
06:33 <legoktm> creating upgrader-06 instance, large flavor this time [library-upgrader]
02:15 <wm-bot> <samwilson> Restarted web service and enwiki job. T242291. [tools.eranbot]
02:11 <jhuneidi@deploy1001> Pruned MediaWiki: 1.35.0-wmf.10 (duration: 04m 13s) [production]
00:45 <catrope@deploy1001> Synchronized php-1.35.0-wmf.14/extensions/GrowthExperiments/: Expose tasktype/topic API parameter info (T240512) (duration: 01m 01s) [production]
00:35 <shdubsh> restart prometheus on prometheus2004, enabling debug log [production]
00:13 <bstorm_> restarted tool to fix a brief error caused during bug testing of the Gridengine system [tools.whois]
2020-01-09 §
23:52 <bstorm_> restarted webservice because it didn't cleanly depool during toolforge maintenance T242385 [tools.fountain]
23:35 <bstorm_> depooled tools-sgeexec-0939 because it isn't acting right and rebooting it [tools]
21:25 <ebernhardson@deploy1001> Finished deploy [search/airflow@746c149]: Add skein to airflow venv (duration: 00m 55s) [production]
21:24 <ebernhardson@deploy1001> Started deploy [search/airflow@746c149]: Add skein to airflow venv [production]