2020-06-01
§
|
08:22 |
<mutante> |
mw1331 re-enabled puppet (SAL told me about an experiment a little while ago) |
[production] |
08:19 |
<jynus> |
disabling puppet on all db/es/pc hosts for deploy of gerrit:599596 |
[production] |
08:17 |
<RhinosF1> |
upload starter-new.sh and switched sopelbot.yaml foor T254046 |
[tools.zppixbot] |
07:46 |
<RF1dle> |
add notice for T254046 to wiki index about |
[tools.zppixbot] |
07:05 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1142 to clone db1147 T252512', diff saved to https://phabricator.wikimedia.org/P11339 and previous config saved to /var/cache/conftool/dbconfig/20200601-070519-marostegui.json |
[production] |
06:53 |
<elukey> |
re-run virtualpageview-hourly-wf-2020-5-31-19 |
[analytics] |
06:28 |
<elukey> |
temporary stop of all RU jobs on an-launcher1001 to priviledge camus and others |
[analytics] |
06:03 |
<elukey> |
kill all airflow-related processes on an-launcher1001 - host killing tasks due to OOM |
[analytics] |
05:03 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool enwiki db2071 slave to test new index - T238966', diff saved to https://phabricator.wikimedia.org/P11338 and previous config saved to /var/cache/conftool/dbconfig/20200601-050354-marostegui.json |
[production] |
04:54 |
<marostegui> |
Drop testreduce_0715 from m5 master T245408 |
[production] |
04:44 |
<marostegui> |
Depool db1141 from Analytics role - T249188 |
[production] |
00:39 |
<bd808> |
Ugh. Prior SAL message was about tools-sgeexec-0940 |
[tools] |
00:39 |
<bd808> |
Compressed /var/log/account/pacct.0 ahead of rotation schedule to free some space on the root partition |
[tools] |
00:31 |
<bd808> |
Also, why is tools.squirrelnestbot running a job for tools.unblockbot? |
[tools.squirrelnestbot] |
00:31 |
<bd808> |
Stopped grid job running tools.unblockbot/unblockbot.sh. Script is in an infinite crash loop because it does not handle https properly. |
[tools.squirrelnestbot] |
2020-05-30
§
|
21:52 |
<RhinosF1> |
Maint Complete! |
[tools.zppixbot-test] |
21:49 |
<wm-bot> |
<rhinosf1> chmod a+x starter-new.sh |
[tools.zppixbot-test] |
21:26 |
<RhinosF1> |
tools.zppixbot-test@tools-sgebastion-07:~/k8s$ take /data/project/zppixbot-test/k8s/starter-new.sh - I hate forklift's file handling at times |
[tools.zppixbot-test] |
21:14 |
<RhinosF1> |
tools.zppixbot-test tools.zppixbot-test@tools-sgebastion-07:~/.sopel$ kubectl scale --replicas=1 deployment.apps/sopeltest.bot |
[tools.zppixbot-test] |
21:01 |
<RhinosF1> |
rename starter.sh to starter-old and create starter-new - move zppixbot-test to use it for the deployment |
[tools.zppixbot-test] |
21:00 |
<RhinosF1> |
switch that to sopeltest.bot |
[tools.zppixbot-test] |
20:59 |
<RhinosF1> |
tools.zppixbot-test@tools-sgebastion-07:~/.sopel$ kubectl scale --replicas=0 deployment.apps/zppixbot-test |
[tools.zppixbot-test] |
16:53 |
<Reedy> |
Reloading Zuul to deploy https://gerrit.wikimedia.org/r/599963 |
[releng] |
14:30 |
<wm-bot> |
<lucaswerkmeister> deployed ff77d74e3f (add somevalue depicts statements) |
[tools.wd-image-positions] |
13:18 |
<Amir1> |
ladsgroup@deployment-deploy01:/srv/mediawiki/php-master$ mwscript maintenance/createAndPromote.php --wiki=fawiki --bureaucrat --force --interface-admin --sysop Ladsgroup (Part of T253291) |
[releng] |
08:15 |
<elukey> |
manual reset-failed of monitor_refine_mediawiki_job_events_failure_flags |
[analytics] |
2020-05-29
§
|
23:42 |
<bstorm_> |
stopped puppet and restarting mariadb on clouddb1002 after filtering out a table T253738 |
[clouddb-services] |
22:46 |
<James_F> |
Zuul: Archive mediawiki/extensions/PopupPages T251000 |
[releng] |
22:32 |
<bstorm_> |
updated views on labsdb1010 T252219 |
[production] |
22:10 |
<bd808> |
Rebooting mwv-builder-03.mediawiki-vagrant.eqiad.wmflabs |
[mediawiki-vagrant] |
22:10 |
<wm-bot> |
<rhinosf1> dropped old logs |
[tools.zppixbot-test] |
22:07 |
<wm-bot> |
<rhinosf1> purge old *.*db files, tar+gzip logs/* and nuke the pycahce's |
[tools.zppixbot-test] |
21:24 |
<wm-bot> |
<rhinosf1> sync done |
[tools.zppixbot] |
21:23 |
<wm-bot> |
<rhinosf1> syncing to deploy 599873 -- T233993 |
[tools.zppixbot] |
20:55 |
<bstorm_> |
updating views on labsdb1011 T252219 |
[production] |
19:39 |
<bstorm_> |
switch deployment to the openresty version to try it out T252217 |
[tools.paws-public] |
19:37 |
<bstorm_> |
adding docker image for paws-public docker-registry.tools.wmflabs.org/paws-public-nginx:openresty T252217 |
[tools] |
19:27 |
<ryankemper> |
Successfully finished a rolling restart of the `cloudelastic` clusters (chi, psi, omega) as part of elasticsearch plugins upgrade. Host and service checks re-enabled. |
[production] |
18:42 |
<hauskatze> |
gerrit: replication start mediawiki/extensions/WikiShare --wait refs. T250400 |
[releng] |
18:17 |
<hauskatze> |
GitHub: Deleted mirror wikimedia/mediawiki-extensions-PopupPages refs. T251000 |
[releng] |
18:12 |
<bstorm_> |
applied in-place fix for non-ASCII usernames and applied this to my own version of the image T252217 |
[tools.paws-public] |
17:28 |
<bstorm_> |
updating views on labsdb1009 T252219 |
[production] |
16:50 |
<ryankemper> |
Performing a rolling restart of the `cloudelastic` clusters (chi, psi, omega) as part of elasticsearch plugins upgrade. Host and service checks disabled. |
[production] |
16:00 |
<bstorm_> |
Updating views on labsdb1012 T252219 |
[production] |
15:59 |
<ryankemper> |
Concluded rolling restart of the `relforge` clusters as part of elasticsearch plugins upgrade. Both hosts `relforge1001` and `relforge1002` are back up. Downtime lifted. |
[production] |
15:29 |
<ryankemper> |
Performing a rolling restart of the `relforge` clusters as part of elasticsearch plugins upgrade |
[production] |