2022-02-01 §
00:12 <tgr> deployment-prep cherry-picked gerrit 758584 to beta puppetmaster T300591 [releng]
2022-01-31 §
19:01 <James_F> Re-configured Jenkins job mediawiki-i18n-check-docker to 9e3ea96c548d7a84be763d38c2d118bc861cf189 for T222216 [releng]
10:49 <hashar> Added integration-agent-qemu-1003 with label `Qemu` # T284774 [releng]
2022-01-28 §
21:45 <taavi> running recountCategories.php on all beta wikis per T299823#7652496 [releng]
14:27 <hashar> taking heapdump of CI Jenkins `sudo -u jenkins /usr/lib/jvm/java-11-openjdk-amd64/bin/jmap -dump:live,format=b,file=/var/lib/jenkins/202201281527.hprof xxxx` [releng]
2022-01-27 §
20:26 <hashar> Successfully published image docker-registry.discovery.wmnet/releng/logstash-filter-verifier:0.0.2 # T299431 [releng]
19:34 <Amir1> Reloading Zuul to deploy 757464 [releng]
16:00 <hashar> Pooling back agents 1035 1036 1037 1038. They could not connect due to an ssh host key mismatch: yesterday they all got attached to instance 1033 and accepted that host key # T300214 [releng]
09:16 <hashar> integration: cumin --force 'name:docker' 'apt install rsync' # T300236 [releng]
09:05 <hashar> integration: cumin --force 'name:docker' 'apt install rsync' # T300214 [releng]
00:24 <thcipriani> restarting jenkins [releng]
2022-01-26 §
20:29 <hashar> Completed migration of integration-agent-docker-XXXX instances from Stretch to Bullseye - T252071 [releng]
19:55 <hashar> deleting integration-agent-docker-1014 which only has the `codehealth` label. A short-lived experiment, no longer used since October 2nd 2019 - https://gerrit.wikimedia.org/r/c/integration/config/+/540362 - T234259 [releng]
18:56 <hashar> integration: pooled in Jenkins a few more Bullseye docker agents for T252071 [releng]
18:17 <hashar> integration: pooled in Jenkins a few Bullseye docker agents for T252071 [releng]
16:45 <hashar> integration: creating integration-agent-docker-1023 based on buster with new flavor `g3.cores8.ram24.disk20.ephemeral60.4xiops` # T290783 [releng]
2022-01-25 §
20:17 <James_F> Zuul: [mediawiki/extensions/CentralAuth] Drop UserMerge dependency [releng]
16:39 <James_F> Zuul: Mark Math extension as now tarballed in parameter_functions for T232948 [releng]
15:57 <James_F> Zuul: [mediawiki/extensions/Math] Add Math to the main gate for T232948 [releng]
13:44 <hashar> Jenkins CI: added Logger https://integration.wikimedia.org/ci/log/ProcessTree%20-%20T299995/ to watch `hudson.util.ProcessTree` for T299995 [releng]
10:02 <hashar> integration: removing usage of `role::ci::slave::labs::docker::docker_lvm_volume` in Horizon following https://gerrit.wikimedia.org/r/c/operations/puppet/+/755948 . Docker role instances now always have a 24G partition for Docker [releng]
09:59 <hashar> integration-agent-qemu-1001: extended /srv to use all remaining free space: `lvextend -r -l +100%FREE /dev/mapper/vd-second--local--disk` # T299996 [releng]
09:59 <hashar> integration-agent-qemu-1001: resizing /dev/mapper/vd-second--local--disk (/srv) to 20G : `resize2fs -p /dev/mapper/vd-second--local--disk 20G` # T299996 [releng]
09:51 <hashar> integration-agent-qemu-1001: resizing /dev/mapper/vd-second--local--disk (/srv) to 20G : `resize2fs -p /dev/mapper/vd-second--local--disk 20G` [releng]
09:51 <hashar> integration-agent-qemu-1003: nuked /dev/vd/second-local-disk and /srv to make room for a docker logical volume. That fixed puppet # T299996 [releng]
09:22 <Reedy> unblocked beta again [releng]
07:32 <Krinkle> integration-castor03:/srv/jenkins-workspace/caches$ sudo rm -rf castor-mw-ext-and-skins/ [releng]
2022-01-24 §
21:44 <Reedy> unstick beta ci jobs [releng]
21:19 <jeena> reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/756523 [releng]
20:36 <Krinkle> Reloading Zuul to deploy https://gerrit.wikimedia.org/r/c/integration/config/+/756139 [releng]
17:28 <hashar> Nuke castor caches on integration-castor03 : sudo rm -fR /srv/jenkins-workspace/caches/castor-mw-ext-and-skins/master/{quibble-vendor-mysql-php72-selenium-docker,wmf-quibble-selenium-php72-docker} # T299933 [releng]
2022-01-22 §
13:40 <taavi> apply T299827 on deployment-prep centralauth database [releng]
11:44 <taavi> restart varnish-frontend.service on deployment-cache-upload06 to clear puppet agent failure alerts [releng]
2022-01-21 §
18:12 <taavi> resolved merge conflicts on deployment-puppetmaster04 [releng]
15:50 <hashar> integration-puppetmaster-02: deleted 2021 snapshot tags in puppet repo and ran `git gc --prune=now` [releng]
2022-01-20 §
20:24 <James_F> Zuul: [Kartographer] Add parsoid as dependency for CI jobs [releng]
20:22 <James_F> Zuul: [DiscussionTools] Add Gadgets as dependency for Phan jobs [releng]
20:04 <dancy> Jenkins beta jobs are back online, using scap prep auto now. [releng]
19:19 <dancy> Pausing beta Jenkins jobs to make a copy of /srv/mediawiki-staging in preparation for testing [releng]
19:10 <dancy> Unpacking scap (4.1.1-1+0~20220120175448.144~1.gbp517f9d) over (4.1.1-1+0~20220113154148.133~1.gbp6e3a17) on deploy03 [releng]
18:07 <hashar> Updating Quibble jobs to have MediaWiki files written on the host's /srv partition (38G) instead of inside the container, which ends up in /var/lib/docker (24G) https://gerrit.wikimedia.org/r/755743 # T292729 [releng]
16:31 <hashar> Rebalancing /var/lib/docker and /srv partitions on CI agents | https://gerrit.wikimedia.org/r/755713 [releng]
12:12 <hashar> contint2001 deleting all the Docker images (they will be pulled as needed) [releng]
12:10 <hashar> contint2001 : docker container prune && docker image prune [releng]
12:07 <hashar> contint1001 deleting all the Docker images (they will be pulled as needed) [releng]
12:04 <hashar> contint1001 `docker image prune` [releng]
11:51 <hashar> Cleaning very old Docker images on contint1001.wikimedia.org [releng]
2022-01-19 §
18:20 <hashar> Adding https://integration.wikimedia.org/ci/computer/contint1001/ back to the pool again [releng]
17:31 <hashar> Adding https://integration.wikimedia.org/ci/computer/contint1001/ back to the pool after the machine got powercycled # T299542 [releng]