2022-04-01
11:31 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1001.eqiad.wmnet [production]
11:26 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host sretest1001.eqiad.wmnet [production]
11:01 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief2001.codfw.wmnet [production]
10:55 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief2001.codfw.wmnet [production]
10:54 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet [production]
10:50 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet [production]
10:47 <vgutierrez> reboot acme-chief instances to catch up on kernel upgrades [production]
10:34 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir6002.drmrs.wmnet [production]
10:29 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir6002.drmrs.wmnet [production]
10:29 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir6001.drmrs.wmnet [production]
10:21 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir6001.drmrs.wmnet [production]
10:20 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir5002.eqsin.wmnet [production]
10:14 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir5002.eqsin.wmnet [production]
10:06 <vgutierrez> vgutierrez@puppetmaster2001:~$ sudo -i rm /var/run/confd-template/.ml-staging-ctrl*.err [production]
10:04 <vgutierrez> vgutierrez@puppetmaster1001:~$ sudo -i rm /var/run/confd-template/.ml-staging-ctrl*.err [production]
10:03 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir5001.eqsin.wmnet [production]
09:57 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir5001.eqsin.wmnet [production]
09:47 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir4002.ulsfo.wmnet [production]
09:43 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir4002.ulsfo.wmnet [production]
09:43 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir4001.ulsfo.wmnet [production]
09:37 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir4001.ulsfo.wmnet [production]
09:35 <vgutierrez@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host ncredir3002.esams.wmnet [production]
09:24 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir3002.esams.wmnet [production]
09:24 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir3001.esams.wmnet [production]
09:18 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir3001.esams.wmnet [production]
09:16 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir2002.codfw.wmnet [production]
09:10 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir2002.codfw.wmnet [production]
09:10 <vgutierrez@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host ncredir2001.codfw.wmnet [production]
08:59 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir2001.codfw.wmnet [production]
08:58 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir1002.eqiad.wmnet [production]
08:54 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir1002.eqiad.wmnet [production]
08:53 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ncredir1001.eqiad.wmnet [production]
08:49 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir1001.eqiad.wmnet [production]
08:48 <vgutierrez@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host ncredir1001.eqiad.wmnet [production]
08:48 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host ncredir1001.eqiad.wmnet [production]
08:44 <vgutierrez@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-cluster (exit_code=99) [production]
08:44 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-cluster [production]
08:42 <vgutierrez> rolling restart of ncredir instances to catch up on kernel upgrades [production]
06:54 <XioNoX> traffic engineering in drmrs to prevent link saturation [production]
2022-03-31
23:45 <mutante> gitlab2001 - fdisk /dev/vdb (g, w) (create partition table), (n, w) (create partition) ; mkfs.ext4 /dev/vdb1 (create filesystem); systemctl reset-failed (fix Icinga alert); mkdir /mnt/gitlab-backup; mount /dev/vdb1 /mnt/gitlab-backup ; blkid (get UUID); edit /etc/fstab and insert "UUID=c5235682-ac21-46a9-85ee-9603f694a6a4 /mnt/gitlab-backup ext4 errors=remount-ro 0 2" T274463 [production]
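The disk preparation steps in the 23:45 entry above can be sketched as a script. This is a minimal sketch, not the commands actually run: the function names `fstab_line` and `prepare_backup_disk` are hypothetical, the two interactive fdisk sessions are collapsed into one piped invocation, and the `blkid -s UUID -o value` call stands in for reading the UUID by hand. It assumes /dev/vdb is the new, empty virtual disk.

```shell
#!/bin/sh
# Hypothetical helper: print the /etc/fstab line for a UUID and mount point,
# using the same options as the log entry (ext4, errors=remount-ro, 0 2).
fstab_line() {
    printf 'UUID=%s %s ext4 errors=remount-ro 0 2\n' "$1" "$2"
}

# Hypothetical wrapper around the logged steps. It is only defined here,
# never called, because every step is destructive and needs root.
prepare_backup_disk() {
    dev=/dev/vdb                  # assumption: the newly added virtual disk
    mnt=/mnt/gitlab-backup

    # g = new GPT partition table, n = new partition (three defaults), w = write.
    # The log did this interactively in two passes (g, w then n, w).
    printf 'g\nn\n\n\n\nw\n' | fdisk "$dev"

    mkfs.ext4 "${dev}1"           # create the filesystem
    systemctl reset-failed        # clear failed units (the Icinga alert in the log)
    mkdir -p "$mnt"
    mount "${dev}1" "$mnt"

    # Read the filesystem UUID and append the matching fstab entry.
    uuid=$(blkid -s UUID -o value "${dev}1")
    fstab_line "$uuid" "$mnt" >> /etc/fstab
}
```

Calling `fstab_line c5235682-ac21-46a9-85ee-9603f694a6a4 /mnt/gitlab-backup` prints the same line that the entry says was inserted into /etc/fstab.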
23:27 <mutante> gitlab2001 - rebooted on ganeti level (needed when adding new virtual hardware), then ran into the usual bug T272555 where you have to manually fix the interface in /etc/network/interfaces T274463 [production]
23:21 <mutante> gitlab2001 (gitlab-replica.wikimedia.org) - rebooting to add new virtual disk T274463 [production]
23:11 <ejegg> updated payments-wiki from 47d9bd27 to 6f888c28 [production]
23:01 <bblack> esams->drmrs failover test begins - T304089 [production]
22:34 <moritzm> updated CAS to 6.4.6.2 [production]
22:28 <mutante> ganeti - creating new 100G virtual disk on gitlab1001 T274463 [production]
22:24 <mutante> ganeti - creating new 100G virtual disk on gitlab2001 T274463 [production]
22:16 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.reboot (exit_code=0) [production]
22:03 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
22:02 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.reboot (exit_code=0) [production]