4201-4250 of 10000 results (92ms)
2023-03-03 ยง
15:02 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM urldownloader1004.wikimedia.org - jmm@cumin2002" [production]
14:59 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['elastic1053.eqiad.wmnet'] [production]
14:58 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM urldownloader1004.wikimedia.org - jmm@cumin2002" [production]
14:56 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
14:56 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host urldownloader1004.wikimedia.org [production]
14:37 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host urldownloader1003.wikimedia.org [production]
14:27 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) urldownloader1003.wikimedia.org on all recursors [production]
14:27 <jmm@cumin2002> START - Cookbook sre.dns.wipe-cache urldownloader1003.wikimedia.org on all recursors [production]
14:27 <jmm@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:27 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM urldownloader1003.wikimedia.org - jmm@cumin2002" [production]
14:27 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 14 hosts with reason: rerack [production]
14:26 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 14 hosts with reason: rerack [production]
14:24 <jmm@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
14:16 <jmm@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM urldownloader1003.wikimedia.org - jmm@cumin2002" [production]
14:10 <jmm@cumin2002> START - Cookbook sre.dns.netbox [production]
14:10 <jmm@cumin2002> START - Cookbook sre.ganeti.makevm for new host urldownloader1003.wikimedia.org [production]
14:09 <inflatador> bking@cumin2002 banning elastic1053-59 from the cluster in preparation for T322082 [production]
14:02 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
13:51 <jmm@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
13:16 <ayounsi@cumin1001> END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'email' for AS: 20485 [production]
13:16 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 20485 [production]
13:15 <ayounsi@cumin1001> END (FAIL) - Cookbook sre.network.peering (exit_code=99) with action 'email' for AS: 20485 [production]
13:15 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 20485 [production]
12:55 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
11:29 <jmm@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sretest1002.eqiad.wmnet with OS bookworm [production]
11:17 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
11:13 <moritzm> imported PHP 7.4 1:7.4.33-1+0~20221108.73+debian10~1.gbpa00350a+wmf10u2+icu67u1 to component/icu67 (build of PHP against co-installable ICU67) T329491 [production]
10:39 <vgutierrez> restart ntp.service in dns2001 [production]
10:30 <jelto@cumin1001> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: Install software version upgrade [production]
10:25 <moritzm> installing 5.10.162 kernels on buster systems running Linux 5.10 [production]
10:12 <root@cumin2002> END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Jonas Kress (WMDE) out of all services on: 1119 hosts [production]
10:12 <root@cumin2002> START - Cookbook sre.idm.logout Logging Jonas Kress (WMDE) out of all services on: 1119 hosts [production]
09:56 <root@cumin2002> END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Tobias Andersson out of all services on: 1119 hosts [production]
09:55 <root@cumin2002> START - Cookbook sre.idm.logout Logging Tobias Andersson out of all services on: 1119 hosts [production]
09:54 <root@cumin2002> END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Tobias Andersson out of all services on: 909 hosts [production]
09:54 <root@cumin2002> START - Cookbook sre.idm.logout Logging Tobias Andersson out of all services on: 909 hosts [production]
09:45 <jelto@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Install software version upgrade [production]
09:45 <jelto@cumin1001> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Install software version upgrade [production]
09:27 <jelto@cumin1001> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Install software version upgrade [production]
09:10 <elukey@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:10 <elukey@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:07 <elukey@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:01 <jelto@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Install software version upgrade [production]
08:54 <elukey> restart pybal on lvs2010 (standby) and then on lvs2009 (active) to pick up monitoring change (https://gerrit.wikimedia.org/r/c/operations/puppet/+/893008) [production]
08:48 <elukey> restart pybal on lvs1020 (standby) and then on lvs1019 (active) to pick up monitoring change (https://gerrit.wikimedia.org/r/c/operations/puppet/+/893008) [production]
08:45 <jelto@cumin1001> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Install software version upgrade [production]
08:36 <vgutierrez> restarting ntp in dns1001 [production]
07:29 <elukey> truncate /var/log/auth.log.1 on krb1001 to free space (root partition almost filled up) [production]
01:12 <mutante> releases1002: deleting /usr/local/sbin/sync-srv-org-wikimedia-reprepro-releases1002.eqiad.wmnet which confusingly contains an rsync command to rsync from releases1001 which does not exist anymore T330960 [production]
00:13 <mutante> switching releases.wikimedia.org from eqiad to codfw - T330960 [production]