501-550 of 10000 results (98ms)
2022-12-15 ยง
11:42 <effie> switching maps/kartotherian from codfw to eqiad [production]
11:39 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host krb1001.eqiad.wmnet [production]
11:39 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@00c9a16] (codfw): codfw: Disable traffic mirroring (duration: 01m 43s) [production]
11:37 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@00c9a16] (codfw): codfw: Disable traffic mirroring [production]
11:34 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host krb1001.eqiad.wmnet [production]
11:27 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flowspec1001.eqiad.wmnet [production]
11:21 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host flowspec1001.eqiad.wmnet [production]
11:16 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ping3002.esams.wmnet [production]
11:15 <elukey@cumin1001> END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka test-eqiad cluster: Reboot kafka nodes [production]
11:11 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ping3002.esams.wmnet [production]
11:04 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ping2002.codfw.wmnet [production]
11:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ping2002.codfw.wmnet [production]
10:53 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ping1002.eqiad.wmnet [production]
10:50 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ping1002.eqiad.wmnet [production]
10:49 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host krb2001.codfw.wmnet [production]
10:47 <XioNoX> disable ping offload in eqiad [production]
10:43 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host krb2001.codfw.wmnet [production]
10:42 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host urldownloader2001.wikimedia.org [production]
10:38 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host urldownloader2001.wikimedia.org [production]
10:36 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host urldownloader1002.wikimedia.org [production]
10:34 <jayme> restarted istiod pods in aux-k8s because of T303184 [production]
10:32 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host urldownloader1002.wikimedia.org [production]
09:56 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet [production]
09:54 <effie> stopping and masking nutcracker on mw servers - T277183 [production]
09:53 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet [production]
09:51 <vgutierrez@cumin1001> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host acmechief2001.codfw.wmnet [production]
09:42 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host apt1001.wikimedia.org [production]
09:41 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief2001.codfw.wmnet [production]
09:40 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test2001.codfw.wmnet [production]
09:38 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host apt1001.wikimedia.org [production]
09:38 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief-test2001.codfw.wmnet [production]
09:37 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test1001.eqiad.wmnet [production]
09:31 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief-test1001.eqiad.wmnet [production]
09:30 <vgutierrez@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host acmechief-test1001.eqiad.wmnet [production]
09:30 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reboot-single for host acmechief-test1001.eqiad.wmnet [production]
09:27 <elukey@cumin1001> START - Cookbook sre.kafka.reboot-workers for Kafka test-eqiad cluster: Reboot kafka nodes [production]
09:21 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host urldownloader2002.wikimedia.org [production]
09:17 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host urldownloader2002.wikimedia.org [production]
09:15 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host urldownloader1001.wikimedia.org [production]
09:12 <akosiaris> reboot rdb2007 for kernel upgrades [production]
09:10 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host urldownloader1001.wikimedia.org [production]
09:08 <hashar@deploy1002> rebuilt and synchronized wikiversions files: all wikis to 1.40.0-wmf.14 refs T320519 [production]
08:53 <akosiaris> reboot rdb2009 for kernel upgrades [production]
08:52 <akosiaris> correction: reboot rdb1011 for kernel upgrades [production]
08:51 <akosiaris> reboot rdb1007 for kernel upgrades [production]
08:51 <akosiaris> nothing noticed with rdb1007 reboot for mw, jobqueue, api-gateway. changeprop had a minor backlog increase, but everything appears fine now. [production]
08:28 <akosiaris> reboot rdb1009 for kernel upgrades. possibly (but probably not) affected applications: changeprop, cpjobqueue, api-gateway, redisLockManager [production]
08:13 <kartik@deploy1002> Finished scap: Backport for [[gerrit:868215|Enable Section Translation on 6 WPs (T319177)]] (duration: 10m 55s) [production]
08:08 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host puppetdb2003.codfw.wmnet [production]
08:04 <kartik@deploy1002> kartik and kartik: Backport for [[gerrit:868215|Enable Section Translation on 6 WPs (T319177)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]