4251-4300 of 10000 results (68ms)
2022-08-04 ยง
19:49 <mvernon@cumin1001> START - Cookbook sre.hosts.remove-downtime for thanos-be2001.codfw.wmnet [production]
19:44 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 8 hosts [production]
19:44 <mvernon@cumin1001> START - Cookbook sre.hosts.remove-downtime for 8 hosts [production]
19:42 <Emperor> rebooting thanos-be2001 to fix drive ordering [production]
19:37 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for elastic2071.codfw.wmnet [production]
19:37 <bking@cumin1001> START - Cookbook sre.hosts.remove-downtime for elastic2071.codfw.wmnet [production]
19:31 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2071.codfw.wmnet with reason: T310146 [production]
19:31 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2071.codfw.wmnet with reason: T310146 [production]
19:13 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
19:12 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
19:12 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
19:12 <ryankemper@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
19:11 <ryankemper@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
19:11 <dancy> There were many errors during php-fpm restart due to failure to contact http://lvs2009:9090/pools/appservers-https_443/mw2361.codfw.wmnet and the like. [production]
19:11 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
19:10 <dancy@deploy1002> rebuilt and synchronized wikiversions files: group2 wikis to 1.39.0-wmf.23 refs T308076 [production]
19:09 <ryankemper@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
19:09 <ryankemper@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
19:05 <otto@deploy1002> helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: sync [production]
19:04 <otto@deploy1002> helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: sync [production]
19:04 <otto@deploy1002> helmfile [codfw] DONE helmfile.d/services/eventgate-analytics-external: sync [production]
19:03 <otto@deploy1002> helmfile [codfw] START helmfile.d/services/eventgate-analytics-external: sync [production]
19:03 <otto@deploy1002> helmfile [staging] DONE helmfile.d/services/eventgate-analytics-external: sync [production]
19:02 <otto@deploy1002> helmfile [staging] START helmfile.d/services/eventgate-analytics-external: sync [production]
19:02 <ottomata> roll-restarting eventgate-analytics-external to pick up backwards incompatible schema change - T314151 [production]
18:47 <ryankemper@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
18:46 <ryankemper@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
18:41 <cwhite> poweroff kafka-logging2003 - T310145 [production]
18:39 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw237[0-6].codfw.wmnet [production]
18:39 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 7 hosts [production]
18:39 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for 7 hosts [production]
18:35 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2369.codfw.wmnet [production]
18:35 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2369.codfw.wmnet [production]
18:35 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2368.codfw.wmnet [production]
18:35 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2368.codfw.wmnet [production]
18:35 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2367.codfw.wmnet [production]
18:35 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2367.codfw.wmnet [production]
18:35 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2369.codfw.wmnet [production]
18:35 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2368.codfw.wmnet [production]
18:35 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2367.codfw.wmnet [production]
18:35 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2366.codfw.wmnet [production]
18:35 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2366.codfw.wmnet [production]
18:34 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2366.codfw.wmnet [production]
18:30 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2279.codfw.wmnet [production]
18:30 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2278.codfw.wmnet [production]
18:29 <dzahn@cumin2002> conftool action : set/pooled=yes; selector: dc=codfw,name=mw2277.codfw.wmnet [production]
18:29 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2276.codfw.wmnet [production]
18:29 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2276.codfw.wmnet [production]
18:29 <dzahn@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2275.codfw.wmnet [production]
18:29 <dzahn@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2275.codfw.wmnet [production]