951-1000 of 10000 results (84ms)
2023-06-26 §
21:54 <ryankemper@cumin1001> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
21:53 <eevans@cumin2002> START - Cookbook sre.discovery.service-route pool sessionstore in codfw: maintenance [production]
21:53 <urandom> pooling sessionstore/codfw for bullseye upgrades — T340043 [production]
21:45 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
21:44 <eevans@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore2003.codfw.wmnet with OS bullseye [production]
21:43 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
21:39 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
21:36 <ryankemper@cumin1001> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
21:26 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
21:22 <ryankemper@cumin1001> START - Cookbook sre.wdqs.restart [production]
21:22 <eevans@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore2003.codfw.wmnet with reason: host reimage [production]
21:21 <ryankemper@cumin1001> START - Cookbook sre.wdqs.restart [production]
21:18 <eevans@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore2003.codfw.wmnet with reason: host reimage [production]
21:15 <ryankemper@cumin1001> START - Cookbook sre.wdqs.restart [production]
21:13 <ryankemper@puppetmaster1001> conftool action : set/weight=0:pooled=inactive; selector: name=wdqs2022.* [production]
21:13 <ryankemper@puppetmaster1001> conftool action : set/weight=0:pooled=inactive; selector: name=wdqs2021.* [production]
21:13 <ryankemper@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
21:02 <eevans@cumin2002> START - Cookbook sre.hosts.reimage for host sessionstore2003.codfw.wmnet with OS bullseye [production]
20:55 <eevans@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host sessionstore2003.codfw.wmnet with OS bullseye [production]
20:45 <eevans@cumin2002> START - Cookbook sre.hosts.reimage for host sessionstore2003.codfw.wmnet with OS bullseye [production]
20:42 <eevans@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore2001.codfw.wmnet with OS bullseye [production]
20:34 <brennen@deploy1002> Finished deploy [phabricator/deployment@0529926]: deploy latest state to phab1004 (duration: 00m 31s) [production]
20:33 <brennen@deploy1002> Started deploy [phabricator/deployment@0529926]: deploy latest state to phab1004 [production]
20:30 <brennen@deploy1002> Finished deploy [phabricator/deployment@a25a737]: deploy latest state to phab1004 (duration: 00m 34s) [production]
20:30 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab2002.codfw.wmnet with reason: patch application [production]
20:30 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 0:15:00 on phab2002.codfw.wmnet with reason: patch application [production]
20:30 <brennen@deploy1002> Started deploy [phabricator/deployment@a25a737]: deploy latest state to phab1004 [production]
20:29 <brennen@deploy1002> Finished deploy [phabricator/deployment@a25a737]: deploy latest state to phab2002 (duration: 00m 38s) [production]
20:29 <brennen@deploy1002> Started deploy [phabricator/deployment@a25a737]: deploy latest state to phab2002 [production]
20:28 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1004.eqiad.wmnet with reason: patch application [production]
20:28 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 0:15:00 on phab1004.eqiad.wmnet with reason: patch application [production]
20:27 <brennen> deploying minor phabricator updates shortly [production]
20:27 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1004.eqiad.wmnet with reason: first setup [production]
20:27 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 0:15:00 on phab1004.eqiad.wmnet with reason: first setup [production]
20:18 <eevans@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore2001.codfw.wmnet with reason: host reimage [production]
20:15 <eevans@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore2001.codfw.wmnet with reason: host reimage [production]
20:00 <eevans@cumin2002> START - Cookbook sre.hosts.reimage for host sessionstore2001.codfw.wmnet with OS bullseye [production]
19:49 <akosiaris> force puppet run on cp hosts T340483 [production]
19:48 <akosiaris> revert "Redirect www.mediawiki.org to mw-on-k8s", debugging T340483 [production]
19:24 <eevans@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sessionstore2002.codfw.wmnet with OS bullseye [production]
19:02 <eevans@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sessionstore2002.codfw.wmnet with reason: host reimage [production]
18:57 <eevans@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on sessionstore2002.codfw.wmnet with reason: host reimage [production]
18:42 <eevans@cumin2002> START - Cookbook sre.hosts.reimage for host sessionstore2002.codfw.wmnet with OS bullseye [production]
18:38 <eevans@cumin2002> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) depool sessionstore in codfw: maintenance [production]
18:33 <eevans@cumin2002> START - Cookbook sre.discovery.service-route depool sessionstore in codfw: maintenance [production]
18:33 <urandom> depooling sessionstore/codfw for bullseye upgrades — T340043 [production]
18:07 <otto@deploy1002> helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply [production]
18:07 <otto@deploy1002> helmfile [eqiad] START helmfile.d/services/eventgate-main: apply [production]
18:06 <otto@deploy1002> helmfile [codfw] DONE helmfile.d/services/eventgate-main: apply [production]
18:05 <otto@deploy1002> helmfile [codfw] START helmfile.d/services/eventgate-main: apply [production]