501-550 of 10000 results (58ms)
2022-08-16 ยง
16:08 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
16:02 <btullis@deploy1002> Finished deploy [airflow-dags/analytics@3c998da]: (no justification provided) (duration: 00m 12s) [production]
16:02 <btullis@deploy1002> Started deploy [airflow-dags/analytics@3c998da]: (no justification provided) [production]
15:48 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be2032.codfw.wmnet [production]
15:48 <mvernon@cumin1001> START - Cookbook sre.hosts.remove-downtime for ms-be2032.codfw.wmnet [production]
15:42 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1074.eqiad.wmnet with OS bullseye [production]
15:29 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ms-be2032.codfw.wmnet with reason: RAID battery failure [production]
15:29 <mvernon@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on ms-be2032.codfw.wmnet with reason: RAID battery failure [production]
15:25 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1074.eqiad.wmnet with reason: host reimage [production]
15:23 <ryankemper@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1074.eqiad.wmnet with reason: host reimage [production]
15:12 <jayme@cumin1001> END (PASS) - Cookbook sre.discovery.service-route-jayme (exit_code=0) [production]
15:10 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic1074.eqiad.wmnet with OS bullseye [production]
15:07 <jayme@cumin1001> START - Cookbook sre.discovery.service-route-jayme [production]
15:07 <jayme@cumin1001> END (PASS) - Cookbook sre.discovery.service-route-jayme (exit_code=0) [production]
15:07 <jayme@cumin1001> START - Cookbook sre.discovery.service-route-jayme [production]
14:31 <jayme@cumin1001> END (PASS) - Cookbook sre.discovery.service-route-jayme (exit_code=0) [production]
14:30 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1077.eqiad.wmnet with OS bullseye [production]
14:26 <jayme@cumin1001> START - Cookbook sre.discovery.service-route-jayme [production]
14:13 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1077.eqiad.wmnet with reason: host reimage [production]
14:10 <ryankemper@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1077.eqiad.wmnet with reason: host reimage [production]
14:01 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
14:00 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
14:00 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:59 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:57 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic1077.eqiad.wmnet with OS bullseye [production]
13:55 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic1057.eqiad.wmnet with OS bullseye [production]
13:55 <taavi@deploy1002> Synchronized wmf-config/InitialiseSettings.php: revert: Config: [[gerrit:823148|jawiki: Restrict abusefilter log view to "abusefilter-modify" user (T315199)]] (duration: 03m 12s) [production]
13:41 <taavi> UTC afternoon deploys done [production]
13:40 <taavi@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:823148|jawiki: Restrict abusefilter log view to "abusefilter-modify" user (T315199)]] (duration: 03m 21s) [production]
13:39 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:38 <jayme@cumin1001> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) [production]
13:38 <jayme@cumin1001> START - Cookbook sre.discovery.service-route [production]
13:38 <jayme@cumin1001> END (FAIL) - Cookbook sre.discovery.service-route (exit_code=1) [production]
13:38 <jayme@cumin1001> START - Cookbook sre.discovery.service-route [production]
13:38 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:38 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:37 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:36 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic1057.eqiad.wmnet with reason: host reimage [production]
13:33 <ryankemper@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on elastic1057.eqiad.wmnet with reason: host reimage [production]
13:24 <jayme@cumin1001> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:wikikube-staging-worker-eqiad [production]
13:24 <jayme@cumin1001> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) [production]
13:24 <jayme@cumin1001> START - Cookbook sre.discovery.service-route [production]
13:24 <taavi@deploy1002> Synchronized wmf-config: Config: [[gerrit:822718|kowiki: Change logo for 600k articles (T315127)]] (duration: 03m 11s) [production]
13:22 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:21 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:21 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:20 <taavi@deploy1002> Synchronized static/images: Config: [[gerrit:822717|kowiki: Add logo (legacy vector and vector-2022) for 600k articles (T315127)]] (duration: 03m 29s) [production]
13:20 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:17 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic1057.eqiad.wmnet with OS bullseye [production]
13:16 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db1169.eqiad.wmnet with reason: Maintenance [production]