2022-03-04
17:59 <btullis@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:57 <btullis@cumin1001> START - Cookbook sre.dns.netbox [production]
17:57 <btullis@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:48 <btullis@cumin1001> START - Cookbook sre.dns.netbox [production]
17:46 <mforns@deploy1002> Finished deploy [airflow-dags/analytics@19520c1]: (no justification provided) (duration: 00m 07s) [production]
17:46 <mforns@deploy1002> Started deploy [airflow-dags/analytics@19520c1]: (no justification provided) [production]
17:39 <mforns@deploy1002> Finished deploy [airflow-dags/analytics_test@19520c1]: (no justification provided) (duration: 00m 08s) [production]
17:39 <mforns@deploy1002> Started deploy [airflow-dags/analytics_test@19520c1]: (no justification provided) [production]
17:09 <mforns@deploy1002> Finished deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) (duration: 00m 08s) [production]
17:09 <mforns@deploy1002> Started deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) [production]
16:35 <mforns@deploy1002> Finished deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) (duration: 00m 07s) [production]
16:35 <mforns@deploy1002> Started deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) [production]
16:13 <mforns@deploy1002> Finished deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) (duration: 00m 10s) [production]
16:13 <mforns@deploy1002> Started deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) [production]
16:06 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1116.eqiad.wmnet with reason: Maintenance [production]
16:06 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1116.eqiad.wmnet with reason: Maintenance [production]
16:06 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1005.eqiad.wmnet with reason: Maintenance [production]
16:06 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1005.eqiad.wmnet with reason: Maintenance [production]
16:06 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1099:3318 (T300992)', diff saved to https://phabricator.wikimedia.org/P21856 and previous config saved to /var/cache/conftool/dbconfig/20220304-160629-ladsgroup.json [production]
16:03 <mforns@deploy1002> Finished deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) (duration: 00m 03s) [production]
16:03 <mforns@deploy1002> Started deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) [production]
15:59 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1086.eqiad.wmnet with OS buster [production]
15:58 <vgutierrez> pool cp1086 with HAProxy as TLS termination layer - T290005 [production]
15:56 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp2038.codfw.wmnet with OS buster [production]
15:51 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1099:3318', diff saved to https://phabricator.wikimedia.org/P21854 and previous config saved to /var/cache/conftool/dbconfig/20220304-155124-ladsgroup.json [production]
15:51 <vgutierrez> pool cp2038 with HAProxy as TLS termination layer - T290005 [production]
15:49 <mforns@deploy1002> Finished deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) (duration: 00m 07s) [production]
15:49 <mforns@deploy1002> Started deploy [airflow-dags/analytics_test@1388c61]: (no justification provided) [production]
15:41 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp1086.eqiad.wmnet with reason: host reimage [production]
15:38 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1086.eqiad.wmnet with reason: host reimage [production]
15:37 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp2038.codfw.wmnet with reason: host reimage [production]
15:36 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1099:3318', diff saved to https://phabricator.wikimedia.org/P21852 and previous config saved to /var/cache/conftool/dbconfig/20220304-153619-ladsgroup.json [production]
15:34 <XioNoX> blackhole IPs - T303055 [production]
15:34 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp2038.codfw.wmnet with reason: host reimage [production]
15:22 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reimage for host cp1086.eqiad.wmnet with OS buster [production]
15:21 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1099:3318 (T300992)', diff saved to https://phabricator.wikimedia.org/P21851 and previous config saved to /var/cache/conftool/dbconfig/20220304-152114-ladsgroup.json [production]
15:20 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1099:3318 (T300992)', diff saved to https://phabricator.wikimedia.org/P21850 and previous config saved to /var/cache/conftool/dbconfig/20220304-152007-ladsgroup.json [production]
15:20 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1099.eqiad.wmnet with reason: Maintenance [production]
15:20 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db1099.eqiad.wmnet with reason: Maintenance [production]
15:19 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on 12 hosts with reason: Maintenance [production]
15:19 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 12:00:00 on 12 hosts with reason: Maintenance [production]
15:19 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2079.codfw.wmnet with reason: Maintenance [production]
15:19 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on db2079.codfw.wmnet with reason: Maintenance [production]
15:19 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1172 (T300992)', diff saved to https://phabricator.wikimedia.org/P21849 and previous config saved to /var/cache/conftool/dbconfig/20220304-151937-ladsgroup.json [production]
15:16 <vgutierrez@cumin1001> START - Cookbook sre.hosts.reimage for host cp2038.codfw.wmnet with OS buster [production]
15:04 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P21848 and previous config saved to /var/cache/conftool/dbconfig/20220304-150433-ladsgroup.json [production]
14:59 <ebernhardson> restart elasticsearch_6@production-search-psi-eqiad.service on elastic1049 to resolve CirrusSearchJVMGCOldPoolFlatlined alert [production]
14:49 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1172', diff saved to https://phabricator.wikimedia.org/P21847 and previous config saved to /var/cache/conftool/dbconfig/20220304-144926-ladsgroup.json [production]
14:46 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp3059.esams.wmnet with OS buster [production]
14:43 <vgutierrez> pool cp3059 with HAProxy as TLS termination layer - T290005 [production]