4401-4450 of 10000 results (62ms)
2022-05-30 §
08:13 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:13 <jbond@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:10 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:10 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:09 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:09 <jbond@cumin2002> START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet [production]
08:08 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:08 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:08 <moritzm> installing dpkg security updates [production]
07:34 <_joe_> removing l10update leftovers from deployment servers in production [production]
07:02 <aqu@deploy1002> Finished deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) (duration: 00m 03s) [production]
07:02 <aqu@deploy1002> Started deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) [production]
06:49 <elukey> restart kube-api on ml-serve-ctrl2002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource [production]
06:48 <elukey> restart kube-api on ml-serve-ctrl1002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource [production]
06:39 <marostegui> Drop renamed revision_actor_temp on s4 T307906 [production]
06:36 <marostegui> Drop renamed revision_actor_temp on s8 T307906 [production]
06:10 <marostegui> Drop renamed revision_actor_temp on s5 T307906 [production]
06:01 <marostegui> Drop renamed revision_actor_temp on s7 T307906 [production]
05:35 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2088 (s1 and s2) T309485', diff saved to https://phabricator.wikimedia.org/P28913 and previous config saved to /var/cache/conftool/dbconfig/20220530-053459-marostegui.json [production]
05:28 <marostegui> Drop renamed revision_actor_temp on s2 T307906 [production]
05:26 <marostegui> Drop renamed revision_actor_temp on s6 T307906 [production]
05:15 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1184', diff saved to https://phabricator.wikimedia.org/P28911 and previous config saved to /var/cache/conftool/dbconfig/20220530-051555-marostegui.json [production]
04:38 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]
04:38 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]
04:38 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131 (T298560)', diff saved to https://phabricator.wikimedia.org/P28910 and previous config saved to /var/cache/conftool/dbconfig/20220530-043837-ladsgroup.json [production]
04:23 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P28909 and previous config saved to /var/cache/conftool/dbconfig/20220530-042332-ladsgroup.json [production]
04:08 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P28908 and previous config saved to /var/cache/conftool/dbconfig/20220530-040827-ladsgroup.json [production]
03:53 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1131 (T298560)', diff saved to https://phabricator.wikimedia.org/P28907 and previous config saved to /var/cache/conftool/dbconfig/20220530-035322-ladsgroup.json [production]
02:00 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1131 (T298560)', diff saved to https://phabricator.wikimedia.org/P28906 and previous config saved to /var/cache/conftool/dbconfig/20220530-020011-ladsgroup.json [production]
02:00 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1131.eqiad.wmnet with reason: Maintenance [production]
02:00 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1131.eqiad.wmnet with reason: Maintenance [production]
02:00 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28905 and previous config saved to /var/cache/conftool/dbconfig/20220530-020003-ladsgroup.json [production]
01:44 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P28904 and previous config saved to /var/cache/conftool/dbconfig/20220530-014458-ladsgroup.json [production]
01:29 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P28903 and previous config saved to /var/cache/conftool/dbconfig/20220530-012953-ladsgroup.json [production]
01:14 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28902 and previous config saved to /var/cache/conftool/dbconfig/20220530-011448-ladsgroup.json [production]
2022-05-29 §
22:39 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3317 (T298560)', diff saved to https://phabricator.wikimedia.org/P28901 and previous config saved to /var/cache/conftool/dbconfig/20220529-223940-ladsgroup.json [production]
22:24 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3317', diff saved to https://phabricator.wikimedia.org/P28900 and previous config saved to /var/cache/conftool/dbconfig/20220529-222435-ladsgroup.json [production]
22:09 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3317', diff saved to https://phabricator.wikimedia.org/P28899 and previous config saved to /var/cache/conftool/dbconfig/20220529-220930-ladsgroup.json [production]
21:54 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3317 (T298560)', diff saved to https://phabricator.wikimedia.org/P28898 and previous config saved to /var/cache/conftool/dbconfig/20220529-215425-ladsgroup.json [production]
19:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1113:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28897 and previous config saved to /var/cache/conftool/dbconfig/20220529-193138-ladsgroup.json [production]
19:31 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1113.eqiad.wmnet with reason: Maintenance [production]
19:31 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1113.eqiad.wmnet with reason: Maintenance [production]
19:31 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28896 and previous config saved to /var/cache/conftool/dbconfig/20220529-193130-ladsgroup.json [production]
19:16 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3316', diff saved to https://phabricator.wikimedia.org/P28895 and previous config saved to /var/cache/conftool/dbconfig/20220529-191625-ladsgroup.json [production]
19:01 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3316', diff saved to https://phabricator.wikimedia.org/P28894 and previous config saved to /var/cache/conftool/dbconfig/20220529-190119-ladsgroup.json [production]
18:46 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1098:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28893 and previous config saved to /var/cache/conftool/dbconfig/20220529-184614-ladsgroup.json [production]
15:10 <jelto> cleanup stalled backups on gitlab1001, re-run full backup [production]
14:48 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depooling db1098:3317 (T298560)', diff saved to https://phabricator.wikimedia.org/P28892 and previous config saved to /var/cache/conftool/dbconfig/20220529-144839-ladsgroup.json [production]
14:48 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1098.eqiad.wmnet with reason: Maintenance [production]
14:48 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1098.eqiad.wmnet with reason: Maintenance [production]