2022-05-30
§
|
08:29 |
<jbond@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet |
[production] |
08:18 |
<jbond@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet |
[production] |
08:18 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:13 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:13 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:10 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:10 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:09 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:09 |
<jbond@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet |
[production] |
08:08 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:08 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:08 |
<moritzm> |
installing dpkg security updates |
[production] |
07:34 |
<_joe_> |
removing l10update leftovers from deployment servers in production |
[production] |
07:02 |
<aqu@deploy1002> |
Finished deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) (duration: 00m 03s) |
[production] |
07:02 |
<aqu@deploy1002> |
Started deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) |
[production] |
06:49 |
<elukey> |
restart kube-api on ml-serve-ctrl2002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource |
[production] |
06:48 |
<elukey> |
restart kube-api on ml-serve-ctrl1002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource |
[production] |
06:39 |
<marostegui> |
Drop renamed revision_actor_temp on s4 T307906 |
[production] |
06:36 |
<marostegui> |
Drop renamed revision_actor_temp on s8 T307906 |
[production] |
06:10 |
<marostegui> |
Drop renamed revision_actor_temp on s5 T307906 |
[production] |
06:01 |
<marostegui> |
Drop renamed revision_actor_temp on s7 T307906 |
[production] |
05:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2088 (s1 and s2) T309485', diff saved to https://phabricator.wikimedia.org/P28913 and previous config saved to /var/cache/conftool/dbconfig/20220530-053459-marostegui.json |
[production] |
05:28 |
<marostegui> |
Drop renamed revision_actor_temp on s2 T307906 |
[production] |
05:26 |
<marostegui> |
Drop renamed revision_actor_temp on s6 T307906 |
[production] |
05:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1184', diff saved to https://phabricator.wikimedia.org/P28911 and previous config saved to /var/cache/conftool/dbconfig/20220530-051555-marostegui.json |
[production] |
04:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance |
[production] |
04:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance |
[production] |
04:38 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1131 (T298560)', diff saved to https://phabricator.wikimedia.org/P28910 and previous config saved to /var/cache/conftool/dbconfig/20220530-043837-ladsgroup.json |
[production] |
04:23 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P28909 and previous config saved to /var/cache/conftool/dbconfig/20220530-042332-ladsgroup.json |
[production] |
04:08 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P28908 and previous config saved to /var/cache/conftool/dbconfig/20220530-040827-ladsgroup.json |
[production] |
03:53 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1131 (T298560)', diff saved to https://phabricator.wikimedia.org/P28907 and previous config saved to /var/cache/conftool/dbconfig/20220530-035322-ladsgroup.json |
[production] |
02:00 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1131 (T298560)', diff saved to https://phabricator.wikimedia.org/P28906 and previous config saved to /var/cache/conftool/dbconfig/20220530-020011-ladsgroup.json |
[production] |
02:00 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1131.eqiad.wmnet with reason: Maintenance |
[production] |
02:00 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1131.eqiad.wmnet with reason: Maintenance |
[production] |
02:00 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28905 and previous config saved to /var/cache/conftool/dbconfig/20220530-020003-ladsgroup.json |
[production] |
01:44 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P28904 and previous config saved to /var/cache/conftool/dbconfig/20220530-014458-ladsgroup.json |
[production] |
01:29 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316', diff saved to https://phabricator.wikimedia.org/P28903 and previous config saved to /var/cache/conftool/dbconfig/20220530-012953-ladsgroup.json |
[production] |
01:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1113:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28902 and previous config saved to /var/cache/conftool/dbconfig/20220530-011448-ladsgroup.json |
[production] |
2022-05-29
§
|
22:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3317 (T298560)', diff saved to https://phabricator.wikimedia.org/P28901 and previous config saved to /var/cache/conftool/dbconfig/20220529-223940-ladsgroup.json |
[production] |
22:24 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3317', diff saved to https://phabricator.wikimedia.org/P28900 and previous config saved to /var/cache/conftool/dbconfig/20220529-222435-ladsgroup.json |
[production] |
22:09 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3317', diff saved to https://phabricator.wikimedia.org/P28899 and previous config saved to /var/cache/conftool/dbconfig/20220529-220930-ladsgroup.json |
[production] |
21:54 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3317 (T298560)', diff saved to https://phabricator.wikimedia.org/P28898 and previous config saved to /var/cache/conftool/dbconfig/20220529-215425-ladsgroup.json |
[production] |
19:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db1113:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28897 and previous config saved to /var/cache/conftool/dbconfig/20220529-193138-ladsgroup.json |
[production] |
19:31 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1113.eqiad.wmnet with reason: Maintenance |
[production] |
19:31 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1113.eqiad.wmnet with reason: Maintenance |
[production] |
19:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28896 and previous config saved to /var/cache/conftool/dbconfig/20220529-193130-ladsgroup.json |
[production] |
19:16 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3316', diff saved to https://phabricator.wikimedia.org/P28895 and previous config saved to /var/cache/conftool/dbconfig/20220529-191625-ladsgroup.json |
[production] |
19:01 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3316', diff saved to https://phabricator.wikimedia.org/P28894 and previous config saved to /var/cache/conftool/dbconfig/20220529-190119-ladsgroup.json |
[production] |
18:46 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1098:3316 (T298560)', diff saved to https://phabricator.wikimedia.org/P28893 and previous config saved to /var/cache/conftool/dbconfig/20220529-184614-ladsgroup.json |
[production] |
15:10 |
<jelto> |
cleanup stalled backups on gitlab1001, re-run full backup |
[production] |