2022-05-30
08:54 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors |
[production] |
08:53 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:53 |
<jbond@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet |
[production] |
08:52 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox2002.codfw.wmnet on all recursors |
[production] |
08:52 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors |
[production] |
08:45 |
<jbond> |
disable puppet fleet wide Gerrit:799344 |
[production] |
08:42 |
<oblivian@deploy1002> |
Synchronized README: testing php restarts with scap, T266055 (duration: 00m 45s) |
[production] |
08:42 |
<jbond@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet |
[production] |
08:42 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:40 |
<ladsgroup@deploy1002> |
Synchronized php-1.39.0-wmf.13/extensions/LiquidThreads/classes/Thread.php: Backport: [[gerrit:800705|Stop trying to pass legacy page_restrictions to RestrictionStore (T309460)]] (duration: 00m 47s) |
[production] |
08:38 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:38 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:36 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox2002.codfw.wmnet on all recursors |
[production] |
08:36 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors |
[production] |
08:35 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:35 |
<jbond@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet |
[production] |
08:35 |
<jbond@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet |
[production] |
08:35 |
<jbond@cumin2002> |
END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) |
[production] |
08:34 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox2002.codfw.wmnet on all recursors |
[production] |
08:34 |
<jbond@cumin1001> |
START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors |
[production] |
08:32 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:32 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:29 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:29 |
<jbond@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet |
[production] |
08:18 |
<jbond@cumin2002> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet |
[production] |
08:18 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:13 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:13 |
<jbond@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
08:10 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:10 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:09 |
<jbond@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
08:09 |
<jbond@cumin2002> |
START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet |
[production] |
08:08 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:08 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance |
[production] |
08:08 |
<moritzm> |
installing dpkg security updates |
[production] |
07:34 |
<_joe_> |
removing l10update leftovers from deployment servers in production |
[production] |
07:02 |
<aqu@deploy1002> |
Finished deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) (duration: 00m 03s) |
[production] |
07:02 |
<aqu@deploy1002> |
Started deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) |
[production] |
06:49 |
<elukey> |
restart kube-api on ml-serve-ctrl2002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource |
[production] |
06:48 |
<elukey> |
restart kube-api on ml-serve-ctrl1002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource |
[production] |
06:39 |
<marostegui> |
Drop renamed revision_actor_temp on s4 T307906 |
[production] |
06:36 |
<marostegui> |
Drop renamed revision_actor_temp on s8 T307906 |
[production] |
06:10 |
<marostegui> |
Drop renamed revision_actor_temp on s5 T307906 |
[production] |
06:01 |
<marostegui> |
Drop renamed revision_actor_temp on s7 T307906 |
[production] |
05:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2088 (s1 and s2) T309485', diff saved to https://phabricator.wikimedia.org/P28913 and previous config saved to /var/cache/conftool/dbconfig/20220530-053459-marostegui.json |
[production] |
05:28 |
<marostegui> |
Drop renamed revision_actor_temp on s2 T307906 |
[production] |
05:26 |
<marostegui> |
Drop renamed revision_actor_temp on s6 T307906 |
[production] |
05:15 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db1184', diff saved to https://phabricator.wikimedia.org/P28911 and previous config saved to /var/cache/conftool/dbconfig/20220530-051555-marostegui.json |
[production] |
04:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance |
[production] |
04:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance |
[production] |