7051-7100 of 10000 results (67ms)
2022-05-30 ยง
08:54 <jbond@cumin1001> START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors [production]
08:53 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:53 <jbond@cumin2002> START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet [production]
08:52 <jbond@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox2002.codfw.wmnet on all recursors [production]
08:52 <jbond@cumin1001> START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors [production]
08:45 <jbond> disable puppet fleet wide Gerrit:799344 [production]
08:42 <oblivian@deploy1002> Synchronized README: testing php restarts with scap, T266055 (duration: 00m 45s) [production]
08:42 <jbond@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet [production]
08:42 <jbond@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:40 <ladsgroup@deploy1002> Synchronized php-1.39.0-wmf.13/extensions/LiquidThreads/classes/Thread.php: Backport: [[gerrit:800705|Stop trying to pass legacy page_restrictions to RestrictionStore (T309460)]] (duration: 00m 47s) [production]
08:38 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:38 <jbond@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:36 <jbond@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox2002.codfw.wmnet on all recursors [production]
08:36 <jbond@cumin1001> START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors [production]
08:35 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:35 <jbond@cumin2002> START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet [production]
08:35 <jbond@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet [production]
08:35 <jbond@cumin2002> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
08:34 <jbond@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox2002.codfw.wmnet on all recursors [production]
08:34 <jbond@cumin1001> START - Cookbook sre.dns.wipe-cache netbox2002.codfw.wmnet on all recursors [production]
08:32 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:32 <jbond@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:29 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:29 <jbond@cumin2002> START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet [production]
08:18 <jbond@cumin2002> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host netbox2002.codfw.wmnet [production]
08:18 <jbond@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:13 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:13 <jbond@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:10 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:10 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:09 <jbond@cumin2002> START - Cookbook sre.dns.netbox [production]
08:09 <jbond@cumin2002> START - Cookbook sre.ganeti.makevm for new host netbox2002.codfw.wmnet [production]
08:08 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:08 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
08:08 <moritzm> installing dpkg security updates [production]
07:34 <_joe_> removing l10update leftovers from deployment servers in production [production]
07:02 <aqu@deploy1002> Finished deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) (duration: 00m 03s) [production]
07:02 <aqu@deploy1002> Started deploy [airflow-dags/analytics_test@3ae51e7]: (no justification provided) [production]
06:49 <elukey> restart kube-api on ml-serve-ctrl2002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource [production]
06:48 <elukey> restart kube-api on ml-serve-ctrl1002 as attempt to clear some high api latencies / HTTP 504 due to LIST to a specific knative resource [production]
06:39 <marostegui> Drop renamed revision_actor_temp on s4 T307906 [production]
06:36 <marostegui> Drop renamed revision_actor_temp on s8 T307906 [production]
06:10 <marostegui> Drop renamed revision_actor_temp on s5 T307906 [production]
06:01 <marostegui> Drop renamed revision_actor_temp on s7 T307906 [production]
05:35 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2088 (s1 and s2) T309485', diff saved to https://phabricator.wikimedia.org/P28913 and previous config saved to /var/cache/conftool/dbconfig/20220530-053459-marostegui.json [production]
05:28 <marostegui> Drop renamed revision_actor_temp on s2 T307906 [production]
05:26 <marostegui> Drop renamed revision_actor_temp on s6 T307906 [production]
05:15 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1184', diff saved to https://phabricator.wikimedia.org/P28911 and previous config saved to /var/cache/conftool/dbconfig/20220530-051555-marostegui.json [production]
04:38 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]
04:38 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1140.eqiad.wmnet with reason: Maintenance [production]