2020-11-16
ยง
|
15:57 |
<hnowlan@cumin1001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
15:57 |
<hnowlan> |
roll-restarting eqiad restbase for java security updates |
[production] |
15:56 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) |
[production] |
15:50 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
15:40 |
<cdanis@cumin1001> |
END (PASS) - Cookbook sre.network.cf (exit_code=0) |
[production] |
15:40 |
<cdanis@cumin1001> |
START - Cookbook sre.network.cf |
[production] |
14:16 |
<hnowlan@cumin1001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
14:12 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Repool pc1007 in pc1 after restarting mysql T266483 (duration: 00m 59s) |
[production] |
14:06 |
<marostegui> |
Restart pc1007's mysql T266483 |
[production] |
14:06 |
<marostegui@deploy1001> |
Synchronized wmf-config/db-eqiad.php: Depool pc1007 and place pc1010 instead of it T266483 (duration: 01m 00s) |
[production] |
13:23 |
<hnowlan@cumin1001> |
END (FAIL) - Cookbook sre.cassandra.roll-restart (exit_code=99) |
[production] |
13:00 |
<kormat> |
running schema change against s1 in codfw T259831 |
[production] |
12:59 |
<kormat@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
12:59 |
<kormat@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:43 |
<moritzm> |
installing tcpdump security updates |
[production] |
12:35 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
12:35 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:25 |
<hnowlan@cumin1001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
12:25 |
<hnowlan> |
roll-restarting restbase-codfw |
[production] |
12:24 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) |
[production] |
12:10 |
<hnowlan@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
12:10 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:49 |
<hnowlan> |
roll restarting sessionstore for java updates |
[production] |
11:49 |
<hnowlan@cumin1001> |
START - Cookbook sre.cassandra.roll-restart |
[production] |
11:44 |
<dcaro> |
etcd5 member added, creating instance toolsbeta-test-k8s-etcd6 and adding to the etcd cluster (T267140) |
[toolsbeta] |
11:27 |
<dcaro> |
Creating instance toolsbeta-test-k8s-etcd5 and adding to the etcd cluster (T267140) |
[toolsbeta] |
11:13 |
<moritzm> |
installing poppler security updates |
[production] |
10:46 |
<klausman@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:46 |
<klausman@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:45 |
<dcaro@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:45 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:44 |
<dcaro@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
10:44 |
<dcaro@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:41 |
<klausman> |
about to update stat1008 to new kernel and rocm |
[analytics] |
09:31 |
<gehel@cumin2001> |
END (FAIL) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=99) |
[production] |
09:31 |
<gehel@cumin2001> |
START - Cookbook sre.elasticsearch.force-shard-allocation |
[production] |
09:13 |
<joal> |
Rerun webrequest-refine for hours 0 to 6 of day 2020-11-16 - This will prevent webrequest-druid-daily to get loaded with incoherent data due to bucketing change |
[analytics] |
08:45 |
<joal> |
Correct webrequest job directly on HDFS and restart webrequest bundle oozie job |
[analytics] |
08:43 |
<joal> |
Kill webrequest bundle to correct typo |
[analytics] |
08:39 |
<godog> |
centrallog1001 move invalid config /etc/logrotate.d/logrotate-debug to /etc |
[production] |
08:35 |
<moritzm> |
installing codemirror-js security updates |
[production] |
08:32 |
<XioNoX> |
asw-c-codfw> request system power-off member 7 - T267865 |
[production] |
08:31 |
<joal> |
Restart webrequest bundle oozie job with update |
[analytics] |
08:31 |
<joal> |
Restart webrequest bun |
[analytics] |
08:25 |
<joal> |
Deploying refinery onto HDFS |
[analytics] |
08:24 |
<joal@deploy1001> |
Finished deploy [analytics/refinery@3df51cb] (thin): Analytics special train for webrequest table update THIN [analytics/refinery@3df51cb] (duration: 00m 07s) |
[production] |
08:23 |
<joal@deploy1001> |
Started deploy [analytics/refinery@3df51cb] (thin): Analytics special train for webrequest table update THIN [analytics/refinery@3df51cb] |
[production] |
08:23 |
<joal@deploy1001> |
Finished deploy [analytics/refinery@3df51cb]: Analytics special train for webrequest table update [analytics/refinery@3df51cb] (duration: 10m 09s) |
[production] |
08:13 |
<joal> |
Deploying refinery with scap |
[analytics] |
08:13 |
<joal@deploy1001> |
Started deploy [analytics/refinery@3df51cb]: Analytics special train for webrequest table update [analytics/refinery@3df51cb] |
[production] |