2021-01-29
§
|
23:26 |
<razzi@cumin1001> |
END (PASS) - Cookbook sre.kafka.reboot-workers (exit_code=0) for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 |
[production] |
22:36 |
<dancy@deploy1001> |
Finished scap: MW servers complaining about l10n files after .27 rollback (duration: 07m 22s) |
[production] |
22:29 |
<dancy@deploy1001> |
Started scap: MW servers complaining about l10n files after .27 rollback |
[production] |
22:26 |
<dancy@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.27 |
[production] |
22:20 |
<reedy@deploy1001> |
Synchronized php-1.36.0-wmf.27/includes/parser/CacheTime.php: CacheTime: Extra protection for rollback unserialization T273007 (duration: 01m 00s) |
[production] |
22:14 |
<dancy@deploy1001> |
rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.28 |
[production] |
22:09 |
<dancy@deploy1001> |
scap failed: average error rate on 8/9 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/83629bcb5560d11e61d3085c89dd9ed6 for details) |
[production] |
21:42 |
<razzi> |
rebalance kafka partitions for codfw.resource_change |
[production] |
21:40 |
<razzi@cumin1001> |
START - Cookbook sre.kafka.reboot-workers for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 |
[production] |
19:26 |
<razzi@cumin1001> |
END (FAIL) - Cookbook sre.kafka.reboot-workers (exit_code=99) for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 |
[production] |
19:26 |
<razzi@cumin1001> |
START - Cookbook sre.kafka.reboot-workers for Kafka test cluster: Reboot kafka nodes - razzi@cumin1001 |
[production] |
18:50 |
<hashar> |
CI slightly overloaded due to a surge of library updates but is otherwise processing changes |
[production] |
17:31 |
<reedy@deploy1001> |
Synchronized php-1.36.0-wmf.28/extensions/WikiEditor/modules/jquery.wikiEditor.toolbar.config.js: T273231 (duration: 01m 02s) |
[production] |
16:56 |
<effie> |
depool mw1403 and mw1405 |
[production] |
15:46 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.ganeti.makevm (exit_code=0) for new host an-test-presto1001.eqiad.wmnet |
[production] |
15:27 |
<elukey@cumin1001> |
START - Cookbook sre.ganeti.makevm for new host an-test-presto1001.eqiad.wmnet |
[production] |
14:58 |
<ariel@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on snapshot1007.eqiad.wmnet with reason: REIMAGE |
[production] |
14:56 |
<ariel@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1007.eqiad.wmnet with reason: REIMAGE |
[production] |
13:50 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
13:50 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . |
[production] |
13:50 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
13:49 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . |
[production] |
13:49 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
13:48 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
13:47 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . |
[production] |
13:47 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
13:47 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
13:16 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
13:16 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . |
[production] |
13:16 |
<akosiaris@deploy1001> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
13:05 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . |
[production] |
13:05 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
13:05 |
<akosiaris@deploy1001> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
13:02 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'staging' . |
[production] |
13:02 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
13:02 |
<akosiaris@deploy1001> |
helmfile [staging] Ran 'sync' command on namespace 'linkrecommendation' for release 'production' . |
[production] |
12:38 |
<hnowlan> |
uploaded osmborder_0.1.0-2~buster0 package to buster-wikimedia |
[production] |
12:00 |
<gilles@deploy1001> |
Finished deploy [performance/coal@b0d3b59]: T271208 Filter out canary events (duration: 00m 06s) |
[production] |
12:00 |
<gilles@deploy1001> |
Started deploy [performance/coal@b0d3b59]: T271208 Filter out canary events |
[production] |
11:42 |
<dcausse@deploy1001> |
Synchronized wmf-config/unitConversionConfig.json: T270252: Update unitConversionConfig.json (duration: 01m 01s) |
[production] |
11:39 |
<gilles@deploy1001> |
Finished deploy [performance/navtiming@ae8310a]: T271208 Fix canary event check (duration: 00m 05s) |
[production] |
11:39 |
<gilles@deploy1001> |
Started deploy [performance/navtiming@ae8310a]: T271208 Fix canary event check |
[production] |
11:26 |
<gilles@deploy1001> |
Finished deploy [performance/navtiming@e7712c3]: T271208 Log instead of hard error on missing wiki field (duration: 00m 06s) |
[production] |
11:26 |
<gilles@deploy1001> |
Started deploy [performance/navtiming@e7712c3]: T271208 Log instead of hard error on missing wiki field |
[production] |
11:06 |
<gilles@deploy1001> |
Finished deploy [performance/navtiming@125f6be]: T271208 Ignore canary events (duration: 00m 05s) |
[production] |
11:06 |
<gilles@deploy1001> |
Started deploy [performance/navtiming@125f6be]: T271208 Ignore canary events |
[production] |
11:04 |
<elukey> |
upload presto-* version 0.246-1 packages to buster/stretch-wikimedia |
[production] |
10:54 |
<jynus@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
10:45 |
<jynus@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
10:35 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'db1078 (re)pooling @ 100%: After upgrading the kernel', diff saved to https://phabricator.wikimedia.org/P14050 and previous config saved to /var/cache/conftool/dbconfig/20210129-103505-root.json |
[production] |