2019-11-14
14:01 <ema> depool cp3060 and reimage as text_ats T227432 [production]
13:37 <ladsgroup@deploy1001> scap failed: average error rate on 7/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/db09a36be5ed3e81155041f7d46ad040 for details) [production]
13:35 <gehel> depool wdqs1004 to allow catching up on lag - T238229 [production]
13:06 <bblack> removing digicert-2019 files from cache nodes - https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/550829/ [production]
12:24 <mobrovac@deploy1001> Finished deploy [restbase/deploy@58cf5ae]: Fix /metrics/mediarequests/top/ indentation (duration: 14m 52s) [production]
12:09 <mobrovac@deploy1001> Started deploy [restbase/deploy@58cf5ae]: Fix /metrics/mediarequests/top/ indentation [production]
11:58 <mobrovac@deploy1001> Finished deploy [restbase/deploy@58cf5ae] (dev-cluster): Fix /metrics/mediarequests/top/ indentation (duration: 02m 50s) [production]
11:55 <mobrovac@deploy1001> Started deploy [restbase/deploy@58cf5ae] (dev-cluster): Fix /metrics/mediarequests/top/ indentation [production]
11:26 <gehel@cumin1001> END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) [production]
10:48 <vgutierrez> Rolling restart of ats-tls/ats-backend to upgrade to 8.0.5-1wm11 - T238307 [production]
10:44 <vgutierrez> uploaded trafficserver-8.0.5-1wm11 to apt.wikimedia.org (stretch) - T238307 [production]
10:43 <ema> pool cp3058 with ATS backend T227432 [production]
10:25 <ema@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:23 <ema@cumin1001> START - Cookbook sre.hosts.downtime [production]
10:20 <godog> netbox1001 bandaid/symlink /srv/deployment/netbox/deploy/src/netbox/project-static to 'static' [production]
10:06 <gehel> copying journal from wdqs1007 to wdqs1005 - T238232 [production]
10:05 <gehel@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
10:03 <Urbanecm> Run deleteEqualMessages.php --delete for cswiki and viwiki [production]
09:59 <ema@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:57 <ema@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:55 <gehel> depool wdqs (public) eqiad - high lag - T238229 [production]
09:34 <ema> depool cp3058 and reimage as text_ats T227432 [production]
09:31 <marostegui> Compare wikidatawiki.pagelinks between labsdb1011 and labsdb1010 - T233986 [production]
09:25 <moritzm> installing ghostscript updates on thumbor1001 [production]
09:24 <marostegui> Stop mysql on db2067 to clone db21133 - T238183 [production]
09:20 <marostegui@cumin1001> dbctl commit (dc=all): 'Full weight to db1089 on special groups for s1 T223151', diff saved to https://phabricator.wikimedia.org/P9635 and previous config saved to /var/cache/conftool/dbconfig/20191114-092006-marostegui.json [production]
09:06 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:05 <marostegui> Compare wikidatawiki.pagelinks between db1124:3318 and labsdb1010 - T233986 [production]
09:04 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
08:42 <marostegui> Remove ar_comment from triggers on db1124:3315 - T234704 [production]
08:41 <marostegui> Deploy schema change with replication on db1082, this will generate lag on s5 labs - T233135 T234066 [production]
08:40 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:40 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1082 for schema change', diff saved to https://phabricator.wikimedia.org/P9634 and previous config saved to /var/cache/conftool/dbconfig/20191114-084043-marostegui.json [production]
08:38 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) [production]
08:38 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
08:38 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime [production]
08:37 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1110 after schema change', diff saved to https://phabricator.wikimedia.org/P9633 and previous config saved to /var/cache/conftool/dbconfig/20191114-083729-marostegui.json [production]
08:03 <eileen> process-control config revision is 6adc66a20b re-enable backfill [production]
08:00 <marostegui@cumin1001> dbctl commit (dc=all): 'Pool a non partitioned slave db1089 on special groups for s1 T223151', diff saved to https://phabricator.wikimedia.org/P9632 and previous config saved to /var/cache/conftool/dbconfig/20191114-080038-marostegui.json [production]
07:54 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1103:3312 T235599', diff saved to https://phabricator.wikimedia.org/P9631 and previous config saved to /var/cache/conftool/dbconfig/20191114-075449-marostegui.json [production]
07:41 <eileen> process-control config revision is b7c2cf7227 - disabled backfill again - some error? [production]
07:29 <eileen> process-control config revision is 909108622d re-enable omnirecipient date repair job [production]
07:25 <eileen> process-control config revision is d3ebeddcc1 (I re-enabled the old backfill job) [production]
07:12 <moritzm> installing intel-microcode updates [production]
06:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1067', diff saved to https://phabricator.wikimedia.org/P9630 and previous config saved to /var/cache/conftool/dbconfig/20191114-065309-marostegui.json [production]
06:16 <marostegui> Stop replication on db1067 [production]
06:01 <marostegui@cumin2001> dbctl commit (dc=all): 'Promote db1083 to s1 master and remove read-only from s1 T234800', diff saved to https://phabricator.wikimedia.org/P9629 and previous config saved to /var/cache/conftool/dbconfig/20191114-060138-marostegui.json [production]
06:00 <marostegui@cumin2001> dbctl commit (dc=all): 'Set s1 as read-only for maintenance T234800', diff saved to https://phabricator.wikimedia.org/P9628 and previous config saved to /var/cache/conftool/dbconfig/20191114-060026-marostegui.json [production]
06:00 <marostegui> Starting s1 failover from db1067 to db1083 - T234800 [production]
05:51 <jynus> stopping db1114 replication [production]