2020-12-18
§
|
18:36 |
<andrew@deploy1001> |
Started deploy [horizon/deploy@89b308c]: update codfw1dev deploy |
[production] |
18:36 |
<andrew@deploy1001> |
Finished deploy [horizon/deploy@89b308c]: update codfw1dev deploy (duration: 00m 09s) |
[production] |
18:36 |
<andrew@deploy1001> |
Started deploy [horizon/deploy@89b308c]: update codfw1dev deploy |
[production] |
18:07 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on labstore1005.eqiad.wmnet with reason: REIMAGE |
[production] |
18:05 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on labstore1005.eqiad.wmnet with reason: REIMAGE |
[production] |
17:59 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
17:40 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
17:28 |
<andrew@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudweb2001-dev.wikimedia.org with reason: REIMAGE |
[production] |
17:26 |
<andrew@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudweb2001-dev.wikimedia.org with reason: REIMAGE |
[production] |
17:09 |
<dcaro> |
finished cleaning up the dangling snapshots from cloudvirt1026 (T270478) |
[admin] |
17:08 |
<dcaro> |
removing dangling rbd snapshots (for backups on cloudvirt1026) (T270478) |
[admin] |
17:06 |
<dcaro> |
finished cleaning up the dangling snapshots from cloudvirt1025 (T270478) |
[admin] |
17:05 |
<dcaro> |
removing dangling rbd snapshots (for backups on cloudvirt1025) (T270478) |
[admin] |
17:00 |
<dcaro> |
finished cleaning up the dangling snapshots from cloudvirt1021 (T270478) |
[admin] |
16:58 |
<dcaro> |
removing dangling rbd snapshots (for backups on cloudvirt1021) (T270478) |
[admin] |
16:56 |
<dcaro> |
finished cleaning up the dangling snapshots from cloudvirt1022 (T270478) |
[admin] |
16:55 |
<dcaro> |
removing dangling rbd snapshots (for backups on cloudvirt1022) (T270478) |
[admin] |
16:54 |
<dcaro> |
finished cleaning up the dangling snapshots from cloudvirt1023 (T270478) |
[admin] |
16:51 |
<dcaro> |
removing dangling rbd snapshots (for backups on cloudvirt1023) (T270478) |
[admin] |
16:47 |
<dcaro> |
finished cleaning up the dangling snapshots from cloudvirt1024, freed ~12% of the capacity (T270478) |
[admin] |
16:21 |
<dcaro> |
removing dangling rbd snapshots (for backups on cloudvirt1024) (T270478) |
[admin] |
16:19 |
<shdubsh> |
restart logstash on logstash2004 |
[production] |
16:13 |
<andrewbogott> |
setting autoscale to 'off' for both ceph pools (eqiad1-compute and eqiad1-glance-images) because we like how things are set and the autoscaler does not |
[admin] |
16:00 |
<cmjohnson@cumin1001> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
15:51 |
<cmjohnson@cumin1001> |
START - Cookbook sre.dns.netbox |
[production] |
14:52 |
<hashar> |
Updating https://integration.wikimedia.org/ci/job/wikidata-query-flink-swift-plugin-maven-java8-docker-site-publish/ to run sonar with Java 11 (build remains on Java 8) T264873 |
[releng] |
14:10 |
<elukey> |
restore stat1004 to its previous settings for kerberos credential cache |
[analytics] |
13:33 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2021.codfw.wmnet with reason: REIMAGE |
[production] |
13:31 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1021.eqiad.wmnet with reason: REIMAGE |
[production] |
13:31 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc2021.codfw.wmnet with reason: REIMAGE |
[production] |
13:29 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1021.eqiad.wmnet with reason: REIMAGE |
[production] |
13:22 |
<marostegui> |
Compress clouddb1018:3312 clouddb1014:3312 T270473 |
[production] |
10:59 |
<jynus> |
starting test swift backup of enwiki on a single thread towards dbstore2003 T264189 |
[production] |
10:53 |
<jynus> |
returning db2102 to its original state |
[production] |
10:52 |
<arturo> |
live-hacking local puppetmaster with https://gerrit.wikimedia.org/r/c/operations/puppet/+/650470 (T267966) |
[toolsbeta] |
10:42 |
<arturo> |
updated facts from the tools project: `PUPPET_MASTER="tools-puppetmaster-02.eqiad.wmflabs" modules/puppet_compiler/files/compiler-update-facts` |
[puppet-diffs] |
10:33 |
<dcaro> |
purging rbd snapshots for image fc6fb78b-4515-4dcc-8254-591b9fe01762 (T270478) |
[admin] |
10:20 |
<hashar@deploy1001> |
Finished deploy [integration/docroot@1166384]: noop: clear out proper env variable in tests (duration: 00m 07s) |
[production] |
10:20 |
<hashar@deploy1001> |
Started deploy [integration/docroot@1166384]: noop: clear out proper env variable in tests |
[production] |
09:13 |
<marostegui> |
Compress clouddb1018:3317 clouddb1014:3317 T270473 |
[production] |
08:26 |
<jynus> |
temporarily taking db2102 offline for mysql testing |
[production] |
07:54 |
<elukey> |
on kafka-test10[08-10] - "ip addr flush dev ens5; systemctl restart ifup@ens5.service" |
[production] |
07:37 |
<legoktm> |
reloaded zuul for https://gerrit.wikimedia.org/r/649745 |
[releng] |
07:06 |
<marostegui> |
Stop mysql on db1124:3313 T268742 |
[production] |
07:02 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Remove es1013 from dbctl T268436', diff saved to https://phabricator.wikimedia.org/P13600 and previous config saved to /var/cache/conftool/dbconfig/20201218-070235-marostegui.json |
[production] |
07:00 |
<marostegui> |
Compress clouddb1019:3316 clouddb1015:3316 T270473 |
[production] |
06:53 |
<marostegui> |
Compress clouddb1020:3315 clouddb1016:3315 T270473 |
[production] |
01:34 |
<legoktm> |
restarted gerrit (T270451) |
[production] |
01:33 |
<legoktm@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on gerrit1001.wikimedia.org with reason: OOM |
[production] |
01:33 |
<legoktm@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:10:00 on gerrit1001.wikimedia.org with reason: OOM |
[production] |