2020-09-30
ยง
|
15:28 |
<moritzm> |
removed librsvg 2.40.20-3+wmf1+stretch1 from component/thumbor, superseded by 2.40.21-0+deb9u1 released via stretch-security |
[production] |
15:15 |
<Operator873> |
@Operator873 Restarted CVNBot19, 16, 10, 9, 8, 7, and 6 |
[cvn] |
14:23 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:22 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:22 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:22 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:22 |
<hnowlan@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:22 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:20 |
<hnowlan@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
14:20 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:20 |
<hnowlan@cumin1001> |
END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) |
[production] |
14:20 |
<hnowlan@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:10 |
<cmjohnson1> |
powering down ores100[3-9 to upgrade memory in each T259909 |
[production] |
14:05 |
<elukey> |
create thirdparty/amd-rocm33 for stretch-wikimedia |
[production] |
14:03 |
<cmjohnson1> |
powering down ores1002 to upgrade memory T259909 |
[production] |
13:55 |
<cmjohnson1> |
powering down ores1001 to upgrade memory T259909 |
[production] |
13:28 |
<arturo> |
enable puppet, reboot and pool back cloudvirt1031 |
[admin] |
13:27 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:27 |
<arturo> |
extend icinga downtimes for another 120 mins |
[admin] |
13:27 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:27 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:27 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:25 |
<arturo> |
deleted a bunch of failed VMs |
[cloudvirt-canary] |
13:15 |
<arturo> |
`aborrero@cloudcontrol1003:~$ sudo nova-manage placement sync_aggregates` after reading a hint in nova-api.log |
[admin] |
13:12 |
<hnowlan> |
started bootstrapping restbase1028-a, first buster restbase host |
[production] |
13:06 |
<arturo> |
creating VM canary1016-01 |
[cloudvirt-canary] |
13:02 |
<arturo> |
rebooting cloudvirt1016 and moving it to the ceph host aggregate |
[admin] |
12:58 |
<arturo> |
creating VM canary1014-01 |
[cloudvirt-canary] |
12:56 |
<arturo> |
creating VM canary1013-01 |
[cloudvirt-canary] |
12:55 |
<arturo> |
rebooting cloudvirt1014 and moving it to the ceph host aggregate |
[admin] |
12:51 |
<arturo> |
rebooting cloudvirt1013 and moving it to the ceph host aggregate |
[admin] |
12:39 |
<arturo> |
root@cloudcontrol1005:~# openstack aggregate add host maintenance cloudvirt1031 |
[admin] |
12:39 |
<marostegui> |
Deploy schema change on db2080, db2081 T264109 |
[production] |
12:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2081', diff saved to https://phabricator.wikimedia.org/P12858 and previous config saved to /var/cache/conftool/dbconfig/20200930-123851-marostegui.json |
[production] |
12:38 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2081', diff saved to https://phabricator.wikimedia.org/P12857 and previous config saved to /var/cache/conftool/dbconfig/20200930-123824-marostegui.json |
[production] |
12:37 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Repool db2080', diff saved to https://phabricator.wikimedia.org/P12856 and previous config saved to /var/cache/conftool/dbconfig/20200930-123753-marostegui.json |
[production] |
12:37 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2080', diff saved to https://phabricator.wikimedia.org/P12855 and previous config saved to /var/cache/conftool/dbconfig/20200930-123659-marostegui.json |
[production] |
12:36 |
<arturo> |
restarted with `bin/stashbot.sh restart` |
[tools.stashbot] |
12:36 |
<arturo> |
rebooted cloudnet1003 (active) a couple of minutes ago |
[admin] |
12:36 |
<arturo> |
move cloudvirt1012 and cloudvirt1039 to the ceph aggregate |
[admin] |
11:49 |
<arturo> |
rebooting cloudvirt1039 |
[admin] |
11:46 |
<arturo> |
rebooting cloudvirt1012 |
[admin] |
11:40 |
<arturo> |
rebooting cloudnet1004 (standby) to pick up https://gerrit.wikimedia.org/r/c/operations/puppet/+/631167 (T262979) |
[admin] |
11:38 |
<arturo> |
[codfw1dev] rebooting cloudnet2002-dev to pick up https://gerrit.wikimedia.org/r/c/operations/puppet/+/631167 |
[admin] |
11:36 |
<arturo> |
[codfw1dev] rebooting cloudnet2003-dev to pick up https://gerrit.wikimedia.org/r/c/operations/puppet/+/631167 |
[admin] |
11:33 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:33 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:33 |
<aborrero@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
11:33 |
<aborrero@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
11:33 |
<effie> |
enable puppet P:mediawiki::mcrouter_wancache for 630845 - T244340 |
[production] |