2016-07-04
ยง
|
13:43 |
<akosiaris> |
restart smokeping on netmon1001, temporarily disabled msw1-codfw |
[production] |
13:38 |
<gehel> |
resuming writes on Cirrus / elasticsearch, this did not speedup cluster recovery |
[production] |
13:30 |
<paladox> |
installing Gearman plugin in jenkins on gerrit-test instance |
[git] |
13:18 |
<godog> |
bounce redis on rcs1001 |
[production] |
13:16 |
<gehel> |
restarting elastic1021 for kernel upgrade (T138811) |
[production] |
13:07 |
<elukey> |
Bootstrapping again Cassandra on aqs100[456] (rack awareness + 2.2.6 - testing environment) |
[production] |
13:02 |
<gehel> |
pausing writes on Cirrus / elasticsearch for faster cluster restart |
[production] |
12:50 |
<yuvipanda> |
migrating deployment-tin to labvirt1011 |
[releng] |
12:43 |
<hashar> |
Nodepool back up with 10 instances (instead of 20) to accomodate for labs capacity T139285 |
[production] |
12:39 |
<godog> |
nodetool-b stop -- COMPACTION on restbase1014 |
[production] |
12:37 |
<yuvipanda> |
migrate test-prometheus2 to labvirt1011 |
[monitoring] |
12:33 |
<yuvipanda> |
reduced instances quota to 10 before starting it back up for T139285 |
[contintcloud] |
12:29 |
<moritzm> |
rolling reboot of rcs* cluster for kernel security update |
[production] |
12:10 |
<moritzm> |
rolling reboot of ocg* cluster for kernel security update |
[production] |
11:46 |
<paladox> |
finished migration (Disabled from loading in apache2 for now) will need to be added in sites-e* now deleting phab-03 instance. |
[phabricator] |
11:41 |
<paladox> |
sorry i am migrating it to phab-05 |
[phabricator] |
11:40 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Failover db1053 to db1072 (duration: 00m 40s) |
[production] |
11:39 |
<yuvipanda> |
delete project, is no longer used |
[zulip] |
11:39 |
<paladox> |
migrating 50-phabricator.conf from phab-03 to phab-02. |
[phabricator] |
11:39 |
<tom29739> |
deleted 4 instances that are not being used right now. |
[privpol-captcha] |
11:38 |
<paladox> |
deleting phab-03 instance. To test git redirects please install them on phab-02 instance or git-redirect-01 instance. Reason labs out of space and we can use the rules on the same instance without needing seperate one. |
[phabricator] |
11:37 |
<yuvipanda> |
delete zulip-01, unused |
[zulip] |
11:34 |
<paladox> |
deleting git-phab4 instance not needed and will free space in labs. |
[git] |
11:28 |
<yuvipanda> |
deleted project (after verifying with jynus) |
[ops-db-candidates] |
11:28 |
<yuvipanda> |
deleted project |
[marathon-eval] |
11:18 |
<yuvipanda> |
deleted project |
[marathon] |
11:15 |
<yuvipanda> |
delete instance waah, was totally unused for a long time |
[marathon] |
11:14 |
<yuvipanda> |
stop instance papaultest to free up some resources on labvirt1006 |
[testlabs] |
11:14 |
<yuvipanda> |
stop instance tool-master-02 to free up some resources on labvirt1006 |
[testlabs] |
11:13 |
<yuvipanda> |
delete tools-prometheus-01 to free up resources on labvirt1010 |
[tools] |
11:11 |
<yuvipanda> |
actually deleted instance tools-cron-02 to free up resources on labvirt1010 - was large and not currently used, and failover process takes a while anyway, so we can recreate if needed |
[tools] |
11:10 |
<yuvipanda> |
stopped instance tools-cron-02 to free up some resources on labvirt1010 |
[tools] |
11:10 |
<yuvipanda> |
shutoff instances tool-master-05, tool-exec-05 and tool-exec-07 to save up resources on labvirt1010 |
[tool-renewal] |
11:06 |
<yuvipanda> |
delete matrix-01 to clear up some space on labvirt1010 |
[matrix] |
10:56 |
<moritzm> |
rolling reboot of swift frontends in eqiad for kernel security update |
[production] |
10:43 |
<yuvipanda> |
migrate wm-bot instance to labvirt1011 for T139264 |
[bots] |
10:30 |
<yuvipanda> |
stop nodepool on labnodepool1001 and disable puppet to keep it down, to allow stabilizing labs first |
[production] |
10:28 |
<yuvipanda> |
restart rabbitmq-server on labcontrol1001 |
[production] |
10:14 |
<moritzm> |
installing chromium security update on osmium |
[production] |
10:07 |
<moritzm> |
installing xerces-c security updates on Ubuntu systems (jessie already fixed) |
[production] |
10:01 |
<_joe_> |
stopping jobchron and jobrunner on mw1001-10 before decommission |
[production] |
09:50 |
<godog> |
reimage ms-be300[234] with jessie |
[production] |
09:44 |
<hashar> |
Labs infra cant delete instances anymore (impacts CI as well) T139285 |
[production] |
09:41 |
<paladox> |
<hashar> !log CI is out of Nodepool instances, the pool has drained because instances can no more be deleted over the OpenStack API |
[integration] |
09:41 |
<moritzm> |
installing p7zip security updates |
[production] |
09:38 |
<hashar> |
CI is out of Nodepool instances, the pool has drained because instances can no more be deleted over the OpenStack API |
[production] |
09:25 |
<elukey> |
Added new jobrunners in service - mw130[256].eqiad.wmnet (https://etherpad.wikimedia.org/p/jessie-install) |
[production] |
08:16 |
<moritzm> |
rolling reboot of swift backends in eqiad for kernel security update |
[production] |
07:49 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Failover db1034 to db1062 (duration: 00m 30s) |
[production] |
02:26 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Mon Jul 4 02:26:54 UTC 2016 (duration 5m 42s) |
[production] |