5901-5950 of 10000 results (35ms)
2016-09-20 §
20:16 <Krenair> enabled trusty-backports on deployment-puppetmaster [releng]
19:00 <thcipriani> cherry-picked https://gerrit.wikimedia.org/r/#/c/311760/ to deployment-puppetmaster to fix failing beta-scap-eqiad job, had to manually start rsync, puppet failed to start [releng]
18:38 <hashar> on tin: `sudo -u jenkins-deploy -H SSH_AUTH_SOCK=/run/keyholder/proxy.sock ssh mwdeploy@deployment-mira02.deployment-prep.eqiad.wmflabs` - T144006 [releng]
18:33 <hashar> on deployment-mira02 ran `sudo -u jenkins-deploy -H SSH_AUTH_SOCK=/run/keyholder/proxy.sock ssh mwdeploy@deployment-mediawiki04.deployment-prep.eqiad.wmflabs` per T144006 [releng]
18:01 <marxarelli> deployed mediawiki-config changes on beta cluster. back in read/write mode using new database instances [releng]
17:37 <marxarelli> deployment-db04 restored from backup and replication started [releng]
16:54 <marxarelli> upgraded package and data to mariadb 10 on deployment-db03 [releng]
16:31 <marxarelli> cherry picking operations/puppet patches (T138778) to deployment-puppetmaster [releng]
16:30 <moritzm> rebooting deployment-mira02 [releng]
16:23 <marxarelli> applied innodb transaction logs to deployment-db1 backup and successfully restored on deployment-db03 [releng]
15:47 <marxarelli> completed innobackupex on deployment-db1. copying backup to deployment-db03 for restoration [releng]
14:54 <hashar> beta: cherry picking fix up for the jobrunner logging https://gerrit.wikimedia.org/r/#/c/311702/ and https://gerrit.wikimedia.org/r/311719 T146040 [releng]
14:44 <marxarelli> entering read-only mode on beta cluster [releng]
14:27 <elukey> stopped puppet, jobrunner and jobchron on deployment-jobrunner01 [releng]
14:20 <marxarelli> disabling beta cluster jenkins jobs in preparation for data migration (T138778) [releng]
13:07 <godog> add deployment-prometheus01 instance T53497 [releng]
11:20 <elukey> applied beta::deployaccess, role::labs::lvm::srv, role::mediawiki::jobrunner to jobrunner02 [releng]
10:45 <elukey> created deployment-jobrunner02 in deployment-prep [releng]
2016-09-19 §
22:01 <legoktm> shutdown integration-puppetmaster [releng]
21:29 <yuvipanda> regenerated client certs only on integration-puppetmaster01, seems ok now [releng]
20:46 <yuvipanda> re-enable puppet everywhere [releng]
20:43 <yuvipanda> enable puppet and run on integration-slave-trusty-1003.eqiad.wmflabs [releng]
20:42 <yuvipanda> accidentally deleted /var/lib/puppet/ssl on integration-puppetmaster01 as well, causing it to lose keys. Reprovision by pointing to labs puppetmaster [releng]
20:34 <yuvipanda> rm -rf /var/lib/puppet/ssl on all integration nodes [releng]
20:34 <yuvipanda> copied /etc/puppet/puppet.conf from integration-trusty-slave-1001 to all integration [releng]
20:25 <yuvipanda> delete /etc/puppet/puppet.conf.d/10-self.conf and /var/lib/puppet/ssl on integration-slave-trusty-1001 [releng]
20:20 <yuvipanda> re-enabled puppet on integration-slave-trusty-1001 [releng]
20:08 <yuvipanda> reset puppetmaster of integration-puppetmaster01 to be labs puppetmaster [releng]
20:03 <yuvipanda> disable puppet across integration project, moving puppetmasters [releng]
19:49 <legoktm> creating T144951 enabled role::puppetmaster::standalone role on integration-puppetmaster01 [releng]
19:33 <legoktm> creating T144951 integration-puppetmaster01 instance using m1.small and debian jessie [releng]
15:11 <hashar> beta: updating jobrunner service 0dc341f..a0e8216 [releng]
2016-09-17 §
07:11 <legoktm> deploying https://gerrit.wikimedia.org/r/311024 [releng]
2016-09-16 §
21:03 <hashar> deployment-tin did a git gc on /srv/deployment/ores That freed up disk space and cleared an alarm on co master mira02 [releng]
21:00 <hashar> deleted deployment-parsoid05 [releng]
20:52 <hashar> fixed puppet on deployment-parsoid05 . Temporary instance will delete it later to clear out shinken.wmflabs.org [releng]
20:27 <hashar> beta: force running puppet in batches of 4 instances: salt --batch 4 -v 'deployment-*' cmd.run 'puppet agent -tv' [releng]
20:13 <hashar> beta: restarted puppetmaster [releng]
20:07 <hashar> beta: salt -v '*' cmd.run 'rm -fR /var/lib/puppet/client/ssl/' [releng]
20:07 <hashar> beta: stopping puppetmaster, rm -f /var/lib/puppet/server/ssl/ca/signed/* [releng]
19:53 <hashar> beta created instance "deployment-parsoid05" Should be deleted later, that is merely to purge the hostname from Shinken ( http://shinken.wmflabs.org/host/deployment-parsoid05 ) [releng]
11:42 <hashar> beta: apt-get upgrade on deployment-jobrunner01 [releng]
11:36 <hashar> apt-get upgrade on deployment-tin , bring in a new hhvm version and others [releng]
2016-09-15 §
22:29 <legoktm> sudo salt '*precise*' cmd.run 'service mysql start', all mysql's are down [releng]
16:45 <godog> install xenial kernel on deployment-zotero01 and reboot T145793 [releng]
16:18 <hashar> prometheus enabled on all beta cluster instance. Does not support Precise hence puppet will fail on the last two Precise instances deployment-db1 and deployment-db2 until they are migrated to Jessie T138778 [releng]
15:53 <godog> add role::prometheus::node_exporter to classes in hiera:deployment-prep T144502 [releng]
15:10 <hashar> beta: Applying puppet class role::prometheus::node_exporter to mira02 just like mira. That is for godog [releng]
15:08 <hashar> T144006 Disabled Jenkins job beta-scap-eqiad. On mira02 rm -fR /srv/* . Applying puppet for role::labs::lvm::srv [releng]
15:05 <hashar> T144006 Applying class role::labs::lvm::srv to mira02 (it is out of disk space :D ) [releng]