2015-12-03
§
|
20:55 |
<oblivian@tin> |
Synchronized wmf-config/CommonSettings.php: Fix the jobqueue on wikitech (duration: 00m 47s) |
[production] |
20:45 |
<_joe_> |
opening connection from mw1001 to silver, mysql |
[production] |
20:29 |
<ori> |
on palladium: salt -G 'cluster:jobrunner' cmd.run 'service jobrunner status | grep running && service jobrunner restart' ; salt -G 'cluster:jobrunner' cmd.run 'service jobchron status | grep running && service jobchron restart' |
[production] |
20:28 |
<ori> |
ran srem jobqueue:aggregator:s-wikis:v2 labswiki on rdb1001 aggr |
[production] |
19:41 |
<bblack> |
disabling pybal on lvs100[123] over the next few minutes (for reinstall to jessie later after confirmation everything is still ok on [456]) |
[production] |
19:10 |
<jynus> |
restarting eventlogging_sync on db1047 and dbstore1002 |
[production] |
19:04 |
<jynus> |
starting m4 slave again on dbstore2002 |
[production] |
18:45 |
<andrewbogott> |
disabling puppet on labcontrol1002 to test openldap with pdns |
[production] |
18:33 |
<mutante> |
neon - remove icinga user from "dialout" group |
[production] |
18:27 |
<jynus> |
disabling eventlogging_sync process on dbstore1002 and db1047 and replication on the other m4 slaves |
[production] |
18:18 |
<jynus> |
disabling event scheduler on db1046 (m4-master) |
[production] |
17:03 |
<kartik@tin> |
Finished scap: Update ContentTranslation (duration: 05m 52s) |
[production] |
16:57 |
<kartik@tin> |
Started scap: Update ContentTranslation |
[production] |
16:50 |
<oblivian@tin> |
Synchronized wmf-config/CommonSettings.php: Fix the jobqueue on wikitech (duration: 00m 28s) |
[production] |
15:23 |
<andrewbogott> |
stopping pdns on labcontrol2001 |
[production] |
15:11 |
<moritzm> |
restarting cassandra on restbase100[56] (subsequently) to effect openjdk security update |
[production] |
14:57 |
<mobrovac> |
restbase end of deployment of 262da91a |
[production] |
14:48 |
<mobrovac> |
restbase start deployment of 262da91a |
[production] |
14:06 |
<moritzm> |
installed dpkg updates across the cluster |
[production] |
11:35 |
<moritzm> |
restarting cassandra on aqs cluster (subsequently) to effect openjdk security update |
[production] |
10:51 |
<jynus> |
restarting, upgrading and general maintenance for es1013 (depooled) |
[production] |
10:36 |
<_joe_> |
imported dh-python into precise/universe from the ubuntu cloud archive |
[production] |
10:26 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Depool es1013 for maintenance (duration: 00m 30s) |
[production] |
05:50 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Thu Dec 3 05:50:08 UTC 2015 (duration 50m 7s) |
[production] |
02:25 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.7) (duration: 09m 54s) |
[production] |
2015-12-02
§
|
22:09 |
<jynus> |
unscheduled restart of dbstore1002 (analytics-slave) |
[production] |
21:44 |
<jynus> |
disabling all alert notifications for dbstore1002 |
[production] |
21:30 |
<bblack> |
rebooting lvs1007 for interface config test (not active, no BGP) |
[production] |
20:37 |
<Jeff_Green> |
enable mail queue monitoring for fundraising |
[production] |
19:13 |
<jynus> |
restarting mysql on db2067 to test a configuration change |
[production] |
18:07 |
<mutante> |
mw1136 - hhvm restart |
[production] |
14:45 |
<Coren> |
deploying cleanup of labs PAM configuration - this should be a functional noop but may cause some puppet noise |
[production] |
12:41 |
<akosiaris> |
restart cassandra on maps-test200{1,2,3,4}.codfw.wmnet |
[production] |
11:23 |
<moritzm> |
restarting cassandra on restbase100[78] (subsequently) (to effect openjdk security updates plus related libs) |
[production] |
11:06 |
<moritzm> |
restarting cassandra on restbase100[2-4] (subsequently) (to effect openjdk security updates plus related libs) |
[production] |
11:04 |
<moritzm> |
restart cassandra on restbase1001 (to effect openjdk security updates plus related libs) |
[production] |
10:21 |
<aude@tin> |
Synchronized wmf-config/InitialiseSettings.php: Enabling data access for wikinews, wikispecies and mediawiki.org (duration: 00m 27s) |
[production] |
10:20 |
<aude@tin> |
Synchronized dblists/arbitraryaccess.dblist: Enabling data access for wikinews, wikispecies and mediawiki.org (duration: 00m 30s) |
[production] |
10:14 |
<_joe_> |
clearing the job cache for ocg1003 |
[production] |
09:22 |
<_joe_> |
stopped ocg service on ocg1003 |
[production] |
08:38 |
<_joe_> |
planet1001 recovered as soon as I did get into console via gnt-instance |
[production] |
06:25 |
<ori@tin> |
Synchronized php-1.27.0-wmf.7/extensions/NavigationTiming: Idb675cdce: Add isHiDPI and isHttp2 properties; drop isHttps (T119014) (duration: 00m 50s) |
[production] |
05:53 |
<l10nupdate@tin> |
ResourceLoader cache refresh completed at Wed Dec 2 05:53:21 UTC 2015 (duration 53m 20s) |
[production] |
02:27 |
<mutante|away> |
labcontrol2001 - disable puppet, kill from puppet stored configs |
[production] |
02:26 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.7) (duration: 10m 47s) |
[production] |
00:29 |
<mutante> |
puppetstoredconfigclean.rb labcontrol2001.wikimedia.org fixes icinga config |
[production] |