2016-04-29
§
|
12:30 |
<gehel> |
restarting elasticsearch server elastic1018.eqiad.wmnet (T110236) |
[production] |
11:39 |
<elukey> |
soft reboot for mw1119 (not responsive to ssh, root login timed out on the console) |
[production] |
09:43 |
<gehel> |
restarting elasticsearch server elastic1017.eqiad.wmnet (T110236) |
[production] |
09:42 |
<gehel> |
restarting elasticsearch server elastic1016.eqiad.wmnet (T110236) |
[production] |
09:01 |
<jynus> |
changing live configuration of db1049 thread_pool_stall_limit to 10 to test impact on connection timout |
[production] |
08:20 |
<gehel> |
restarting elasticsearch server elastic1016.eqiad.wmnet (T110236) |
[production] |
07:57 |
<elukey> |
puppet disabled on new kafka codfw instances due to errors while starting Event Bus (hosts not in service) |
[production] |
07:54 |
<moritzm> |
enabled base::firewall on stat1002 |
[production] |
07:52 |
<gehel> |
restarting elasticsearch server elastic1015.eqiad.wmnet (T110236) |
[production] |
07:36 |
<godog> |
stop cleanups on restbase1014-b |
[production] |
06:46 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Reduce normal traffic on s2 API servers (duration: 00m 27s) |
[production] |
06:33 |
<jynus@tin> |
Synchronized wmf-config/db-eqiad.php: Repool db1038, increase weight of new hardware slaves db107[4-8] (duration: 00m 33s) |
[production] |
05:42 |
<gehel> |
restarting elasticsearch server elastic1014.eqiad.wmnet (T110236) |
[production] |
05:41 |
<mutante> |
re: "02:29 Krenair: last deployment was slow because of snapshot1007 being offline" it's back, i don't know why, it was powered down and i just tried switching it on. that helped. the command is literally "power on" on HP |
[production] |
05:39 |
<mutante> |
snapshot1007 - was powered down, powering it on. (..connect to mgmt.. "damn it's a HP") |
[production] |
05:34 |
<mutante> |
snapshot1007 - not reachable, duration 10h |
[production] |
04:58 |
<gehel> |
restarting elasticsearch server elastic1013.eqiad.wmnet (T110236) |
[production] |
02:29 |
<Krenair> |
last deployment was slow because of snapshot1007 being offline, icinga shows it's been like that for the last 7 hours |
[production] |
02:22 |
<krenair@tin> |
Synchronized php-1.27.0-wmf.22/extensions/EventBus: https://gerrit.wikimedia.org/r/286115 (duration: 02m 27s) |
[production] |
02:19 |
<mwdeploy@tin> |
sync-l10n completed (1.27.0-wmf.22) (duration: 09m 20s) |
[production] |
00:27 |
<mutante> |
RT - remove libapache2-mod-php5, restart Apache, Perl apps dont need PHP |
[production] |
00:22 |
<aaron@tin> |
Synchronized wmf-config/filebackend-production.php: Set "autoResync" on for local-multiwrite (duration: 02m 29s) |
[production] |
00:15 |
<cwd> |
updated civicrm from 777a91b8f9f6003a3eebdb8f2c73e45cc2bfb4a4 to b386a6821c71310950ccdcdcf2616add727e1af4 |
[production] |
00:04 |
<Dereckson> |
Previous deployment: [[Gerrit:285553]] Enable lazy loaded references in beta (T129693) |
[production] |
00:03 |
<Dereckson> |
Previous deployment: [[Gerrit:285927]] GoogleNewsSitemap configuration (T39608) |
[production] |
00:03 |
<Dereckson> |
Previous deployment: [[Gerrit:252627]] Revert "Increase abusefilter emergency disable threshold on MediaWiki.org" |
[production] |
00:03 |
<Dereckson> |
Previous deployment: [[Gerrit:280865]]+[[Gerrit:285989]] Allow wmf-config/throttle.php to be lenient on ip/IP typo, clean rules (no-op) |
[production] |
00:02 |
<maxsem@tin> |
Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 02m 25s) |
[production] |
00:02 |
<Dereckson> |
Previous deployment: [[Gerrit:279142]] Document FIXME statement in config (no-op) |
[production] |
2016-04-28
§
|
23:59 |
<maxsem@tin> |
Synchronized wmf-config/throttle.php: (no message) (duration: 02m 24s) |
[production] |
23:56 |
<maxsem@tin> |
Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 02m 25s) |
[production] |
23:26 |
<maxsem@tin> |
Synchronized php-1.27.0-wmf.22/extensions/VisualEditor/: (no message) (duration: 02m 30s) |
[production] |
23:17 |
<maxsem@tin> |
Synchronized php-1.27.0-wmf.22/extensions/WikidataPageBanner/: https://gerrit.wikimedia.org/r/286018 (duration: 02m 29s) |
[production] |
23:11 |
<maxsem@tin> |
Synchronized php-1.27.0-wmf.22/extensions/VisualEditor/: https://gerrit.wikimedia.org/r/#q,285769,n,z (duration: 02m 34s) |
[production] |
23:07 |
<maxsem@tin> |
Synchronized php-1.27.0-wmf.22/extensions/UploadWizard/: https://gerrit.wikimedia.org/r/#q,286016,n,z (duration: 02m 34s) |
[production] |
22:01 |
<chasemp> |
reboot of holmium |
[production] |
21:41 |
<twentyafterfour> |
added usleep(200000); to slow down the phabricator import even further. |
[production] |
21:32 |
<twentyafterfour> |
reduced phabricator taskmaster processes to 1 |
[production] |
21:08 |
<gehel> |
restarting elasticsearch server elastic1012.eqiad.wmnet (T110236) |
[production] |
19:47 |
<gehel> |
restarting elasticsearch server elastic1011.eqiad.wmnet (T110236) |
[production] |
19:15 |
<jynus> |
manually rotating db1038's error log |
[production] |
19:10 |
<hashar> |
1.27.0-wmf.22 deployed. Uneventful. |
[production] |
19:00 |
<hashar@tin> |
rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.27.0-wmf.22 |
[production] |
18:42 |
<catrope@tin> |
Synchronized php-1.27.0-wmf.22/extensions/Echo/: Fix fatal T133921 (duration: 00m 32s) |
[production] |
18:19 |
<gehel> |
restarting elasticsearch server elastic1010.eqiad.wmnet (T110236) |
[production] |
18:08 |
<mattflaschen@tin> |
Synchronized wmf-config/db-labs.php: Beta Cluster change (duration: 00m 37s) |
[production] |
17:40 |
<yurik> |
deployed and restarted kartotherian & tilerator |
[production] |
16:57 |
<gehel> |
restarting elasticsearch server elastic1009.eqiad.wmnet (T110236) |
[production] |
16:41 |
<ejegg> |
updated payments-wiki from 16ed5af8c8544ea1c8d837ae16585eba4cbbfd4e to c502ab2f6b6ff914d67503a664d36076fdc32dcf |
[production] |
16:26 |
<twentyafterfour> |
further reduced the queue worker count on phabricator, to relieve stress on mysql m3 db1048 |
[production] |