Optimise S3 interaction #2466

pomadchin · 2017-11-02T14:44:45Z

We figured out that S3 connections are not parallelized well in the Java AWS SDK.
Measurements showed that on slow connections it pays to parallelize by spawning more threads and splitting the requested data into smaller chunks. The slower the connection, the more noticeable the difference. Given this, we can investigate how to speed up S3 connections that are already fast but probably not fast enough (that is, how to speed up getObject(request).getObjectContent calls).

How can we use Futures, and how can we determine an optimal way to split data into chunks so that everything is parallelized using setRange queries? Does this approach make sense at all?
There is an interesting TransferManager API, which works faster (or should work faster, as my tests were limited), but it only performs highly parallelized downloads from S3 into files. That is its disadvantage: it cannot download into memory. We can consider building an in-memory version of it and verifying that it makes sense and actually affects object download times. There is already an issue in their repo: S3 TransferManager Should Allow Downloading to Stream aws/aws-sdk-java#893
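
The range-splitting idea could be sketched roughly as follows. This is a minimal illustration and was not benchmarked. `RangedDownload`, `RangeFetcher`, `splitRanges` and `download` are hypothetical names introduced here, and the chunk size and thread count are parameters to tune by measurement. With the AWS SDK for Java v1, the fetcher would presumably wrap `s3.getObject(new GetObjectRequest(bucket, key).withRange(start, end)).getObjectContent()`, with the total size taken from `getObjectMetadata(bucket, key).getContentLength()`; the sketch abstracts that behind an interface so it stays self-contained.

```java
import java.io.ByteArrayOutputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical helper, not part of GeoTrellis or the AWS SDK.
final class RangedDownload {

    /** Fetches bytes [start, end], both inclusive, like an HTTP Range request. */
    interface RangeFetcher {
        byte[] fetch(long start, long end);
    }

    /** Splits [0, totalSize) into inclusive {start, end} ranges of at most chunkSize bytes. */
    static List<long[]> splitRanges(long totalSize, long chunkSize) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        List<long[]> ranges = new ArrayList<>();
        for (long start = 0; start < totalSize; start += chunkSize) {
            long end = Math.min(start + chunkSize, totalSize) - 1;
            ranges.add(new long[] {start, end});
        }
        return ranges;
    }

    /** Fetches all ranges concurrently and concatenates them in order, in memory. */
    static byte[] download(RangeFetcher fetcher, long totalSize, long chunkSize, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit every range up front so the requests overlap.
            List<CompletableFuture<byte[]>> parts = new ArrayList<>();
            for (long[] r : splitRanges(totalSize, chunkSize)) {
                parts.add(CompletableFuture.supplyAsync(() -> fetcher.fetch(r[0], r[1]), pool));
            }
            // Join in submission order so the bytes are reassembled correctly.
            ByteArrayOutputStream out = new ByteArrayOutputStream();
            for (CompletableFuture<byte[]> part : parts) {
                byte[] bytes = part.join();
                out.write(bytes, 0, bytes.length);
            }
            return out.toByteArray();
        } finally {
            pool.shutdown();
        }
    }
}
```

Joining the futures in submission order keeps the reassembly simple, at the cost of holding every chunk in memory until its turn comes. Whether this beats a single getObject call would still need to be measured on both slow and fast connections.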

This is a small R&D issue to clarify the AWS S3 API and to double check that we use it efficiently.


pomadchin · 2019-01-07T11:45:07Z

Would be resolved via #2302

pomadchin added the enhancement label Nov 2, 2017

moradology mentioned this issue Apr 25, 2019

AWS SDK v2 #2911

Merged

3 tasks

echeipesh added this to the 3.0 milestone May 7, 2019

pomadchin closed this as completed in #2911 May 30, 2019
