HTTP transport (parallel)

HTTP transport can run in parallel: the workload may be distributed across multiple CPU cores and even across multiple servers.

How it works at a high level

The parent process opens the main connection to Exasol and spawns multiple child processes.
Each child process connects to an individual Exasol node using the http_transport() function, gets a proxy host:port string, and sends it to the parent process.
The parent process collects the list of proxies from the child processes and runs the export_parallel() or import_parallel() function to execute the SQL query.
Each child process executes a callback function and reads its chunk of data from Exasol or writes it to Exasol.
The parent process waits for the SQL query and the child processes to finish.

Please note that PyEXASOL does not provide any specific way to send proxy strings from the child processes to the parent process. You are free to choose your own method of inter-process communication; for example, you may use multiprocessing.Pipe.
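
The hand-off of proxy strings could be sketched with multiprocessing.Pipe roughly as follows. This is a minimal illustration of the inter-process communication only, not PyEXASOL code: open_transport() is a hypothetical stand-in for the real per-node transport setup, the proxy strings it returns are made-up placeholders, and the names child_main() and collect_proxies() are assumptions introduced for this example.

```python
import multiprocessing


def open_transport(node_id):
    """Hypothetical stand-in for the real per-node HTTP transport setup.

    A real child would call PyEXASOL here and obtain the proxy address
    from Exasol; this sketch only fabricates a placeholder string.
    """
    return f"proxy-host-{node_id}:{8000 + node_id}"


def child_main(node_id, conn):
    """Runs in a child process: open the transport, report the proxy string."""
    proxy = open_transport(node_id)
    conn.send(proxy)  # hand the "host:port" string to the parent
    conn.close()
    # A real child would now run its callback to read or write its data chunk.


def collect_proxies(pool_size):
    """Runs in the parent: spawn children and gather one proxy string from each."""
    procs, proxies = [], []
    for node_id in range(pool_size):
        recv_end, send_end = multiprocessing.Pipe(duplex=False)
        proc = multiprocessing.Process(target=child_main, args=(node_id, send_end))
        proc.start()
        send_end.close()  # the parent keeps only the receiving end
        proxies.append(recv_end.recv())  # blocks until the child reports
        recv_end.close()
        procs.append(proc)
    # A real parent would pass `proxies` to export_parallel() or
    # import_parallel() at this point, before waiting for the children.
    for proc in procs:
        proc.join()
    return proxies
```

Call collect_proxies() from inside an `if __name__ == "__main__":` guard, which multiprocessing requires on platforms that spawn child processes instead of forking them. A one-way pipe per child keeps the sketch simple; a multiprocessing.Queue shared by all children would work equally well.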

Examples

example_14 for EXPORT;
example_20 for IMPORT;
example_21 for EXPORT followed by IMPORT using the same child processes.

Example of EXPORT query executed in Exasol

This is how the query looks from Exasol's perspective.

EXPORT my_table INTO CSV
AT 'http://27.1.0.30:33601' FILE '000.csv'
AT 'http://27.1.0.31:41733' FILE '001.csv'
AT 'http://27.1.0.32:45014' FILE '002.csv'
AT 'http://27.1.0.33:42071' FILE '003.csv'
AT 'http://27.1.0.34:36669' FILE '004.csv'
AT 'http://27.1.0.35:36794' FILE '005.csv'
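
To show how a list of proxy strings maps onto the clauses of such a query, here is a small sketch. It is an illustration only and not necessarily how PyEXASOL builds the statement internally; the function name build_export_sql() is an assumption, and the sketch does no identifier quoting or escaping.

```python
def build_export_sql(table, proxies):
    """Render an EXPORT statement with one AT ... FILE ... clause per proxy.

    `proxies` is a list of "host:port" strings, one per child process.
    Files are numbered 000.csv, 001.csv, ... in list order.
    """
    lines = [f"EXPORT {table} INTO CSV"]
    for idx, proxy in enumerate(proxies):
        lines.append(f"AT 'http://{proxy}' FILE '{idx:03d}.csv'")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_export_sql("my_table", ["27.1.0.30:33601", "27.1.0.31:41733"]))
```

With the two addresses above, the output matches the first three lines of the query shown earlier in this section.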

Known problems and limitations

Before PyEXASOL 0.3.26, parallel IMPORT was not fully supported due to the Exasol "N+1 connection" problem described in IDEA-370 and EXA-17055. It could be made to work only with multiple hacks that made the code very ugly.

The IMPORT problem was resolved starting from PyEXASOL 0.3.26. Please upgrade PyEXASOL to take full advantage of this feature.
