
[Question]: Does SPU support bigdata or parallel computing #761

Closed
xyz-scorpio opened this issue Jul 9, 2024 · 4 comments

Comments

@xyz-scorpio

Feature Request Type

Performance

Have you searched existing issues?

Yes

Is your feature request related to a problem?


Describe features you want to add to SPU

Just want to know whether SPU supports big data / parallel computing, and if so, in what way it supports such a thing. Thanks.


@tpppppub
Member

There are various granularities of parallelism and vectorization within the SPU. It's not clear what your specific requirements are. How much data do you need to process and what granularity of parallelism do you need?

@xyz-scorpio
Author

xyz-scorpio commented Jul 10, 2024

> There are various granularities of parallelism and vectorization within the SPU. It's not clear what your specific requirements are. How much data do you need to process and what granularity of parallelism do you need?

[image: SPU workflow diagram]

Let us take the SPU workflow as an example. My questions are:
i) What is the upper bound on the data scale that SPU can handle? If I write code to train a model with TensorFlow on very large datasets (e.g. ~TB) from different parties, how does SPU handle that? Will it do data parallelism automatically?
ii) How many resources can an SPU VM take advantage of? Say I have 4 AWS EC2 instances: can one SPU VM use all 4, or just one instance? And what is the parallelism model of the SPU VM?

@anakinxc
Contributor


  1. We do not have a hard limit on data size. It is up to frameworks like TensorFlow to handle such large data properly, through batching or other techniques.
  2. At this point, one SPU VM can only use a single machine. Currently, SPU supports data-level parallelism (DLP) and instruction-level parallelism (ILP) within that one machine.
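To illustrate point 1, here is a minimal, framework-agnostic sketch of caller-side batching: since SPU sets no hard limit on input size, the calling framework streams a large dataset through in fixed-size chunks. The names `iter_batches` and `secure_step` are illustrative, not SPU API; `secure_step` stands in for a function that would be traced and executed on the SPU runtime.

```python
import numpy as np

def iter_batches(data, batch_size):
    """Yield consecutive fixed-size batches of `data` (the unit of data-level parallelism)."""
    for start in range(0, len(data), batch_size):
        yield data[start:start + batch_size]

def secure_step(batch):
    # Stand-in for a computation dispatched to the SPU runtime; here we just
    # compute a per-batch mean so the sketch stays runnable on its own.
    return float(batch.mean())

data = np.arange(10, dtype=np.float64)   # pretend this is a ~TB dataset, loaded lazily
partials = [secure_step(b) for b in iter_batches(data, batch_size=4)]
print(partials)  # → [1.5, 5.5, 8.5]
```

The caller is then responsible for combining the per-batch partial results (e.g. a weighted average for means, or gradient accumulation for training).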

@anakinxc
Contributor

Closing due to inactivity.
