Implement N4409 on top of HPX #1141

hkaiser · 2014-05-31T15:09:58Z

Implement N3989 (http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2014/n3989.html) on top of HPX. This finally would be the first step to expose parallel algorithms to application developers.

There is an updated version of the proposal document here (N4409): http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2015/n4409.pdf.

As always, this implementation will add HPX specific functionality. Several extensions are obvious right away:

Add a new execution policy task_execution_policy on top of what the proposal mandates. The difference to the already described parallel_execution_policy would be that all algorithms would return a future<> representing the original result.
Add possibility to specify HPX executors to be used with the par and task execution policies. This could be done by adding par(exec) and task(exec) as valid execution policy arguments.
Add functionality enabling the use of all algorithms for distributed use cases (see also Extend parallel algorithms to work with hpx::partitioned_vector et.al. #1338).

Another possible extension needs more investigation:

Allow for all algorithms to be invoked with sequences of futures themselves, where the predicates/operators are invoked only after the corresponding future became ready.

Here is the list of algorithms mandated by the proposal:

These were added by N4310:

transform_reduce
transform_exclusive_scan transform_inclusive_scan (see Exclusive scan #1341, Inclusive scan #1342)

These were added to C++20:

shift_left shift_right (Add shift_left and shift_right algorithms #5466)
Add starts_with and ends_with algorithms (Add starts_with and ends_with algorithms #5381)

The text was updated successfully, but these errors were encountered:

Reduce by key, extends #1141

diehlpk · 2017-01-24T09:46:56Z

@hkaiser Could you please add a project description here https://github.com/STEllAR-GROUP/hpx/wiki/GSoC-2017-Project-Ideas

taeguk · 2017-03-25T08:34:26Z

I'm preparing GSoC. I have a question.
When implementing parallel algorithms, can I allocate additional memory for optimization or parallelization of algorithms?
For some algorithms, there may be differences in implementation depending on whether additional memory allocation is allowed or not.

hkaiser · 2017-03-25T12:15:46Z

When implementing parallel algorithms, can I allocate additional memory for optimization or parallelization of algorithms?

That's a very good and controversial question. The maximum memory requirements for the parallel algorithms are not specified, only the computational complexity. While allocating memory itself does not change the computational complexity of an algorithm, often the fact that you allocate some intermediate buffer requires more data copying which in turn may change the complexity.

So the first rule of the game is not to exceed the complexity requirements as specified (e.g. don't make an algorithm O(N) if it's supposed to be O(logN), etc.). If an additional allocation does not (indirectly) change the complexity, then please make sure (that even for large data arrays) this does not blow the memory requirements out of proportion.

Generally, I'd suggest to try to avoid memory allocations as much as possible (in the first step) and use the implementation allowing to do things without. I understand that additional allocations may improve the algorithm performance, or even it's complexity, but I'd like to make an implementation correct first before diving into possible optimizations.

I know I have not given a concrete answer to your question. I guess it's a case by case decision we'll have to make as we go.

hkaiser · 2017-03-25T12:20:53Z

@taeguk Most of the missing algorithms are usually implemented based on a variation of a parallel scan. We already have a handful algorithms based on a scan_partitioner (e.g. copy_if, remove_copy, inclusive_scan, etc.). I'd expect for the rest of those to be easily implementable using this very same (and already existing) scan_partitioner. I'd suggest for you to familiarize yourself with how we have implemented those existing algorithms as reusing the partitioner would significantly simplify implementing the missing algorithms.

msimberg · 2017-11-21T12:09:24Z

Marked inplace_merge as done because #2978 was merged.

victor-ludorum · 2018-02-18T15:04:08Z

Hello @hkaiser !! As Many algorithms are already implemented . But I have made one list of the unimplemented algorithms .
copy_backward
equal_range
is_permutation
lower_bound
upper_bound
move_backward
prev_permutation
next_permutation
nth_element
partition_point
pop_heap
push_heap
sort_heap
stable_sort
partial_sort

and as we have one numeric adjacent_difference (numeric algorithm) ,
These three numeric algorithms can also be implemented
accumulate
inner_product
partial_sum
As I have started learning about HPX , I have checked that these algorithms haven't been implemented yet. Is the implementation of these algorithms are not important for parallelism and concurrency ?

hkaiser · 2018-02-18T18:33:07Z

@victor-ludorum the algorithms listed here above are the ones specified for C++17.

accumulate, inner_product, and partial_sum are listed under a different name (reduce, transform_reduce, and inclusive_scan).

Somebody already tried to implement the heap algorithms, but that was abandoned (see #1914), feel free to revive that effort.

The algorithms related to sort are listed as not-implemented in our list above (nth_element, partial_sort, etc.), feel free to take those on.

I'm not sure if it's possible to parallelize the permutation algorithms.

copy_backwards and move_backwards can easily be implemented on top of the existing copy and move algorithms (it requires at least or bi-directional iterators, so you could wrap the given iterators into reverse_iterator), alternatively we'd need a separate (but similar to copy/move) implementation.

I don't know what is the difference between partition and partition_point.

victor-ludorum · 2018-02-18T18:44:50Z

Thanks @hkaiser sir !! I will definitely work on these algorithms. So numeric algorithms are already implemented . Remaining algorithms which is important can be implemented , I hope.

hkaiser · 2020-07-30T15:45:37Z

@fjtapia Just out of curiosity, we still have a couple of algorithms missing that are related to sorting (partial_sort, partial_sort_copy, and nth_element). Do you happen to have implementation available for those? Even some initial code would be very helpful. Our plan is to finally have all of the algorithms as specified by C++20 in place and these are the last missing pieces. Any help you could give would be most appreciated!

fjtapia · 2020-07-30T17:08:36Z

Hi Hartmut Glad to contact you again. I will prepare the implementation of the functions and send it to you to examine. But it will be at the end of August because in two days I am going on vacation. Please send me a list of the functions to implement. If they are single-thread or parallel, and if they have any conditions that must be taken into consideration. Regards Francisco El jue., 30 jul. 2020 a las 17:45, Hartmut Kaiser (<notifications@github.com>) escribió:

…

@fjtapia <https://github.com/fjtapia> Just out of curiosity, we still have a couple of algorithms missing that are related to sorting ( partial_sort, partial_sort_copy, and nth_element). Do you happen to have implementation available for those? Even some initial code would be very helpful. Our plan is to finally have all of the algorithms as specified by C++20 in place and these are the last missing pieces. Any help you could give would be most appreciated! — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#1141 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AA5O5GCKZPO5VOA4QRB67UTR6GITHANCNFSM4AP742IQ> .

hkaiser · 2020-07-30T17:47:01Z

Hi Hartmut Glad to contact you again. I will prepare the implementation of the functions and send it to you to examine. But it will be at the end of August because in two days I am going on vacation. Please send me a list of the functions to implement. If they are single-thread or parallel, and if they have any conditions that must be taken into consideration. Regards Francisco

Francisco, thanks for getting back so quickly, and thanks for your interest in helping! As said in my initial message, we are missing the implementations for the parallel versions of the following algorithms: partial_sort, partial_sort_copy, and nth_element (I believe I would be able to derive the partial_sort_copy from a partial_sort implementation myself, if needed - if that simplifies things). Also, we can always fall back to the std library versions for the sequential algorithms, so there is no need for you to look into those.

hkaiser · 2021-11-08T12:45:16Z

This has finally been done! Thanks to everybody who contributed to this task!

hkaiser added category: core labels May 31, 2014

hkaiser added this to the 0.9.9 milestone May 31, 2014

hkaiser assigned Syntaf Jun 8, 2014

sithhell mentioned this issue Jul 8, 2014

Avoid fork/join parallelism in implementations of N3989 #1185

Open

hkaiser mentioned this issue Jul 13, 2014

Parallel equal #1193

Merged

Syntaf changed the title ~~Implement N3989 on top of HPX~~ Implement N4071 on top of HPX Jul 31, 2014

hkaiser mentioned this issue Aug 2, 2014

Adding parallel::reverse and parallel::reverse_copy #1208

Merged

hkaiser mentioned this issue Aug 13, 2014

Remaining find algorithms implemented, N4071 #1223

Merged

hkaiser modified the milestones: 0.9.10, 0.9.9 Sep 13, 2014

hkaiser mentioned this issue Sep 13, 2014

parallel::copy_if is broken #1220

Closed

hkaiser mentioned this issue Dec 25, 2014

Extend parallel algorithms to work with hpx::partitioned_vector et.al. #1338

Open

36 tasks

hkaiser changed the title ~~Implement N4071 on top of HPX~~ Implement N4310 on top of HPX Dec 28, 2014

hkaiser added in progress and removed in progress labels Jan 7, 2015

hkaiser modified the milestones: 1.0.0, 0.9.10 Feb 25, 2015

hkaiser changed the title ~~Implement N4310 on top of HPX~~ Implement N4352 on top of HPX Mar 5, 2015

hkaiser changed the title ~~Implement N4352 on top of HPX~~ Implement N4409 on top of HPX May 11, 2015

dcbdan mentioned this issue Jul 16, 2015

Implemented inner_product and adjacent_diff algos #1659

Merged

hkaiser mentioned this issue Dec 7, 2015

version 1.0? #1896

Closed

hkaiser added a commit that referenced this issue Apr 22, 2016

Merge pull request #2097 from STEllAR-GROUP/reduce_by_key

82bb1c2

Reduce by key, extends #1141

hkaiser mentioned this issue Sep 26, 2016

Implemented parallel::stable_partition #2345

Merged

hkaiser added the project: GSoC label Jan 19, 2017

hkaiser modified the milestones: 1.0.0, 1.1.0 Apr 23, 2017

taeguk mentioned this issue Jul 21, 2017

Implement parallel::partition. #2778

Merged

8 tasks

taeguk mentioned this issue Aug 28, 2017

Implement parallel::unique. #2867

Merged

6 tasks

msimberg removed this from the 1.1.0 milestone Nov 21, 2017

taeguk mentioned this issue Dec 24, 2017

Implement parallel::remove and parallel::remove_if #3086

Merged

4 tasks

hkaiser mentioned this issue Jan 11, 2018

Parallel sorting algorithms Morwenn/cpp-sort#22

Open

hkaiser mentioned this issue Feb 18, 2019

Add shift_left and shift_right algorithms #3706

Closed

hkaiser added the tag: pinned Never close as stale label Jun 30, 2019

hkaiser unassigned Syntaf Jul 5, 2020

hkaiser mentioned this issue Jul 28, 2020

HEP: conformance to C++20 #4871

Closed

21 tasks

hkaiser added this to the 1.6.0 milestone Aug 4, 2020

msimberg removed this from the 1.6.0 milestone Jan 5, 2021

hkaiser closed this as completed Nov 8, 2021

hkaiser added this to the 1.8.0 milestone Nov 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement N4409 on top of HPX #1141

Implement N4409 on top of HPX #1141

hkaiser commented May 31, 2014 •

edited

Loading

diehlpk commented Jan 24, 2017

taeguk commented Mar 25, 2017

hkaiser commented Mar 25, 2017 •

edited

Loading

hkaiser commented Mar 25, 2017

msimberg commented Nov 21, 2017

victor-ludorum commented Feb 18, 2018

hkaiser commented Feb 18, 2018 •

edited

Loading

victor-ludorum commented Feb 18, 2018

hkaiser commented Jul 30, 2020

fjtapia commented Jul 30, 2020 via email

hkaiser commented Jul 30, 2020

hkaiser commented Nov 8, 2021

Implement N4409 on top of HPX #1141

Implement N4409 on top of HPX #1141

Comments

hkaiser commented May 31, 2014 • edited Loading

diehlpk commented Jan 24, 2017

taeguk commented Mar 25, 2017

hkaiser commented Mar 25, 2017 • edited Loading

hkaiser commented Mar 25, 2017

msimberg commented Nov 21, 2017

victor-ludorum commented Feb 18, 2018

hkaiser commented Feb 18, 2018 • edited Loading

victor-ludorum commented Feb 18, 2018

hkaiser commented Jul 30, 2020

fjtapia commented Jul 30, 2020 via email

hkaiser commented Jul 30, 2020

hkaiser commented Nov 8, 2021

hkaiser commented May 31, 2014 •

edited

Loading

hkaiser commented Mar 25, 2017 •

edited

Loading

hkaiser commented Feb 18, 2018 •

edited

Loading