-
Notifications
You must be signed in to change notification settings - Fork 6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Data] Allow generator UDFs for map_batches
and flat_map
#32767
Conversation
map_batches
map_batches
and map
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one minor comment for docs
=================== | ||
Types of UDFs | ||
============= | ||
.. tabbed:: "Function UDFs" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
before the tab, can you mention a summary statement - ie "There are three types of UDFs that you can use with Ray Data: Function ..."
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
updated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM w/ minor comments. Also cc @clarkzinzow for related zero-copy batching work. thanks.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! @c21 I'll be sure to port this to the zero-copy fusion adapters PR.
Signed-off-by: amogkam <amogkamsetty@yahoo.com>
map_batches
and map
map_batches
and flat_map
…ject#32767) UDFs may return very large objects (for example when converting videos stored as bytes into multiple tensors for each window frame). This can lead to large amounts of heap memory usage and possibly cause OOMs. By allowing generator UDFs, users can instead yield chunks of the large object at a time. Closes ray-project#32746 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: Jack He <jackhe2345@gmail.com>
…ject#32767) UDFs may return very large objects (for example when converting videos stored as bytes into multiple tensors for each window frame). This can lead to large amounts of heap memory usage and possibly cause OOMs. By allowing generator UDFs, users can instead yield chunks of the large object at a time. Closes ray-project#32746 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
…ject#32767) UDFs may return very large objects (for example when converting videos stored as bytes into multiple tensors for each window frame). This can lead to large amounts of heap memory usage and possibly cause OOMs. By allowing generator UDFs, users can instead yield chunks of the large object at a time. Closes ray-project#32746 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com>
…ject#32767) UDFs may return very large objects (for example when converting videos stored as bytes into multiple tensors for each window frame). This can lead to large amounts of heap memory usage and possibly cause OOMs. By allowing generator UDFs, users can instead yield chunks of the large object at a time. Closes ray-project#32746 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: elliottower <elliot@elliottower.com>
…ject#32767) UDFs may return very large objects (for example when converting videos stored as bytes into multiple tensors for each window frame). This can lead to large amounts of heap memory usage and possibly cause OOMs. By allowing generator UDFs, users can instead yield chunks of the large object at a time. Closes ray-project#32746 --------- Signed-off-by: amogkam <amogkamsetty@yahoo.com> Signed-off-by: Jack He <jackhe2345@gmail.com>
UDFs may return very large objects (for example when converting videos stored as bytes into multiple tensors for each window frame). This can lead to large amounts of heap memory usage and possibly cause OOMs.
By allowing generator UDFs, users can instead yield chunks of the large object at a time.
Closes #32746
Why are these changes needed?
Related issue number
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.