-
Notifications
You must be signed in to change notification settings - Fork 96
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Failure when performing ORDER BY desc query with JIT_UNSPILL enabled #714
Comments
Thanks @ChrisJar I can also reproduce. Here is perhaps a simpler reproducer: from dask_cuda import LocalCUDACluster
from dask.distributed import Client
cluster = LocalCUDACluster(n_workers=1, jit_unspill=True)
client = Client(cluster)
import cudf, dask_cudf
from dask_sql import Context
c = Context()
df = cudf.DataFrame({"id":[1,4,4,5,3], "val":[4,6,6,3,8]})
ddf = dask_cudf.from_cudf(df, npartitions=1)
c.create_table("df", ddf)
query = "SELECT * FROM df ORDER BY id desc"
c.sql(query).compute() Seeing |
Looking into it |
#716 contains the fix to this issue. But what's the expected result type for >>> from dask_cuda import LocalCUDACluster
>>> from dask.distributed import Client
>>> cluster = LocalCUDACluster(n_workers=1, jit_unspill=True)
>>> client = Client(cluster)
>>> import cudf, dask_cudf
>>> from dask_sql import Context
>>> c = Context()
>>> df = cudf.DataFrame({"id":[1,4,4,5,3], "val":[4,6,6,3,8]})
>>> ddf = dask_cudf.from_cudf(df, npartitions=1)
>>> c.create_table("df", ddf)
>>> query = "SELECT * FROM df ORDER BY id desc"
>>> c.sql(query).compute()
>>> x = c.sql(query).compute()
>>> x.to_pandas()
id val
4 5 3
3 4 6
2 4 6
1 3 8
0 1 4
>>> type(x)
<class 'dask_cuda.proxify_device_objects.FrameProxyObject'> |
Yes, that is as expected. The proxy objects leaks into userspace, unless setting |
Fixes: #714 This PR registers `percentile_lookup` for `FrameProxyObject` Authors: - GALI PREM SAGAR (https://github.com/galipremsagar) Approvers: - Mads R. B. Kristensen (https://github.com/madsbk) URL: #716
I get an unexpected error when performing the
ORDER BY desc
operation when using dask-sql with a dask-cuda cluster with JIT unspilling enabled.For example:
returns:
Environment:
dask - 2021.8.1
dask-sql - 0.3.10
cudf - 21.10
dask-cudf - 21.10
dask-cuda - 21.10
The text was updated successfully, but these errors were encountered: