Replies: 1 comment 1 reply
-
Hi, how many unique values are there for columnA, and what is the dtype of it? Regards, Maarten Breddels |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, sorry for all these questions. Really appreciate the help I've gotten here.
I was able to read multiple HDF5 files into a single vaex dataframe containing 1.12 billion rows. However, when I tried to do a
groupby
operation on the entire dataframe, the kernel dies.I tried decreasing the size of the dataframe and found that when I'm trying to access beyond the 4 million-th row of the dataframe, the kernel usually dies. But the operation is always successful for the first 4 million rows.
It does not seem related to the size of the dataframe, because the kernel fails when I'm trying to read 10 random rows beyond the 4 million-th row, e.g.
df[9001000:9001010]
, though there are a few times when reading some other 10 rows is successful. Basically, when reading beyond the 4 million-th row, the kernel dies about 70% of the time, but operation is always successful when reading up to the 4 million-th row. I'm not sure why this is happening.Thank you.
Beta Was this translation helpful? Give feedback.
All reactions