-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BUG: OSError: [Errno 24] Too many open files #769
Labels
bug
Something isn't working
Comments
@Hank0626 Can you look at this issue? |
@charliedream1 May I ask if you are using multiple machines together for the task? |
only one machine |
Could you please provide me with your data for testing? |
I used private data. You may reproduce with some open source LLM sft data. Data format is as below: I create a dataframe
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
A clear and concise description of what the bug is.
To Reproduce
To help us to reproduce this bug, please provide information below:
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Add any other context about the problem here.
I met this issue again. Please help to solve.
===================================
I create a dataframe
running code is as below:
data_lst contains 10k+ data.
===================================
I got error as below:
===================================
I did operations as below:
sudo sysctl -w fs.file-max=100000 ulimit -S -n 1048576
It still gives out error. Error comes from len(res['query'].tolist()), so how to parse the result.
Thanks for your reply.
The text was updated successfully, but these errors were encountered: