Added Dockerfile #21
I updated the Dockerfile. This is now dependent on I also uploaded the Docker image based on this file. This image does everything in the README except for the nanoGPT & lite setup and the API key setting, so you can run |
Just a few recommendations from an outsider:
|
I've largely incorporated the points @JGalego pointed out, except for using texlive-full and ignoring some of the linter's suggestions (using
The image is here: You can use this image like this:
[interactive]
|
TeX Live Full eats up all the space when trying to run this on Google Project IDX and GitHub Codespaces |
Thank you for your contribution! Indeed, something more lightweight than |
I keep getting NVIDIA CUDA errors. What do I need to install? I get them everywhere I've tried to install this, including locally. |
What version is your CUDA driver? |
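For anyone unsure how to answer the driver question, here is a minimal sketch that asks `nvidia-smi` for the driver version. It assumes `nvidia-smi` is on the PATH (on the host, or inside a container started with `--gpus all`); the helper names `parse_driver_versions` and `query_driver_versions` are made up for this example and are not part of the project.

```python
import shutil
import subprocess
from typing import List, Optional


def parse_driver_versions(output: str) -> List[str]:
    """Split nvidia-smi CSV output into one driver version string per GPU."""
    return [line.strip() for line in output.splitlines() if line.strip()]


def query_driver_versions() -> Optional[List[str]]:
    """Return the driver versions reported by nvidia-smi, or None if unavailable."""
    # Assumption: a missing nvidia-smi binary means no usable NVIDIA driver here.
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=driver_version", "--format=csv,noheader"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return None
    return parse_driver_versions(result.stdout)


if __name__ == "__main__":
    versions = query_driver_versions()
    if versions:
        print("NVIDIA driver version(s):", ", ".join(versions))
    else:
        print("nvidia-smi not found or failed; the NVIDIA driver may not be visible here")
```

If this prints a version on the host but nothing inside the container, the GPU is probably not being passed through to Docker, which would be worth ruling out before reinstalling anything.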
If you want to save the results of your run, you need this command:
It will be inside |
Thanks, changed! :) |
Thank you very much |
In the interactive environment, I still get errors related to CUDA being disabled during model training. I don't know why. |
Any folks interested/willing to help integrate AIScientist into a DeSci DAO, please DM. |
Hello Dear
System: Windows 11 x64
Please Help Me!!
shell command
docker run -d --gpus all --env-file=/app/ai-project/.env -v "/$(pwd)/templates:/app/ai-project/AI-Scientist/templates" t4646/ai-scientist:20240817 --model gpt-4o-2024-05-13 --experiment nanoGPT --num-ideas 2
log message
2024-09-09 07:57:20 Response Status Code: 200
2024-09-09 07:57:20 Response Content: {"total": 12, "offset": 0, "next": 10, "data": [{"paperId": "910aea4a020c329afb8b8a948abaafdf9ebcab3f", "title": "DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks", "abstract": "Variants dropout methods have been designed for the fully-connected layer, convolutional layer and recurrent layer in neural networks, and shown to be effective to avoid overfitting. As an appealing alternative to recurrent and convolutional layers, the fully-connected self-attention lay
2024-09-09 07:57:20 Decision made: novel after round 3
2024-09-09 07:57:20
2024-09-09 07:57:20 Checking novelty of idea 3: adaptive_learning_rate
2024-09-09 07:57:20 Response Status Code: 200
2024-09-09 07:57:20 Response Content: {"total": 707, "offset": 0, "next": 10, "data": [{"paperId": "76488b0743c9553b7b1d7ec46afe107ea60a67ca", "title": "AutoLRS: Automatic Learning-Rate Schedule by Bayesian Optimization on the Fly", "abstract": "The learning rate (LR) schedule is one of the most important hyper-parameters needing careful tuning in training DNNs. However, it is also one of the least automated parts of machine learning systems and usually costs significant manual effort and computing. Though there are pre-defined LR s
2024-09-09 07:57:20 Response Status Code: 200
2024-09-09 07:57:20 Response Content: {"total": 53, "offset": 0, "next": 10, "data": [{"paperId": "8508a2660777b2cf9130530f690f58896b61bdf8", "title": "A Cascade CNN Model based on Adaptive Learning Rate Thresholding for Reliable Face Recognition", "abstract": "Convolutional models may effectively identify persons quickly through automatic face analysis. A convolutional neural network architecture known as the CNN cascade structure uses numerous deep convolution layers to extract hierarchical characteristics from the input image. Ca
2024-09-09 07:57:20 Decision made: novel after round 2
2024-09-09 07:57:20 Processing idea: adaptive_block_size
2024-09-09 07:57:20 Failed to evaluate idea adaptive_block_size: [Errno 2] No such file or directory: 'templates/nanoGPT/run_0/final_info.json'
2024-09-09 07:57:20 Processing idea: attention_dropout
2024-09-09 07:57:20 Failed to evaluate idea attention_dropout: [Errno 2] No such file or directory: 'templates/nanoGPT/run_0/final_info.json'
2024-09-09 07:57:20 Processing idea: adaptive_learning_rate
2024-09-09 07:57:20 Failed to evaluate idea adaptive_learning_rate: [Errno 2] No such file or directory: 'templates/nanoGPT/run_0/final_info.json'
2024-09-09 07:57:20 All ideas evaluated. |
Well, look at the error: 2024-09-09 07:57:20 Failed to evaluate idea adaptive_block_size: [Errno 2] No such file or directory: 'templates/nanoGPT/run_0/final_info.json'. It sounds like the pre-run steps weren't built into the Docker image. Go read the directions and run the steps where it says "you need to run this first". |
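A quick way to confirm this diagnosis before launching is to check for the baseline file yourself. The sketch below only looks for the path named in the error message. The function name `baseline_ready` is made up for this example, and treating `final_info.json` as valid JSON is an assumption based on its extension.

```python
from pathlib import Path
import json


def baseline_ready(template_dir, run_name="run_0"):
    """Return True if <template_dir>/<run_name>/final_info.json exists and parses."""
    info_path = Path(template_dir) / run_name / "final_info.json"
    if not info_path.is_file():
        return False
    try:
        # Assumption: the file is JSON; an unreadable file counts as "not ready".
        json.loads(info_path.read_text(encoding="utf-8"))
    except ValueError:
        return False
    return True


if __name__ == "__main__":
    template = Path("templates") / "nanoGPT"
    if baseline_ready(template):
        print(f"Baseline found under {template / 'run_0'}")
    else:
        print(f"Missing or unreadable {template / 'run_0' / 'final_info.json'}; "
              "run the template's baseline step first")
```

Because the `docker run` command above bind-mounts the host's `templates` directory over the one in the container, the check is most useful when run from the host directory that is being mounted.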
I've added a Dockerfile. However, there's no specific need for this file - I just thought having a Dockerfile might be convenient.
Since installing texlive took a long time for me, I used a texlive-based image for now, but I'm not sure if that's the best approach. Also, I'm not certain whether the RUN commands I included are desirable. If there's a better way to write the file, please feel free to use that instead. I've also created a Docker image for reference:
https://hub.docker.com/repository/docker/t4646/ai-scientist/general
NOTE: I uploaded the newer version