Replies: 1 comment 2 replies
-
Well I figured out how to get some basic training going. My main problem that was causing the crash was that I needed to set the weights and biases as parameters in the same context. Another problem I was having was trying to just copy over data directly to the tensor's data. You have to set tensor values with the functions for it to work properly, as per the warning in the comments in ggml.h. Still unclear about how exactly to use contexts so everything is fast and efficient, and I am not sure at all about how the backend stuff works, but I'll be experimenting with that at some point. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm relatively new to machine learning and I can't seem to figure out how to train a simple model. I am using the model in https://github.com/ggerganov/ggml/blob/master/examples/mnist/main.cpp, and I can load in the weights and evaluate it properly, but I have no idea how I should go about training it from scratch. I found a few examples of training, but they are either based on complicated networks or use old APIs that don't exist anymore or something.
My main problem is that I don't really know the exact procedure for training and what needs to be done beforehand. Would someone be so kind as to explain the training process with this library, and maybe provide some example code for a simple network such as the one above?
I have a ton of questions and I guess I can just ask them here, maybe it will give a better idea of where I'm at:
I can post code if necessary, but it is in the language Odin.
Beta Was this translation helpful? Give feedback.
All reactions