-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hyperparameter #1
Comments
Hello, Thank you for your interest in our work. The hyperparameters used for the chemical-named entity recognition task can be found in Table 6 of the Appendix in our paper. Please note that these hyperparameters are provided for reference and haven't been fine-tuned for optimal performance, so they might not yield the best possible results. Feel free to ask if you have further questions. |
Thanks for your answer, I went through your paper and could not find the number of epochs for which the model is fine-tuned and also I could find where I should pass steps in the fine-tuned training argument. Again, for molecular property prediction, you replace the ring and branch in the smiles, how do you do this in your preprocessing stage? |
The number of epochs used for fine-tuning the model is 20, and the conversion from SMILES to SELFIES is performed using the package provided by the project found at https://github.com/aspuru-guzik-group/selfies. You do not need to input a specific number of steps. The model should handle this automatically based on the number of epochs and the size of your dataset. |
Thanks, I finetuned the model on my custom NER dataset using the same prompt template and instructions as the ones used for the chemical entity recognition task in your work. However, when I attempted to generate output using your "generate.py" script, I received the same output multiple times. Could you please explain why this is happening? |
Hi, Your issue is fairly common when using LLaMa. You might be able to circumvent this problem by adjusting some hyperparameters, such as:
In addition, you can refer to the original aplaca-lora project for more insights or methods of implementation. |
Thanks, |
|
Hi,
What are the hyperparameters used for chemical-named entity recognition?
The text was updated successfully, but these errors were encountered: