Replies: 1 comment
-
Thank you for your question. Things have changed a bit since I gave this talk, but at that time I was referring to the active learning strategy from the OLALA paper. You can find an implementation here. I took the core of the strategy and re-implemented a version that integrates with the labeling tool Prodigy.

Regarding your question about datasets: the layout models provided here are trained on PubLayNet for about 3 epochs. This gives you an entry checkpoint from which to resume training on your own data. I haven't provided the exact training script in the docs, but you can find the training script for cell detection based on PubTabNet here, which is very similar. Hope this helps.
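
To make the general idea of sample-efficient selection concrete, here is a minimal sketch of uncertainty-based sampling. It is not the OLALA strategy and not the Prodigy integration mentioned above. The function names (`page_uncertainty`, `select_for_labeling`), the input format (page id mapped to a list of detection confidences) and the least-confidence score are all assumptions chosen for illustration.

```python
from typing import Dict, List


def page_uncertainty(scores: List[float]) -> float:
    """Least-confidence score for one page: 1 minus the mean detection confidence."""
    if not scores:
        # Assumption: a page without any detections is treated as maximally uncertain.
        return 1.0
    return 1.0 - sum(scores) / len(scores)


def select_for_labeling(predictions: Dict[str, List[float]], budget: int) -> List[str]:
    """Return the ids of the `budget` most uncertain pages, most uncertain first."""
    ranked = sorted(
        predictions,
        key=lambda page_id: page_uncertainty(predictions[page_id]),
        reverse=True,
    )
    return ranked[:budget]


if __name__ == "__main__":
    # Hypothetical confidences produced by a layout model on unlabeled pages.
    preds = {
        "page_1": [0.98, 0.95],
        "page_2": [0.55, 0.60, 0.40],
        "page_3": [],
    }
    print(select_for_labeling(preds, budget=2))
```

The selected pages would then be sent to the labeling tool, and the model fine-tuned on the newly annotated samples before the next selection round.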
-
Hi @JaMe76 ,
In your talk at PyConDE you mentioned using active learning to make fine-tuning more sample-efficient. May I ask which libraries you used for that?
I was also wondering which datasets the out-of-the-box models were trained on. Where can I find this information?
Thanks,
Szilard