I see this as a two-part feature: documents that have no text layer will need OCR first.
For searching, I'd suggest two options. The easier one, though less featureful to my knowledge, is PostgreSQL full-text search. The other, which would likely go against the philosophy of keeping things simple, is OpenSearch.
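To make the PostgreSQL option a bit more concrete, here is a rough sketch of what a full-text query could look like. The table `pdf_text` and its columns `pdf_id` and `content` are hypothetical and not taken from PdfDing; `to_tsvector`, `plainto_tsquery`, `ts_rank` and the `@@` operator are standard PostgreSQL, and the `%s` placeholders assume a psycopg-style driver.

```python
def build_search_query(query, limit=20):
    """Build a parameterized PostgreSQL full-text search statement.

    Assumes a hypothetical table pdf_text(pdf_id, content); adjust the
    names to the real schema.
    """
    sql = (
        "SELECT pdf_id, "
        "ts_rank(to_tsvector('english', content), "
        "plainto_tsquery('english', %s)) AS rank "
        "FROM pdf_text "
        "WHERE to_tsvector('english', content) @@ plainto_tsquery('english', %s) "
        "ORDER BY rank DESC "
        "LIMIT %s"
    )
    # The search text is passed twice: once for ranking, once for matching.
    return sql, (query, query, limit)
```

The returned pair could be handed to something like `cursor.execute(sql, params)`. For larger libraries a stored `tsvector` column with a GIN index would probably be preferable to computing `to_tsvector` on every query.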
I have given this issue some thought. I don't think OCR is in the scope of this project. As it is quite resource-intensive, it clashes with the philosophy of keeping things simple and minimal. I think paperless-ngx would be a better fit for this use case.
However, I can see myself working on search for PDFs that don't require OCR, since pypdf, which is already used in this project, can extract text from such files.
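As a minimal sketch of that idea (not how PdfDing implements or will implement it), text could be pulled per page with pypdf's `PdfReader` and `extract_text()`, then matched against a query. The function names `extract_page_texts` and `find_pages` are made up for illustration.

```python
def extract_page_texts(path):
    """Return the extracted text of each page of a PDF that has a text layer."""
    # Imported here so the search helper below works without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(path)
    # Pages without extractable text (e.g. scans) contribute an empty string.
    return [page.extract_text() or "" for page in reader.pages]


def find_pages(page_texts, query):
    """Return 1-based page numbers whose text contains the query, ignoring case."""
    needle = query.casefold()
    if not needle:
        return []
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if needle in text.casefold()
    ]
```

Usage would be along the lines of `find_pages(extract_page_texts("some.pdf"), "invoice")`. This is a plain substring match; scanned PDFs would simply yield no hits, which matches the decision to leave OCR out of scope.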
PS: If you like PdfDing, I would be really happy about a star. Thanks!
That is acceptable. OCR can be handled outside the scope of the project as well, so this request can be narrowed to searching documents that already contain extractable text.