Searching #14

seang96 · 2024-11-30T18:17:01Z

~~I see this as 2 part, will need OCR for documents that need it.~~

For searching, I'd suggest two options. The easier one, though less featured to my knowledge, is PostgreSQL full-text search. The other, which would likely go against the philosophy of keeping it simple, is OpenSearch.

mrmn2 · 2024-12-05T16:18:42Z

I have given this issue some thought. I don't think OCR is in the scope of this project. As it is quite resource-intensive, it clashes with the philosophy of keeping it simple and minimal. I think paperless-ngx would be a better fit for this use case.

However, I can see myself working on search for PDFs that don't require OCR, since pypdf, which is already used in this project, can extract text from such files.
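As a rough illustration of the idea, here is a minimal sketch of searching PDFs that already carry a text layer. It assumes pypdf's `PdfReader` and `page.extract_text()`; the helper names `extract_page_texts` and `find_pages` are hypothetical and not part of PdfDing, and the plain substring match is only a stand-in for whatever search backend is eventually chosen.

```python
from typing import List


def extract_page_texts(path: str) -> List[str]:
    """Return the text layer of each page (hypothetical helper, assumes pypdf is installed)."""
    from pypdf import PdfReader  # imported lazily so the search helper works without pypdf

    reader = PdfReader(path)
    # extract_text() yields an empty string for pages without a text layer,
    # e.g. scanned pages that would need OCR.
    return [page.extract_text() or "" for page in reader.pages]


def find_pages(page_texts: List[str], query: str) -> List[int]:
    """Return 1-based page numbers whose text contains the query, ignoring case."""
    needle = query.casefold()
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if needle in text.casefold()
    ]
```

Usage would look like `find_pages(extract_page_texts("some.pdf"), "invoice")`. Scanned documents simply produce no matches, which fits the decision to keep OCR out of scope.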

PS: If you like PdfDing, I would be really happy about a star. Thanks!

seang96 · 2024-12-05T18:02:47Z

That is acceptable. OCR can be done outside the project as well, so this request can be narrowed to just searching documents that already contain text.

mrmn2 added the enhancement (New feature or request) label Dec 1, 2024

seang96 changed the title ~~OCR / Searching~~ Searching Dec 18, 2024
