Fix PDF rotation bug (Issue #2792) #2816
Merged
…tch-1 Update milvus_kb_service.py
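New features:
- Optimize OCR for PDF files by filtering out meaningless small images, by @liunux4odoo chatchat-space#2525
- Support the Gemini online model, by @yhfgyyf chatchat-space#2630
- Support the GLM4 online model, by @zRzRzRzRzRzRzR
- Update the elasticsearch https connection, by @xldistance chatchat-space#2390
- Improve OCR recognition of PPT and DOC knowledge base files, by @596192804 chatchat-space#2013
- Update the Agent chat feature, by @zRzRzRzRzRzRzR
- Get a connection from the connection pool each time an object is created, to avoid opening a new connection on every method call, by @Lijia0 chatchat-space#2480
- Implement a ChatOpenAI check for whether the token count exceeds the model's context length, by @glide-the
- Update the database runtime error notes and the project milestones, by @zRzRzRzRzRzRzR chatchat-space#2659
- Update config files, docs, and dependencies, by @imClumsyPanda @zRzRzRzRzRzRzR
- Add a Japanese readme, by @eltociear chatchat-space#2787
Fixes:
- PGVector vector store connection error after the langchain update, by @HALIndex chatchat-space#2591
- Minimax's model worker error, by @xyhshen
- ES store could not perform vector retrieval; add mappings to create the vector index, by MSZheng20 chatchat-space#2688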
…patch-1 Update README.md
dosubot (bot) added the size:M label (This PR changes 30-99 lines, ignoring generated files.) Jan 28, 2024
Please submit this to the dev branch; the next version should be 0.3.0.
Changed.
Uh, there is a conflict now. Could you resolve it?
done
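Problem description
For rotated PDFs (page.rotation != 0), the images are passed to OCR in their rotated orientation, so both the content and the formatting of the extracted text are wrong, which leads to poor Q&A results.
Solution
Modify the RapidOCRPDFLoader class: when page.rotation != 0 is detected, rotate each image in img_list by the corresponding angle before sending it to OCR.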
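
The idea can be illustrated with a minimal, dependency-free sketch. This is not the code merged in this PR: the function names `rotate_clockwise` and `upright_image` are hypothetical, a nested list stands in for the image array, and the sketch assumes that the rotation is a multiple of 90 degrees and that a clockwise turn by that angle restores the upright orientation. In the real loader an image library would perform the rotation.

```python
def rotate_clockwise(pixels, degrees):
    """Rotate a row-major pixel grid clockwise by a multiple of 90 degrees."""
    if degrees % 90 != 0:
        raise ValueError("rotation must be a multiple of 90 degrees")
    for _ in range((degrees % 360) // 90):
        # Reversing the row order and then transposing is one clockwise quarter turn.
        pixels = [list(row) for row in zip(*pixels[::-1])]
    return pixels


def upright_image(pixels, page_rotation):
    """Leave images from unrotated pages alone; rotate the others before OCR.

    Assumption: page_rotation plays the role of page.rotation, and turning
    the image clockwise by that angle makes it upright.
    """
    if page_rotation == 0:
        return pixels
    return rotate_clockwise(pixels, page_rotation)


image = [[1, 2],
         [3, 4]]
print(upright_image(image, 90))  # [[3, 1], [4, 2]]
```

Checking for a zero rotation first keeps the common case (unrotated pages) free of any extra image processing.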