Welcome! Thanks for posting your first issue. The way things work here is that while customer issues are prioritized, other issues go into our backlog where they are assessed and fitted into the roadmap when suitable. If you need to get this done, consider buying a license which also enables you to use it in your commercial products. More information can be found on https://unidoc.io/
Description
We're attempting to parse the outlines of a few PDF documents, and it appears that the page number is lost for some of them when using the GetOutlines method on the PdfReader object.
The attached BadOutline.pdf document seems to have the Dest field on the OutlineItem set to null, and thus the page numbers are lost. The attached GoodOutline.pdf does not have that problem and is parsed correctly. Additionally, when BadOutline.pdf is parsed with the pypdf2 Python package, the correct page numbers are returned and can be displayed (happy to provide that code as well).
Expected Behavior
Actual Behavior
Attachments
I can reproduce the issue by copying the code from here:
BadOutline.pdf
GoodOutline.pdf
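One possible cause worth checking when comparing the two attachments (an assumption, not verified against BadOutline.pdf): the PDF specification allows an outline item to carry its target either directly in a /Dest entry or indirectly through an /A action of type /GoTo whose /D entry holds the destination, and a destination may also be a name that has to be looked up. A parser that only reads /Dest would report no page for the second and third forms. The sketch below models outline items as plain Python dicts to show the fallback; resolve_page, page_index and named_dests are hypothetical names for illustration and are not part of the unipdf or pypdf2 APIs.

```python
# Simplified, hypothetical model: outline items are dicts keyed by the
# PDF dictionary names (/Dest, /A, /S, /D). Page objects are represented
# by made-up string ids. This is NOT the unipdf or pypdf2 API.

def resolve_page(item, page_index, named_dests=None):
    """Return the 0-based page number an outline item targets, or None."""
    named_dests = named_dests or {}

    # 1. Direct destination on the outline item.
    dest = item.get("/Dest")

    # 2. Fallback: a GoTo action that carries the destination in /D.
    if dest is None:
        action = item.get("/A") or {}
        if action.get("/S") == "/GoTo":
            dest = action.get("/D")

    # 3. Named destination: look the name up to get the explicit array.
    if isinstance(dest, str):
        dest = named_dests.get(dest)

    if not dest:
        return None

    # The first element of an explicit destination array is the page object.
    return page_index.get(dest[0])


# Illustrative data only; the ids and titles are invented.
page_index = {"page-obj-10": 0, "page-obj-20": 1}
named_dests = {"appendix": ["page-obj-20", "/Fit"]}

direct = {"/Title": "Intro", "/Dest": ["page-obj-10", "/Fit"]}
via_action = {
    "/Title": "Chapter 1",
    "/A": {"/S": "/GoTo", "/D": ["page-obj-20", "/XYZ", 0, 792, 0]},
}
by_name = {"/Title": "Appendix", "/Dest": "appendix"}
no_target = {"/Title": "Orphan"}

for entry in (direct, via_action, by_name, no_target):
    print(entry["/Title"], resolve_page(entry, page_index, named_dests))
```

If BadOutline.pdf turns out to use the action or named form, that would be consistent with Dest coming back empty from one library while another still finds the pages; inspecting the raw outline dictionaries of both files would confirm or rule this out.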