Releases: UglyToad/PdfPig
0.1.2 Third Alpha
- Many updates to document layout analysis algorithms
- Bugfix for files with a large number of non-data trailing bytes
- Bugfix for OpenType fonts
- Paths and glyphs are now correctly rotated when the page itself has a rotation value
0.1.2 Second Alpha
Adds letter font details and a couple of other bugfixes to the alpha version.
0.1.2 First Alpha
First alpha version of 0.1.2
Cows That Move Backwards And Forwards
Many bug fixes for a whole range of document types. In addition:
- Add support for JPG images in PdfDocumentBuilder using
page.AddJpeg()
. - Access to marked content using
page.GetMarkedContents()
- Early access to PDF merging using
PdfMerger.Merge()
- Adds Doc-Comments back to the package.
- Improvements to NearestNeighbourWordExtractor and other Document Layout Analysis classes to support rotated text.
0.1.1 First Alpha
A whole bunch of bug fixes and other changes.
And It Comes Out As MIWK
This version focuses on improving performance.
To enable this it replaces decimals with doubles for most of the public API. It also reorganizes the code internally to support access to font related classes.
For this reason consumers will need to update their code, see the migration guide on the wiki.
Other features:
- Access to hyperlinks provides a convenience wrapper for retrieving annotations of type
Link
and their text content and destination. Usepage.GetHyperlinks()
. - Bug fixes for glyph positions.
- Access to the embedded files in the document. Use
document.Advanced.TryGetEmbeddedFiles(out IReadOnlyList<EmbeddedFile> files)
. - Ability to provide a list of passwords to try when opening encrypted documents. Use
ParsingOptions.Passwords
to provide the list of passwords. Any password set inParsingOptions.Password
will be included in the list of passwords. - Many bug fixes for different documents.
0.1.0 Second Beta
Updates the 0.1.0 beta version with many bug fixes.
0.1.0 First Beta
First release which moves internal numerics from decimal
to double
where appropriate.
Reorganises internal project structure.
See migration details in the wiki: https://github.com/UglyToad/PdfPig/wiki/Migration-0.0.X-to-0.1.0
Farms With Fields Which Cross The Border
This release fixes a major performance regression in 0.10.0.
It also adds bug-fixes for several new issues as well as additional methods for the geometry objects PdfPath
, PdfLine
and PdfRectangle
.
Mixed Together With Whiskey
This release adds two main new features:
- Access to form elements (AcroForms) such as text input, checkboxes, radio-buttons, etc. Use
document.TryGetForm(out AcroForm form)
to get the form for the document if it contains one. - Access to bookmarks which define the document structure by linking to chapters, etc. Use
document.TryGetBookmarks(out Bookmarks bookmarks)
to get the document's bookmarks tree if it contains one.
It also aims to improve performance for most content retrieval operations resulting in up to double speed for the smallest documents.
It also adds bug-fixes, structure analysis tools and small improvements:
- Adds
document.GetPages()
as a convenience method to enumerate all pages in a document. - Adds hOcr, AltoXml and PageXml format exporters to export the page content to standardized formats which can be used in other tools. These exporters implement the
ITextExporter
interface and are used to export each page to a compatible string. - Improves support for retrieving images from a page. The new
page.GetImages()
method enumerates all images on a page, images are eitherInlineImage
s orXObjectImage
s. - Adds support for extracting text which is defined in XObject forms (distinct from AcroForms) which was previously skipped, meaning text could have been missing from the
page.Text
on certain document types. - Adds support for vertical writing mode fonts (Japanese, etc).
- Additional bug fixes.