Releases · UglyToad/PdfPig

20 Jun 12:04

EliotJones

0.1.2-alpha003

5fb0458

0.1.2 Third Alpha Pre-release

Pre-release

Many updates to document layout analysis algorithms
Bugfix for files with a large number of non-data trailing bytes
Bugfix for OpenType fonts
Paths and glyphs are now correctly rotated when the page itself has a rotation value

Assets 3

10 May 15:37

EliotJones

01.2-alpha002

256c283

0.1.2 Second Alpha Pre-release

Pre-release

Adds letter font details and a couple of other bugfixes to the alpha version.

Assets 3

25 Apr 14:24

EliotJones

0.1.2-alpha001

98dd736

0.1.2 First Alpha Pre-release

Pre-release

First alpha version of 0.1.2

Assets 3

18 Mar 20:51

EliotJones

0.1.1

4ed1600

Cows That Move Backwards And Forwards

Many bug fixes for a whole range of document types. In addition:

Add support for JPG images in PdfDocumentBuilder using page.AddJpeg().
Access to marked content using page.GetMarkedContents()
Early access to PDF merging using PdfMerger.Merge()
Adds Doc-Comments back to the package.
Improvements to NearestNeighbourWordExtractor and other Document Layout Analysis classes to support rotated text.

Assets 3

15 Mar 16:54

EliotJones

0.1.1-alpha001

8ac4195

0.1.1 First Alpha Pre-release

Pre-release

A whole bunch of bug fixes and other changes.

Assets 3

13 Jan 11:35

EliotJones

0.1.0

e8401b8

And It Comes Out As MIWK

This version focuses on improving performance.

To enable this it replaces decimals with doubles for most of the public API. It also reorganizes the code internally to support access to font related classes.

For this reason consumers will need to update their code, see the migration guide on the wiki.

Other features:

Access to hyperlinks provides a convenience wrapper for retrieving annotations of type Link and their text content and destination. Use page.GetHyperlinks().
Bug fixes for glyph positions.
Access to the embedded files in the document. Use document.Advanced.TryGetEmbeddedFiles(out IReadOnlyList<EmbeddedFile> files).
Ability to provide a list of passwords to try when opening encrypted documents. Use ParsingOptions.Passwords to provide the list of passwords. Any password set in ParsingOptions.Password will be included in the list of passwords.
Many bug fixes for different documents.

Assets 3

08 Jan 14:27

EliotJones

0.1.0-beta002

6c1e3c7

0.1.0 Second Beta Pre-release

Pre-release

Updates the 0.1.0 beta version with many bug fixes.

Assets 3

06 Jan 12:41

EliotJones

0.1.0-beta001

8084586

0.1.0 First Beta Pre-release

Pre-release

First release which moves internal numerics from decimal to double where appropriate.

Reorganises internal project structure.

See migration details in the wiki: https://github.com/UglyToad/PdfPig/wiki/Migration-0.0.X-to-0.1.0

Assets 3

17 Dec 18:13

EliotJones

0.0.11

777bf9b

Farms With Fields Which Cross The Border

This release fixes a major performance regression in 0.10.0.

It also adds bug-fixes for several new issues as well as additional methods for the geometry objects PdfPath, PdfLine and PdfRectangle.

Assets 3

09 Dec 13:50

EliotJones

v0.0.10

af1217f

Mixed Together With Whiskey

This release adds two main new features:

Access to form elements (AcroForms) such as text input, checkboxes, radio-buttons, etc. Use document.TryGetForm(out AcroForm form) to get the form for the document if it contains one.
Access to bookmarks which define the document structure by linking to chapters, etc. Use document.TryGetBookmarks(out Bookmarks bookmarks) to get the document's bookmarks tree if it contains one.

It also aims to improve performance for most content retrieval operations resulting in up to double speed for the smallest documents.

It also adds bug-fixes, structure analysis tools and small improvements:

Adds document.GetPages() as a convenience method to enumerate all pages in a document.
Adds hOcr, AltoXml and PageXml format exporters to export the page content to standardized formats which can be used in other tools. These exporters implement the ITextExporter interface and are used to export each page to a compatible string.
Improves support for retrieving images from a page. The new page.GetImages() method enumerates all images on a page, images are either InlineImages or XObjectImages.
Adds support for extracting text which is defined in XObject forms (distinct from AcroForms) which was previously skipped, meaning text could have been missing from the page.Text on certain document types.
Adds support for vertical writing mode fonts (Japanese, etc).
Additional bug fixes.

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: UglyToad/PdfPig

0.1.2 Third Alpha

0.1.2 Second Alpha

0.1.2 First Alpha

Cows That Move Backwards And Forwards

0.1.1 First Alpha

And It Comes Out As MIWK

0.1.0 Second Beta

0.1.0 First Beta

Farms With Fields Which Cross The Border

Mixed Together With Whiskey