Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
ENH: Text Extraction improvements (#969)
* Improvements around /Encoding / /ToUnicode * Extraction of CMaps improved * Fallback for font def missing * Support for /Identity-H and /Identity-V: utf-16-be * Support for /GB-EUC-H / /GB-EUC-V / GBp/c-EUC-H / /GBpc-EUC-V (beta release for evaluation) * Arabic (for evaluation) * Whitespace extraction improvements
- Loading branch information