Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix issues with reference link extraction using HTML/XML readers #10

Merged
merged 3 commits into from
Nov 1, 2016

Commits on Nov 1, 2016

  1. Get all nested text for reference links without href

    Reference links with an immediate child element (e.g. sup) return None for their text property, raising an exception on the strip() call. Instead, use itertext() to get all nested text and join.
    mcs07 committed Nov 1, 2016
    Configuration menu
    Copy the full SHA
    e4c28ff View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    528c37e View commit details
    Browse the repository at this point in the history
  3. Add ACS reader test

    mcs07 committed Nov 1, 2016
    Configuration menu
    Copy the full SHA
    f59592b View commit details
    Browse the repository at this point in the history