Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalisation improvement #95

Merged
merged 56 commits into from
Aug 6, 2021
Merged

Normalisation improvement #95

merged 56 commits into from
Aug 6, 2021

Conversation

lfoppiano
Copy link
Owner

@lfoppiano lfoppiano commented Jul 2, 2019

With this PR I've improved the implementation for the normalisation:

I will merge it when the UOM library will be more stable and we will have more coverage.

…ng as it was creating the training data from the already parsed units and not just from the plain tagged one - the slashes were gone for example #90
This was linked to issues Jul 6, 2020
@lfoppiano lfoppiano self-assigned this Jul 6, 2020
# Conflicts:
#	.travis.yml
#	build.gradle
#	resources/models/values/model.wapiti
#	src/main/java/org/grobid/core/data/normalization/QuantityNormaliser.java
#	src/main/java/org/grobid/core/engines/QuantitiesEngine.java
#	src/main/java/org/grobid/core/engines/QuantityParser.java
#	src/main/java/org/grobid/core/engines/UnitParser.java
#	src/main/java/org/grobid/core/features/FeaturesVectorQuantities.java
#	src/main/java/org/grobid/core/features/FeaturesVectorUnits.java
#	src/main/java/org/grobid/core/features/FeaturesVectorValues.java
#	src/main/java/org/grobid/core/utilities/MeasurementOperations.java
#	src/main/java/org/grobid/trainer/UnitTrainer.java
#	src/main/resources/lexicon/en/units.json
#	src/test/java/org/grobid/core/engines/UnitParserIntegrationTest.java
#	src/test/java/org/grobid/core/utilities/TextParserIntegrationTest.java
…normalisation-improvement

# Conflicts:
#	build.gradle
#	gradle.properties
#	resources/models/values/model.wapiti
#	src/main/java/org/grobid/core/data/normalization/QuantityNormalizer.java
#	src/main/java/org/grobid/core/engines/QuantityParser.java
#	src/main/java/org/grobid/core/utilities/MeasurementOperations.java
# Conflicts:
#	.travis.yml
#	build.gradle
#	resources/models/values/model.wapiti
#	src/main/java/org/grobid/core/data/normalization/QuantityNormaliser.java
#	src/main/java/org/grobid/core/engines/QuantitiesEngine.java
#	src/main/java/org/grobid/core/engines/QuantityParser.java
#	src/main/java/org/grobid/core/engines/UnitParser.java
#	src/main/java/org/grobid/core/features/FeaturesVectorQuantities.java
#	src/main/java/org/grobid/core/features/FeaturesVectorUnits.java
#	src/main/java/org/grobid/core/features/FeaturesVectorValues.java
#	src/main/java/org/grobid/core/utilities/MeasurementOperations.java
#	src/main/java/org/grobid/trainer/UnitTrainer.java
#	src/main/resources/lexicon/en/units.json
#	src/test/java/org/grobid/core/engines/UnitParserIntegrationTest.java
#	src/test/java/org/grobid/core/utilities/TextParserIntegrationTest.java
@lfoppiano lfoppiano marked this pull request as ready for review July 6, 2020 02:17
@lfoppiano lfoppiano requested review from kermitt2 and Aazhar July 6, 2020 02:18
@lfoppiano lfoppiano changed the title [wip] Normalisation improvement Normalisation improvement Feb 10, 2021
@lfoppiano
Copy link
Owner Author

I think, unless I forgot something, this PR is ready to be merged.

@lfoppiano
Copy link
Owner Author

Part of this PR was the update of the unit model. Before running the application, the user needs to run ./gradlew copyModels.

@lfoppiano
Copy link
Owner Author

After updating is important to copy the models again. The Units and Values model has been updated in the meantime.

@lfoppiano lfoppiano merged commit af220ef into master Aug 6, 2021
@lfoppiano lfoppiano deleted the normalisation-improvement branch August 10, 2021 03:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Normalisation issues imperial units are not supported
2 participants