add BM25 and BagOfWords transformers, update tests, update readme, re… #12

Merged
ablaom merged 23 commits into dev from add_cv_and_bmi25
Dec 21, 2021

Conversation

pazzo83
Collaborator

@pazzo83 pazzo83 commented Dec 15, 2021

Adding two new transformers, with updated tests and README.
Also some refactoring, since there is now some shared code.

@pazzo83 pazzo83 requested review from ablaom and storopoli December 15, 2021 16:43
@codecov-commenter

codecov-commenter commented Dec 15, 2021

Codecov Report

Merging #12 (142752b) into dev (000a451) will increase coverage by 7.28%.
The diff coverage is 89.86%.

@@            Coverage Diff             @@
##              dev      #12      +/-   ##
==========================================
+ Coverage   81.05%   88.33%   +7.28%     
==========================================
  Files           3        7       +4     
  Lines          95      180      +85     
==========================================
+ Hits           77      159      +82     
- Misses         18       21       +3     
| Impacted Files | Coverage Δ |
|---|---|
| src/MLJText.jl | 100.00% <ø> (ø) |
| src/abstract_text_transformer.jl | 75.86% <75.86%> (ø) |
| src/bagofwords_transformer.jl | 88.00% <88.00%> (ø) |
| src/bm25_transformer.jl | 89.18% <89.18%> (ø) |
| src/utils.jl | 97.67% <97.67%> (ø) |
| src/tfidf_transformer.jl | 87.50% <100.00%> (+7.25%) ⬆️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 000a451...142752b. Read the comment docs.

Member

@ablaom ablaom left a comment

Thanks @pazzo83 for this very substantial contribution 🎉 .

I have reviewed the PR with mainly the MLJ model API in mind.

@storopoli Are you able to look over some of the core algorithm implementation details? @pazzo83 It would be great if you could flag the names of those core functions.

@storopoli
Collaborator

Yes, I can, but only starting next week (the final week of the academic semester is hell right now).

Collaborator

@storopoli storopoli left a comment

The implementation is great.
My only comment is about the unusual double spacing after periods in the .md files and docstrings.
Was it intentional?

pazzo83 and others added 15 commits December 19, 2021 10:43
Co-authored-by: Jose Storopoli <43353831+storopoli@users.noreply.github.com>
pazzo83 and others added 4 commits December 21, 2021 12:54
Co-authored-by: Jose Storopoli <43353831+storopoli@users.noreply.github.com>
@ablaom ablaom merged commit d772365 into dev Dec 21, 2021
@ablaom ablaom mentioned this pull request Dec 21, 2021
@storopoli storopoli deleted the add_cv_and_bmi25 branch December 21, 2021 20:15
4 participants