Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add more features to the BetterNLP library #45

Open
neomatrix369 opened this issue Nov 1, 2019 · 3 comments
Open

Add more features to the BetterNLP library #45

neomatrix369 opened this issue Nov 1, 2019 · 3 comments
Assignees
Labels
discussion enhancement New feature or request hacktoberfest Part of the Hacktoberfest 2020 (https://hacktoberfest.digitalocean.com)

Comments

@neomatrix369
Copy link
Owner

neomatrix369 commented Nov 1, 2019

On the back of this discussion, @shahanesanket and I will take this further
ydataai/ydata-profiling#278, some high-level ideas:

  • Missing value analysis
  • Text length analysis
    • 2.1 min, max, average, quantiles
    • 2.2 freq words, infrequent words (can include the deepmoji project's tokenizer. it's very robust)
    • 2.2 word cloud. (if it isn't a far stretched goal)

@shahanesanket let's continue with our discussions here.

@neomatrix369 neomatrix369 self-assigned this Nov 1, 2019
@neomatrix369 neomatrix369 added discussion enhancement New feature or request labels Nov 1, 2019
@neomatrix369
Copy link
Owner Author

@shahanesanket any thoughts on the above, shall we get started with your ideas and then draft some code on top of it!

@neomatrix369
Copy link
Owner Author

@shahanesanket

Please have a look at this implementation and let me know what you think, it's on the back of the issue you had raised as discussion point on the Pandas Profiling repo:
https://github.com/neomatrix369/awesome-ai-ml-dl/blob/master/examples/better-nlp/notebooks/jupyter/nlp_profiler.ipynb

I'm happy to expound on this further after hearing your response and feedback on it.

@neomatrix369
Copy link
Owner Author

NLP Profiler has been moved from under the Examples: BetterNLP section to into own repo: https://github.com/neomatrix369/nlp_profiler

@neomatrix369 neomatrix369 added the hacktoberfest Part of the Hacktoberfest 2020 (https://hacktoberfest.digitalocean.com) label Sep 25, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
discussion enhancement New feature or request hacktoberfest Part of the Hacktoberfest 2020 (https://hacktoberfest.digitalocean.com)
Projects
None yet
Development

No branches or pull requests

1 participant