awesome-ai-ml-dl icon indicating copy to clipboard operation
awesome-ai-ml-dl copied to clipboard

Add more features to the BetterNLP library

Open neomatrix369 opened this issue 6 years ago • 3 comments
trafficstars

On the back of this discussion, @shahanesanket and I will take this further https://github.com/pandas-profiling/pandas-profiling/issues/278, some high-level ideas:

  • Missing value analysis
  • Text length analysis
    • 2.1 min, max, average, quantiles
    • 2.2 freq words, infrequent words (can include the deepmoji project's tokenizer. it's very robust)
    • 2.2 word cloud. (if it isn't a far stretched goal)

@shahanesanket let's continue with our discussions here.

neomatrix369 avatar Nov 01 '19 15:11 neomatrix369

@shahanesanket any thoughts on the above, shall we get started with your ideas and then draft some code on top of it!

neomatrix369 avatar Nov 07 '19 09:11 neomatrix369

@shahanesanket

Please have a look at this implementation and let me know what you think, it's on the back of the issue you had raised as discussion point on the Pandas Profiling repo: https://github.com/neomatrix369/awesome-ai-ml-dl/blob/master/examples/better-nlp/notebooks/jupyter/nlp_profiler.ipynb

I'm happy to expound on this further after hearing your response and feedback on it.

neomatrix369 avatar Jul 07 '20 21:07 neomatrix369

NLP Profiler has been moved from under the Examples: BetterNLP section to into own repo: https://github.com/neomatrix369/nlp_profiler

neomatrix369 avatar Sep 06 '20 10:09 neomatrix369