DataProfiler icon indicating copy to clipboard operation
DataProfiler copied to clipboard

TextProfiler only do case sensitivity at report

Open JGSweets opened this issue 3 years ago • 3 comments

Instead of case sensitivity being checked at profile time, we can leave sensitivity on and at report time we can then output collapse it if the options require case-insensitive.

JGSweets avatar May 06 '21 18:05 JGSweets

This would solve #210

lettergram avatar Aug 31 '21 21:08 lettergram

This applies to: https://github.com/capitalone/DataProfiler/blob/main/dataprofiler/profilers/unstructured_text_profile.py

Instead of applying is_case_sensitive in the functions _update_words, _update_vocab, _merge_words, move them to merging the counts in the functions profile and diff

JGSweets avatar Sep 13 '22 16:09 JGSweets

Can I work on this? Would I just have to move the application of case-sensitive to the functions mentioned in the above comment?

thebadcoder96 avatar Dec 23 '23 05:12 thebadcoder96