DataProfiler
TextProfiler: only apply case sensitivity at report time
Instead of checking case sensitivity at profile time, we can always profile with case sensitivity on and, at report time, collapse the output if the options require case-insensitive results.
This would solve #210
This applies to: https://github.com/capitalone/DataProfiler/blob/main/dataprofiler/profilers/unstructured_text_profile.py
Instead of applying is_case_sensitive in the functions _update_words, _update_vocab, and _merge_words, move it to where the counts are merged in the functions profile and diff.
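To illustrate the idea, here is a minimal sketch of collapsing case-sensitive word counts at report time. It is not DataProfiler's actual code: the helper name collapse_case and its signature are hypothetical, and it assumes the word counts are stored in a Counter-like mapping from word to count.

```python
from collections import Counter


def collapse_case(word_counts, is_case_sensitive):
    """Hypothetical helper (not part of DataProfiler).

    Takes counts that were collected case-sensitively and, if the
    options ask for case-insensitive output, merges keys that differ
    only by case.
    """
    if is_case_sensitive:
        # Nothing to collapse; return a copy of the stored counts.
        return Counter(word_counts)

    collapsed = Counter()
    for word, count in word_counts.items():
        # Fold every spelling of a word into its lowercase form.
        collapsed[word.lower()] += count
    return collapsed


# Counts as they might be stored at profile time (always case-sensitive).
stored = Counter({"The": 2, "the": 3, "Data": 1})

print(collapse_case(stored, is_case_sensitive=False))
print(collapse_case(stored, is_case_sensitive=True))
```

Because the stored counts keep the original casing, the same profile could be reported either way, and two profiles could be merged or diffed before any collapsing happens.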
Can I work on this? Would I just have to move the application of case-sensitive to the functions mentioned in the above comment?