Johannes Filter
Johannes Filter
Hey @jdvala, this is good idea. I would suggest to use Python's [multiprocessing](https://docs.python.org/3/library/multiprocessing.html), e.g. with a pool. What's your opinion on this?
Hey @jdvala, in my opinion, the clean function should also accept a list of texts and then return a list of processed texts. Then, we need a new parameter, e.g....
Hey, so you are interested in the pairs of source and destination. Something like (x.jpg, test/x.jpg)? What is your use case for the paths? When do you need the file...
Thanks for the explanations. I will look into the issue.
I'm not sure if this package is right for you. I does not support this kind of folder structure. I think scikit learn got you covered:
You need to install this package via 'pip install clean-text' or 'poetry add clean-text'. Once you have it installed, you can import the package like this: ```python import clean-text clean-text.clean()...
Hey @lemon234071, thanks for reporting. I'm not sure how to handle this. Right now, the URL has to be somehow separated from other tokes (e.g. by a preceding space). In...
Hey @cvzi, thanks for bringing this up. I will look into this for the next release.
In v0.6.0 I restricted the version of emoji to v1.x, so there should not be a problem if you use v0.6.0. Let me know if you still have the problem....
> I'm seeing that v0.6.0 of clean-text is still attempting to import "UNICODE_EMOJI" Which should be fine because it used emoji v1. Do you have problems with v0.6.0?