Jonathan Besomi

Results 22 issues of Jonathan Besomi

Congratulations @KartikChugh and @sanujb for such a cool platform! Love it! Since you got some experience with wit.ai, how is it to work with? Also, can you compare it with...

**Motivation** Initially, [docusaurus.io](https://docusaurus.io/) has been chosen as a tool to visualize Texture's documentation on the website. Docusaurus is great for two reasons: it's very beautiful and it's super easy to...

documentation

The aim of this issue is to discuss and understand when `tokenize` should happen in the pipeline. The current solution is to apply `tokenize` once the text has already been...

version2

The following is a high-level view of the next main enhancement steps. This document will be kept up to date and improved frequently. This work will be mainly conducted...

Some of the most common user questions are: - Does Texthero support languages other than English? - How fast is Texthero? Other important questions are: - Why Texthero in the...

documentation

This document attempts to rank, by importance, the tasks related to improving the documentation on the website. - [ ] Add the "Edit on GitHub" button in every...

documentation

The current tokenizer is very fast because it uses a simple regex pattern, but it is also very imprecise. A better alternative might be to make use of `spaCy`....

enhancement
discussion
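To make the speed versus precision trade-off concrete, here is a minimal sketch of a regex-based tokenizer. The pattern and the function name `regex_tokenize` are assumptions chosen for illustration, not Texthero's actual implementation, and the sketch works on plain strings rather than a Pandas Series to stay dependency-free.

```python
import re

# Hypothetical pattern: a run of word characters, or a single
# non-space, non-word character (punctuation).
TOKEN_PATTERN = re.compile(r"\w+|[^\w\s]")


def regex_tokenize(text):
    """Split text into word and punctuation tokens with one regex pass."""
    return TOKEN_PATTERN.findall(text)


# Simple sentences come out as expected.
print(regex_tokenize("Hello, world!"))

# Contractions show the imprecision: the apostrophe splits the word.
print(regex_tokenize("don't"))
```

A single compiled pattern like this is cheap to run, but it has no notion of contractions, abbreviations, or URLs, which is the kind of imprecision a linguistically informed tokenizer is meant to address.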

As of now, `top_words` counts the occurrences of each word. Which other algorithms can be used to select the most *characteristic* terms? Inspirations: https://github.com/JasonKessler/scattertext Keywords: **Fisher's test select** and **Rudder** score.

enhancement
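The difference between *frequent* and *characteristic* terms can be illustrated with a small sketch. It compares raw counts with a smoothed log-ratio of relative frequencies between two groups of documents. This is a deliberately simple stand-in, not Fisher's test or the Rudder score mentioned above, and the helper names `word_counts` and `log_ratio_scores` are hypothetical.

```python
import math
from collections import Counter


def word_counts(docs):
    """Count whitespace-separated, lowercased words across documents."""
    counts = Counter()
    for doc in docs:
        counts.update(doc.lower().split())
    return counts


def log_ratio_scores(target_docs, other_docs):
    """Score each word by how much more frequent it is in target_docs.

    Uses add-one smoothing so that words missing from one group
    do not produce a division by zero.
    """
    target = word_counts(target_docs)
    other = word_counts(other_docs)
    vocab = set(target) | set(other)
    target_total = sum(target.values()) + len(vocab)
    other_total = sum(other.values()) + len(vocab)
    return {
        word: math.log((target[word] + 1) / target_total)
        - math.log((other[word] + 1) / other_total)
        for word in vocab
    }


sports = ["the match was great", "the team won the match"]
politics = ["the vote was close", "the party won the vote"]

# Raw counts favour a function word shared by both groups.
print(word_counts(sports).most_common(1))

# The log-ratio favours a word specific to the target group.
scores = log_ratio_scores(sports, politics)
print(max(scores, key=scores.get))
```

In this toy data the most frequent word is shared equally by both groups and scores zero, while the top-scoring word appears only in the target group.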

The current Travis CI log isn't particularly beautiful, [example](https://travis-ci.com/github/jbesomi/texthero/jobs/368704825). This task consists of cleaning the log output of Travis CI, mainly by improving the unit test files. Also, sometimes the Travis...

bug

(Edit) Add a function `hero.infer_lang(s)` (a suggestion for a better function name is more than welcome!) that, given a Pandas Series, finds the language of each row. **Implementation** 1....

enhancement
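As a rough illustration of the per-row behaviour described for `hero.infer_lang(s)`, here is a toy sketch based on stopword overlap. It is not the implementation proposed in the issue, whose steps are truncated above. The stopword lists, the helper names `infer_lang_row` and `infer_lang`, and the use of a plain list instead of a Pandas Series are all assumptions made to keep the example self-contained.

```python
# Tiny, hypothetical stopword lists; a real detector would need far more data.
STOPWORDS = {
    "en": {"the", "and", "is", "of", "to"},
    "it": {"il", "e", "di", "che", "sul"},
    "fr": {"le", "et", "de", "est", "sur"},
}


def infer_lang_row(text):
    """Return the language code whose stopwords match the most tokens.

    Returns None when no token matches any list.
    """
    tokens = text.lower().split()
    best_lang, best_hits = None, 0
    for lang, words in STOPWORDS.items():
        hits = sum(1 for token in tokens if token in words)
        if hits > best_hits:
            best_lang, best_hits = lang, hits
    return best_lang


def infer_lang(rows):
    """Apply the per-row heuristic to every text in an iterable."""
    return [infer_lang_row(text) for text in rows]


texts = [
    "the cat is on the table",
    "il gatto e sul tavolo",
    "le chat est sur la table",
    "12345",
]
print(infer_lang(texts))
```

With a Pandas Series, the same per-row helper could be applied with `s.apply(infer_lang_row)`; rows with no recognisable stopword yield `None` rather than a guess.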