textacy
textacy copied to clipboard
NLP, before and after spaCy
### steps to reproduce First create the following Corpus, save it to disk, and note that upon reloading you can still get word doc counts: ``` import textacy corpus =...
Is it possible to add context to ngram extraction? For example, currently running `list(textacy.Doc('I like green eggs and ham.').to_terms_list(ngrams=3,as_strings=True))` returns a list `['-PRON- like green', 'like green egg', 'egg and...
## Expected Behavior Users expect to learn from code examples and tutorials more so than from reading an API reference. We should oblige. ## Current Behavior Fairly brief usage examples...
I'm having numerous issues with stopwords when working with textacy's make_doc_from_text_chunks functionality. ## Expected Behavior I want to be able to load a model and then fire documents at it...
Hello, I am trying to use this function but it seems that it does not consider these kind of examples: ``` from textacy import extract as ex doc=nlp(" Kurds are...
Hi, I have function ``` def bag_of_term(text): doc = textacy.Doc(text) bot = doc.to_bag_of_terms(ngrams=(2, 3, 4), named_entities=True, weighting='count', as_strings=True) print(bot.items()) return sorted(bot.items(), key=lambda x: x[1], reverse=True) ``` which works normally and...
Thanks for this very useful library (& the porting onto spacy 2.0!). I'm playing with the various keyword extraction functions. I wonder on the best strategy to further whittle down...
_I posted this idea over on the spaCy repo but it didn't get picked up there ([spaCy issue 2237](https://github.com/explosion/spaCy/issues/2237)) . So perhaps it is suitable for this package instead..._ ###...
## Expected Behavior It would be nice if [some of the features](https://radimrehurek.com/gensim/models/phrases.html) of the `gensim.models.Phrases()` tool could get implemented into the `doc.to_terms_list()` method, or even elsewhere (expecially in the `textacy.preprocess.preprocess_text()`...
Hi, I am trying to extract subject-verb-object triplets from my data and then attach an ID like this: ``` #Make dataframe with SVO extraction count = 0; df2 = pd.DataFrame();...