twitter-topics icon indicating copy to clipboard operation
twitter-topics copied to clipboard

CountVectorizer error

Open Mustyy opened this issue 7 years ago • 3 comments

raise ValueError("empty vocabulary; perhaps the documents only" ValueError: empty vocabulary; perhaps the documents only contain stop words

Why am i getting this error, min_df = 2 and everything else seems to be right

Mustyy avatar Apr 21 '17 16:04 Mustyy

@Mustyy Please can you tell me how you are working on this? Please can you share me the details? I need to know what's the input file structure.

bozzmob avatar May 09 '17 09:05 bozzmob

@bozzmob Hey, thanks for getting back to me I extracted json to txt then im running twitter-topics-from-json-text-stream.py

Mustyy avatar May 09 '17 15:05 Mustyy

Hi @Mustyy Thanks for the quick reply. That is actually my question.

How do you get extract-json-to-text-stream.py to work?

What format is your input file? Please can you share its format?

bozzmob avatar May 09 '17 16:05 bozzmob