'NoneType' object has no attribute 'tokenize'
I'm using Cohere and unstructured, and I'm receiving that error when trying to load a PDF. It works fine with the simple reader, but not with the PDF options.
This is the log:
ℹ Received Data to Import: READER(PDFReader, Documents 1, Type Documentation) CHUNKER (TokenChunker, UNITS 250, OVERLAP 50), EMBEDDER (MiniLMEmbedder)
✔ Loaded ai-03-00057.pdf
✔ Loaded 1 documents
Chunking documents: 100%|████████████████████████████████████████████| 1/1 [00:00<00:00, 37.20it/s]
✔ Chunking completed
Vectorizing document chunks: 0%| | 0/1 [00:00<?, ?it/s]
✘ Loading data failed 'NoneType' object has no attribute 'tokenize'
Regards.
Thanks for the issue! It looks like you're using the SentenceTransformer MiniLM model to embed the chunks; is that intended? It might be that some dependencies are missing. Are you running Verba in a fresh Python environment?
I tried all the possibilities; using MiniLM was just one attempt.
This is the log I got on another try:
ℹ Received Data to Import: READER(UnstructuredPDF, Documents 1, Type Documentation) CHUNKER (SentenceChunker, UNITS 3, OVERLAP 2), EMBEDDER (CohereEmbedder)
✔ Loaded xxx.pdf
✔ Loaded 1 documents
Chunking documents: 100%|██████████| 1/1 [00:00<00:00, 28.90it/s]
✔ Chunking completed
ℹ (1/1) Importing document xxxx.pdf with 2 batches
✘ {'errors': {'error': [{'message': 'update vector: API Key: no api key found neither in request header: X-Openai-Api-Key nor in environment variable under OPENAI_APIKEY'}]}, 'status': 'FAILED'}
Importing batches: 100%|██████████| 2/2 [00:03<00:00, 1.80s/it]
✘ Loading data failed Document 09a44f39-fb85-4182-b853-b0990925f7fc not found None
It seems it is trying to use OpenAI even though the Cohere embedder is selected.
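For what it's worth, that exact error string comes from Weaviate's text2vec-openai module, so my guess is that the class schema itself was created with the OpenAI vectorizer. A minimal sketch of a Cohere-backed class using the v3 Python client (placeholder names, not Verba's actual schema):

```python
import weaviate

client = weaviate.Client("http://localhost:8080")  # placeholder URL

# A class whose vectors are produced by Cohere rather than OpenAI.
cohere_class = {
    "class": "DocumentChunk",          # placeholder class name
    "vectorizer": "text2vec-cohere",   # instead of text2vec-openai
    "moduleConfig": {
        "text2vec-cohere": {
            "model": "embed-multilingual-v2.0",  # placeholder model choice
        }
    },
}
client.schema.create_class(cohere_class)
```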
Regards.
Thanks for the insights! I'll look into fixing this 👍
We merged some fixes, are you still getting these errors?
I was getting the same error and found out that it was due to:
⚠ Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate:
`pip install accelerate`
Perhaps adding accelerate as a direct dependency of Verba would be desirable?
https://github.com/weaviate/Verba/blob/1c9d4b49385315883ba0027ac1772a8b448f6204/goldenverba/components/embedding/MiniLMEmbedder.py#L26-L42
`device_map` should be of type str or dict, not torch.device:
https://github.com/huggingface/transformers/blob/edb170238febf7fc3e3278ed5b9ca0b2c40c70e3/src/transformers/tools/base.py#L460-L461
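For illustration, here is a minimal sketch of a load call with a `device_map` of an accepted type; the checkpoint name and the `get_device` helper are assumptions modelled on the linked file, not verbatim Verba code:

```python
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "sentence-transformers/all-MiniLM-L6-v2"  # assumed checkpoint

def get_device() -> str:
    # Return a plain string so it can go into a device_map;
    # from_pretrained rejects a raw torch.device here.
    return "cuda" if torch.cuda.is_available() else "cpu"

# Passing a device_map (here a dict mapping the whole model to one device)
# is what requires accelerate, hence the warning quoted above.
model = AutoModel.from_pretrained(MODEL_NAME, device_map={"": get_device()})
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
```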
I was getting the same error when using MiniLMEmbedder on my Mac, which doesn't have a CUDA GPU. So I tried @f0rmiga's solution and updated my code like this:
```python
from accelerate import Accelerator

accelerator = Accelerator()
```

Then, after `self.device = get_device()`, I added `self.device = accelerator.device`.
Now MiniLMEmbedder works fine and the document's chunks are being vectorized.
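Put together, a condensed sketch of the patched `__init__` (assuming the sentence-transformers MiniLM checkpoint; this is not Verba's exact code):

```python
from accelerate import Accelerator
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "sentence-transformers/all-MiniLM-L6-v2"  # assumed checkpoint

class MiniLMEmbedder:
    def __init__(self):
        # Accelerator().device resolves to cuda, mps (Apple Silicon), or cpu,
        # so this also works on a Mac without a CUDA GPU.
        self.device = Accelerator().device
        self.tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
        # Move the model explicitly rather than passing a torch.device as device_map.
        self.model = AutoModel.from_pretrained(MODEL_NAME).to(self.device)
        self.model.eval()
```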
This should be fixed with the newest v1.0.0 version!
With AdaEmbedder on Azure OpenAI the issue still persists:
✘ {'errors': {'error': [{'message': "update vector: unmarshal response body: invalid character '<' looking for beginning of value"}]}, 'status': 'FAILED'}
I am using goldenverba version 1.0.1. Also, inside schema_generation.py, the `deploymentId`, `resourceName`, and `baseURL` fields under `"text2vec-openai"` are defined and correct.
Which `openai` version have you installed?
I have installed version 0.27.9; however, I also tried 1.30.1 and got the same error.
Make sure to use the 0.27.9 version (`pip install openai==0.27.9`). I'll take a closer look at the Azure implementation.