Snoopy

Results 7 issues of Snoopy

Fix AssertionError ( model_file should exist ) #1912

https://python.langchain.com/en/latest/use_cases/question_answering/semantic-search-over-chat.html https://github.com/hwchase17/langchain/blob/master/docs/use_cases/question_answering/semantic-search-over-chat.ipynb ![image](https://user-images.githubusercontent.com/54778084/235321215-c5907803-064b-4ea5-beb9-8e51815cf605.png) ![image](https://user-images.githubusercontent.com/54778084/235321252-ca274d67-f98c-4e6d-ab4e-7ac987f11a93.png) ![image](https://user-images.githubusercontent.com/54778084/235321303-3a8e92f8-9011-4ee1-a6af-3302a131610c.png) Apparently, `split_documents` function's input need to have attributes of `page_content` and `metadata`, but getting string list as input here

Tested with second sample of ChatGPT and the detection result is not same with server. The test result of https://openai-openai-detector.hf.space/ ![image](https://user-images.githubusercontent.com/54778084/211030406-85c0330f-a52b-45e3-8b4d-97a22e8c132d.png) Test result with `roberta-base` model on localhost ![image](https://user-images.githubusercontent.com/54778084/211030544-70d68d76-1676-483f-a482-e29d7682fecf.png) Test...

NameError: name 'processor' is not defined ![image](https://github.com/huggingface/community-events/assets/54778084/66b825ba-cc72-45dd-b89f-edddd3f92b82)

same issue in the parent https://github.com/imartinez/privateGPT/issues/859 possible solution https://github.com/imartinez/privateGPT/pull/881/files

![image](https://user-images.githubusercontent.com/54778084/106839650-9e9da500-66d9-11eb-9d8e-8a0e3c13d983.png)