Sideways Thought
Sideways Thought
I've written code that uses Numpty to count characters, and spacy to count tokens--count characters to create blocks of text that mix down to the number of tokens for an...
Merging vectors it is!
I pushed code last night into the associated branch that concatenates rather than averages.
https://github.com/sidewaysthought/Auto-GPT/tree/read_file-fix-character-length-%233222
> @sidewaysthought we can't process your contribution if you don't create a PR This was the PR https://github.com/Significant-Gravitas/Auto-GPT/pull/3262
The requirements.txt file will need to be hand sorted out. I don't want to mess things up more than I have.
I've drafted code that chunks, combines by averaging, then returning that combined embedding. I won't do anything with it if you've already done all the things @Pwuts.
The code as-is only functions with UTF-8. If it is in any other format, the code falls over. I've updated the code to detect the file encoding, then use that...