Sideways Thought

Results 8 comments of Sideways Thought

I've written code that uses Numpty to count characters, and spacy to count tokens--count characters to create blocks of text that mix down to the number of tokens for an...

I pushed code last night into the associated branch that concatenates rather than averages.

https://github.com/sidewaysthought/Auto-GPT/tree/read_file-fix-character-length-%233222

> @sidewaysthought we can't process your contribution if you don't create a PR This was the PR https://github.com/Significant-Gravitas/Auto-GPT/pull/3262

The requirements.txt file will need to be hand sorted out. I don't want to mess things up more than I have.

I've drafted code that chunks, combines by averaging, then returning that combined embedding. I won't do anything with it if you've already done all the things @Pwuts.

The code as-is only functions with UTF-8. If it is in any other format, the code falls over. I've updated the code to detect the file encoding, then use that...