VoiceInk
VoiceInk copied to clipboard
Add the thinking version of Qwen 235B model
It has just been released on Cerebras. Now, there are two models: thinking and non-thinking. Therefore, access to both would be really good because the thinking model would be an alternative to the 2.5 Flash model.
I see that the thinking model was added and then removed. Why was this the case? @Beingpax
The thinking model was too slow to respond, taking around 5 to 7 seconds. So, to prevent user confusion, I removed it.
Also the output format doesn't enclose inside think tokens properly messing with our thinking token filter.