Enrico Ros

204 comments by Enrico Ros

The maximum number of output tokens for Gemini models is 8192. See here: https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/gemini When selecting Gemini models, Big-AGI lets you change the output tokens up to the full range. The values...

I see the usefulness of this feature; however, the real issue is that some APIs do not report the context window of the models, or the maximum output tokens. Anthropic,...

Good feature idea. Some quick feedback: - I like the multi-level menu structure - expandable menus could have the "Category ..." (Dot dot dot) and even a ">" icon end...

One more thought: take a look at the `pmix` file if you want to use variables that are replaced instantly. This could be relevant to your efforts.

Thanks @sealad886, this has been implemented as a "Source" parameter, meaning you toggle it once for the source rather than for each model. To turn it on: ![image](https://github.com/enricoros/big-AGI/assets/32999/af77441d-558b-4188-b601-4d408c3efc0e) Then you can...

From a conversation on Nov 22, 2023 with ChatGPT4, we learn the following: ``` You are ChatGPT, a large language model trained by OpenAI, based on the GPT-4 architecture. Knowledge...

System prompt of ChatGPT. This is how they tell it how to behave. Probably since the network regurgitates lyrics, they are just telling it not to show them to the user :)

Thanks for the report @james777b. The compression prompts are probably not "strong" enough to work well with smaller models, and should be revised. It's impressive how the model hallucinated that...

Hi @argen666, welcome! Please let me know how you are thinking of integrating embeddings; they could be used at a per-chat, per-message, or per-chunk level, and to enable many use cases:...