Niek van der Maas
`llama-cpp-python` has a Docker image now: https://github.com/abetlen/llama-cpp-python/pkgs/container/llama-cpp-python
Thanks! `llama-cli` with the API addition sounds like a great match for ChatGPT-web! The models don't work because we hard-code an explicit list of supported models: https://github.com/Niek/chatgpt-web/blob/1926f7df15b5bf099d1f0ad29740d35c98cfbbdf/src/lib/Types.svelte#L2-L9 This can be quite easily fixed...
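For illustration, a minimal TypeScript sketch of the kind of hard-coded allow-list involved (the `supportedModels` name, model IDs, and helper are placeholders, not the actual contents of `Types.svelte`):

```typescript
// Hypothetical sketch: a hard-coded allow-list of model IDs.
// Any model not listed here would be rejected by the UI.
const supportedModels = ['gpt-3.5-turbo', 'gpt-4'] as const

type Model = (typeof supportedModels)[number]

// Making this configurable could mean merging in user-supplied IDs
// (e.g. from an env var or a settings field) instead of the fixed list.
function isSupported (model: string): model is Model {
  return (supportedModels as readonly string[]).includes(model)
}
```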
We have this ability with an env var, but I assume you want to have it configurable in the web interface?
Seems like the API is pretty similar: https://readme.fireworks.ai/docs/openai-compatibility So you could try setting the `VITE_API_BASE` env var.
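For reference, Vite exposes `VITE_`-prefixed env vars to client code via `import.meta.env`; a rough sketch of how an alternative base URL might be consumed (the fallback URL and endpoint path here are assumptions, not the app's actual code):

```typescript
// Sketch: read the API base URL from a Vite env var, falling back to OpenAI.
const apiBase: string = import.meta.env.VITE_API_BASE || 'https://api.openai.com'

// Any OpenAI-compatible server can then be targeted by changing VITE_API_BASE.
const completionsUrl = `${apiBase}/v1/chat/completions`
```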
There are some approaches to work around the token limit:

* Truncate the conversation by removing old messages, as you proposed (see the sketch below)
* Summarize the conversation in a new API call, ...
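As a rough sketch of the first option (the token estimate here is a crude placeholder, not the tokenizer the app actually uses):

```typescript
interface Message { role: 'system' | 'user' | 'assistant'; content: string }

// Crude placeholder: ~4 characters per token is a common rule of thumb,
// not an exact count from a real tokenizer.
const estimateTokens = (m: Message): number => Math.ceil(m.content.length / 4)

// Drop the oldest non-system messages until the conversation fits the budget.
function truncate (messages: Message[], maxTokens: number): Message[] {
  const kept = [...messages]
  let total = kept.reduce((sum, m) => sum + estimateTokens(m), 0)
  while (total > maxTokens) {
    const i = kept.findIndex(m => m.role !== 'system')
    if (i === -1) break
    total -= estimateTokens(kept[i])
    kept.splice(i, 1)
  }
  return kept
}
```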
There is an interesting compression approach described here: https://twitter.com/VictorTaelin/status/1642664054912155648
One thing to note is that the "compression" and "decompression" are a lot more consistent if you set the temperature to 0, meaning more deterministic and less random output. In...
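For context, a minimal sketch of setting the temperature on a chat completion request (the model name, prompt, and endpoint base are placeholders):

```typescript
// Sketch: request deterministic output by setting temperature to 0.
const response = await fetch('https://api.openai.com/v1/chat/completions', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    Authorization: `Bearer ${process.env.OPENAI_API_KEY}`
  },
  body: JSON.stringify({
    model: 'gpt-3.5-turbo',
    temperature: 0, // deterministic: always pick the most likely token
    messages: [{ role: 'user', content: 'Decompress: <compressed text>' }]
  })
})
const data = await response.json()
```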
I'm thinking about this too... it should be possible to make a "browsing" plugin using function calling and Puppeteer exposed with a simple JSON API. But it's pretty risky overall. Another (more...
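A very rough sketch of that idea (Express + Puppeteer behind a single JSON endpoint; the route name and response shape are made up, and this deliberately ignores the safety/sandboxing concerns mentioned above):

```typescript
import express from 'express'
import puppeteer from 'puppeteer'

// Sketch only: exposes page text over a JSON API so the model can "browse"
// via function calling. Route name and response shape are hypothetical.
const app = express()

app.get('/browse', async (req, res) => {
  const url = String(req.query.url ?? '')
  const browser = await puppeteer.launch()
  try {
    const page = await browser.newPage()
    await page.goto(url, { waitUntil: 'networkidle2' })
    const text = await page.evaluate(() => document.body.innerText)
    res.json({ url, text: text.slice(0, 4000) }) // keep it small for the prompt
  } catch (err) {
    res.status(500).json({ error: String(err) })
  } finally {
    await browser.close()
  }
})

app.listen(3000)
```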
Ah cool, I didn't see the other repo! The extension model is great for casual tasks, but when operating at scale you can't really work without CDP/remote browsers. Speaking as...
This would also be a great feature to support changelog creation.