Justin Waugh

Results 19 comments of Justin Waugh

Using internal classes, this is my workaround for now that seems to be working. Sharing in case its useful for anyone else. ```python import guidance import nest_asyncio import asyncio def...

I think that this package: https://github.com/marella/ctransformers `ctransformers` (new as of a few days ago) has the look and API feel of huggingface transformers, but works directly on ggml models and...

Ah yeah, I guess I read this issue as `ggml` models and not just `llama.cpp` when i replied with the suggestion -> tbh, i didn't appreciate the difference between these,...

Just tried `santacoder` and it does seem to have the same problem, but at a very different scale. (Error is the same) (had to put in >700, maybe around 1000...

https://github.com/bigcode-project/starcoder.cpp/issues/3 Seems someone else has run into this on the `starcoder.cpp`

Hey @lrauhockey, I like this idea but I don't have time to add it, feel free to add a PR if you can think of a nice kwarg or path...

Hey! Right now this relies on being able to run aggregations quickly to summarize the data to add to the prompt, so it only really works if the data is...

Hi @tszumowski Fast Version: Current Stored State: logs and prompt history, right now we store Nginx logs to measure engagement (number of calls), and prompt history gets deleted on every...

Hey @dave-killough , I attached to a specific dataframe because it was the easiest to "gather" the local context. In a python kernel, just doing `sketch.ask(...)` and inspecting the `locals()`...

@marella Any update on this? I'm looking forward to using StarChat-ggml weights in guidance via ctransformers~ I will take a stab at this later this week, but I don't want...