optillm
optillm copied to clipboard
Optimizing inference proxy for LLMs
There have been a few interesting papers of the past year that translated reasoning and logic problems into prolog to enhance the capacity of various llms. Some examples: * [Reliable...
Hello, while trying bon, deepseek reports error since deepseek can't support n > 1, so I added ``` except: for _ in range(n): response = client.chat.completions.create( model=model, messages=messages, max_tokens=4096, n=1,...
can we add fiel store for the memory plugin? so we can give a permanent storage i added a sample code below. let me know if its a good plan...
Based on the idea here - https://www.reddit.com/r/ollama/comments/1grlewl/comment/lxdd6hr/?context=3
``` optillm | [Web Search Error]: Web search error: Failed to setup Chrome driver: Message: Service /home/appuser/.wdm/drivers/chromedriver/linux64/114.0.5735.90/chromedriver unexpectedly ex ited. Status code was: 127 optillm | optillm | 2025-11-21 02:47:10,817...
(.venv) niko@agentic-tools:~/optillm$ ruff check --fix . F401 `adaptive_classifier` imported but unused; consider using `importlib.util.find_spec` to test for availability --> optillm/autothink/classifier.py:39:24 | 37 | # Check if adaptive-classifier is installed 38...
## 🎯 Overview Implement a novel inference optimization approach inspired by [this research idea](https://x.com/willccbb/status/1940557166248972387): **a lightweight retriever that processes streaming Chain-of-Thought reasoning to inject contextual hints from a memory bank,...
Hi there, This pull request shares a security update on optillm. We also have an entry for optillm in our directory, MseeP.ai, where we provide regular security and trust updates...
# Problem The emergence of the **inference-time compute paradigm** presents a critical safety challenge: ensuring **chain-of-thought (CoT) faithfulness**. Through our work on **OptiLLM**, an open-source inference optimization framework implementing over...