optillm

Optimizing inference proxy for LLMs

Results 21 optillm issues

There have been a few interesting papers over the past year that translate reasoning and logic problems into Prolog to enhance the capabilities of various LLMs. Some examples: * [Reliable...

enhancement

Hello, while trying bon, DeepSeek reports an error because it does not support n > 1, so I added ``` except: for _ in range(n): response = client.chat.completions.create( model=model, messages=messages, max_tokens=4096, n=1,...

enhancement
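
The fallback described in the issue above can be sketched as follows. This is a minimal illustration, not optillm's actual bon code: the helper name `sample_completions` and the `SingleSampleStub` class are hypothetical, and only the `client.chat.completions.create(...)` call shape is taken from the issue text. It first asks for `n` samples in one request and, if that raises, collects them one request at a time.

```python
from types import SimpleNamespace


def sample_completions(client, model, messages, n, max_tokens=4096):
    """Return n completion texts, falling back to n single-sample requests."""
    try:
        response = client.chat.completions.create(
            model=model, messages=messages, max_tokens=max_tokens, n=n
        )
        return [choice.message.content for choice in response.choices]
    except Exception:
        # Assumption: the failure means the provider rejects n > 1,
        # so request the samples one at a time instead.
        completions = []
        for _ in range(n):
            response = client.chat.completions.create(
                model=model, messages=messages, max_tokens=max_tokens, n=1
            )
            completions.append(response.choices[0].message.content)
        return completions


class SingleSampleStub:
    """Hypothetical stand-in for a provider that only accepts n == 1."""

    def __init__(self):
        self.calls = 0
        self.chat = SimpleNamespace(
            completions=SimpleNamespace(create=self._create)
        )

    def _create(self, model, messages, max_tokens, n):
        self.calls += 1
        if n > 1:
            raise ValueError("n > 1 is not supported")
        message = SimpleNamespace(content=f"sample {self.calls}")
        return SimpleNamespace(choices=[SimpleNamespace(message=message)])
```

Catching a broad `Exception` keeps the sketch short; a real implementation would probably want to catch only the provider's specific "unsupported parameter" error so that unrelated failures are not retried `n` times.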

Can we add a file store for the memory plugin, so we can give it permanent storage? I added sample code below. Let me know if it's a good plan...

enhancement
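
One way to picture the file-backed storage proposed above is an append-only JSON Lines file. The sketch below is purely illustrative: the issue's own sample code is not shown here, and `FileMemoryStore`, its `add` and `load` methods, and the on-disk format are all assumptions rather than part of optillm's memory plugin.

```python
import json
from pathlib import Path


class FileMemoryStore:
    """Hypothetical append-only store; not optillm's memory plugin API."""

    def __init__(self, path):
        self.path = Path(path)

    def add(self, text):
        # One JSON object per line, so appending never rewrites old entries.
        with self.path.open("a", encoding="utf-8") as f:
            f.write(json.dumps({"text": text}) + "\n")

    def load(self):
        # A missing file simply means nothing has been stored yet.
        if not self.path.exists():
            return []
        with self.path.open("r", encoding="utf-8") as f:
            return [json.loads(line)["text"] for line in f if line.strip()]
```

Because entries are re-read from disk on `load`, a new process pointed at the same path sees everything stored by earlier runs, which is the persistence the issue asks for.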

Based on the idea here - https://www.reddit.com/r/ollama/comments/1grlewl/comment/lxdd6hr/?context=3

enhancement

``` optillm | [Web Search Error]: Web search error: Failed to setup Chrome driver: Message: Service /home/appuser/.wdm/drivers/chromedriver/linux64/114.0.5735.90/chromedriver unexpectedly exited. Status code was: 127 optillm | optillm | 2025-11-21 02:47:10,817...

(.venv) niko@agentic-tools:~/optillm$ ruff check --fix . F401 `adaptive_classifier` imported but unused; consider using `importlib.util.find_spec` to test for availability --> optillm/autothink/classifier.py:39:24 | 37 | # Check if adaptive-classifier is installed 38...

## 🎯 Overview Implement a novel inference optimization approach inspired by [this research idea](https://x.com/willccbb/status/1940557166248972387): **a lightweight retriever that processes streaming Chain-of-Thought reasoning to inject contextual hints from a memory bank,...

help wanted

Hi there, This pull request shares a security update on optillm. We also have an entry for optillm in our directory, MseeP.ai, where we provide regular security and trust updates...

# Problem The emergence of the **inference-time compute paradigm** presents a critical safety challenge: ensuring **chain-of-thought (CoT) faithfulness**. Through our work on **OptiLLM**, an open-source inference optimization framework implementing over...

enhancement
help wanted
good first issue