pointerhacker

Results 4 issues of pointerhacker

llm_quantization this blog cant find why offline?

# refactor: Check for embedding model consistency This pull request introduces a check to ensure that the embedding model used during inference is consistent with the model used when creating...

Summary This PR introduces support for custom sglang deployments that are OpenAI-compatible but have minor deviations from the official API specification. It also improves the frontend user experience by adding...

Summary This PR introduces support for custom sglang deployments that are OpenAI-compatible but have minor deviations from the official API specification. It also improves the frontend user experience by adding...