deepseek topic
inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any...
Awesome-LLM-Inference
📖 A curated list of awesome LLM inference papers with code: TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention, etc.
ms-swift
Use PEFT or full-parameter training to fine-tune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Visio...
inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
aicommit2
A reactive CLI that generates Git commit messages with Ollama, ChatGPT, Gemini, Claude, Mistral, and other AI models
deepseek-free-api
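🚀 Reverse-engineered free API for the DeepSeek-V2 large model, for testing (strength: a GPT-4 alternative). Supports high-speed streaming output, multi-turn conversations, zero-configuration deployment, and multiple tokens.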
rag-gpt
RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information re...
bigcodebench
BigCodeBench: Benchmarking Code Generation Towards AGI
cherry-studio
🍒 Cherry Studio is a desktop client that supports multiple LLM providers
obsidian-tars
Obsidian plugin that supports text generation based on tag suggestions, using services like Claude, OpenAI, Ollama, Kimi, Doubao, Qwen, Zhipu, DeepSeek, QianFan & more.