genkit
genkit copied to clipboard
Can Genkit Support These Functionalities?
Multi-Model API Routing Module Investigate mainstream large model API interface specifications (e.g., OpenAI ChatCompletion, DeepSeek-R1/V3 interfaces, Gemini RESTful API) to design a unified calling layer that supports dynamic switching and load balancing. Evaluate model switching costs (response latency, billing strategies), prioritizing Tencent Cloud DeepSeek API (supports web search and RAG enhancement) and the Llama3 open-source model (local deployment for cost reduction).