julep
julep copied to clipboard
(Maybe?) Add a semantic cache
use https://huggingface.co/nvidia/dragon-multiturn-context-encoder for semantic cache embeddings
use https://huggingface.co/nvidia/dragon-multiturn-context-encoder for semantic cache embeddings