llm-inference topic
OpenLLM
Run any open-source LLM, such as Llama 3.1 or Gemma, as an OpenAI-compatible API endpoint in the cloud.
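Because the endpoint is OpenAI-compatible, clients talk to it with the standard chat-completions request shape. A minimal sketch using only the standard library; the base URL, port, and model name are assumptions, not OpenLLM defaults:

```python
import json
import urllib.request

def build_chat_request(base_url: str, model: str, prompt: str) -> urllib.request.Request:
    """Build a POST request against an OpenAI-compatible /chat/completions route."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Hypothetical local endpoint; the actual host/port depend on how the server is launched.
req = build_chat_request("http://localhost:3000/v1", "llama3.1", "Hello!")
```

Sending the request with `urllib.request.urlopen(req)` would return the usual `choices[0].message.content` JSON shape.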
autogen
A programming framework for agentic AI 🤖 (PyPI: autogen-agentchat)
mistral-inference
Official inference library for Mistral models
LLM.swift
LLM.swift is a simple, readable library for interacting with large language models locally on macOS, iOS, watchOS, tvOS, and visionOS.
spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
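The core idea behind cascade token pruning is that tokens with low cumulative attention importance are dropped, and once dropped they stay dropped in all subsequent layers. An illustrative software sketch of that selection step (not the HPCA'21 hardware design; the scores and keep ratio are made up):

```python
def prune_tokens(importance, keep_ratio):
    """Return the sorted indices of the top tokens by cumulative attention importance.

    Illustrative sketch of cascade token pruning: tokens removed here are
    excluded from every later layer (hence "cascade").
    """
    k = max(1, int(len(importance) * keep_ratio))
    ranked = sorted(range(len(importance)), key=lambda i: importance[i], reverse=True)
    return sorted(ranked[:k])

# Hypothetical cumulative attention scores, one per token.
scores = [0.9, 0.1, 0.4, 0.05, 0.7]
kept = prune_tokens(scores, keep_ratio=0.6)  # keep the top 60% of tokens
```

Head pruning follows the same top-k pattern, ranking attention heads instead of tokens.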
friendli-client
Friendli: the fastest serving engine for generative AI
llm-vscode-inference-server
An endpoint server for efficiently serving quantized open-source LLMs for code.
tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
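The idea behind tree prompting is to route an input through a decision tree whose internal nodes are yes/no prompts answered by a model. A toy sketch of that routing; the `mentions_*` stubs stand in for real LLM calls, and the library's actual scikit-learn-style fit/predict API is not reproduced here:

```python
def make_node(ask, yes, no):
    """Internal tree node: route the input by the prompt's boolean answer."""
    return lambda text: yes(text) if ask(text) else no(text)

def leaf(label):
    """Leaf node: always return a fixed label."""
    return lambda text: label

# Stub yes/no "prompts" (assumptions; a real tree would query a model).
mentions_error = lambda text: "error" in text.lower()
mentions_slow = lambda text: "slow" in text.lower()

classify = make_node(
    mentions_error,
    yes=leaf("bug report"),
    no=make_node(mentions_slow, yes=leaf("performance issue"), no=leaf("other")),
)
```

Fitting such a tree amounts to choosing which prompt to place at each node, which is where the scikit-learn-style interface comes in.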