tokenizer topic

List tokenizer repositories

basic-dignified

18
Stars
1
Forks
Watchers

Program classic system's 8-bit Basic on a modern environment with modern tools and paradigms (MSX and CoCo modules included).

openai-function-tokens

17
Stars
1
Forks
Watchers

Predict the exact openai token usage of functions

openshield

83
Stars
10
Forks
83
Watchers

OpenShield is a new generation security layer for AI models

taibun

23
Stars
1
Forks
Watchers

Taiwanese Hokkien Transliterator and Tokeniser

rwkv-tokenizer

35
Stars
2
Forks
Watchers

A fast RWKV Tokenizer written in Rust

BeatLearning

29
Stars
2
Forks
Watchers

Open Source Generative AI Models for Automatic Rhythm Game Beatmap Generation (for acoustic people)

xcodec

95
Stars
3
Forks
Watchers

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

HuggingFace

15
Stars
1
Forks
Watchers

Generated C# SDK based on official HuggingFace OpenAPI specification

llm_utils

33
Stars
1
Forks
Watchers

llm_utils: Basic LLM tools, best practices, and minimal abstraction.

ktoken

17
Stars
0
Forks
Watchers

Kotlin multiplatform BPE tokenizer library for OpenAI models