tokenizer topic
gpt3-tokenizer
Isomorphic JavaScript/TypeScript tokenizer for OpenAI's GPT-3 and Codex models.
tiktoken-rs
Ready-made tokenizer library for working with GPT and tiktoken
SharpToken
SharpToken is a C# library for tokenizing natural language text. It's based on the tiktoken Python library and designed to be fast and accurate.
Cledev.OpenAI
.NET 7 SDK for OpenAI with a Blazor Server playground
openai-tools
A collection of tools for working with OpenAI
go-gpt-3-encoder
Go BPE tokenizer (encoder and decoder) for GPT-2 and GPT-3
GPTEncoder
Swift BPE encoder/decoder for OpenAI GPT models. A programmatic interface for tokenizing text for the OpenAI ChatGPT API.
Roy_VnTokenizer
Vietnamese tokenizer (Maximum Matching and CRF)
talismane
NLP framework: sentence detector, tokeniser, POS tagger, and dependency parser