tokenizer topic
Neural-Net-Zero-to-Hero-with-Andrej
This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried to compile all lectures from the Andrej Karpathy's 💎 playlist...
character-tokenizer
A character tokenizer for Hugging Face Transformers
MambaByte
Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta
FileQL
A tool that allow you to run SQL-like query on local files instead of database files using the GitQL SDK.
ChatGPT-Token-Usage-Pre-Calculator
Perfect for anyone who needs to quickly calculate the token amount of ChatGPT in prompts for their project.
llama3-tokenizer-js
JS tokenizer for LLaMA 3 and LLaMA 3.1
AWS-LLM-SageMaker
SageMaker Ployglot based RAG opensearch
Tokenizer
Typescript and .NET implementation of BPE tokenizer for OpenAI LLMs.
JSRETK
JavaScript Reverse Engineering Toolkit (JSRETK) - Experimental tools for analyzing (minified/obfuscated) JavaScript