Explosion
floret
🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy
ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
murmurhash
💥 Cython bindings for MurmurHash2
spacy-benchmarks
💫 Runtime performance comparison of spaCy against other NLP libraries
spacy-huggingface-hub
🤗 Push your spaCy pipelines to the Hugging Face Hub
spacy-pkuseg
The pkuseg toolkit for multi-domain Chinese word segmentation
spacy-ray
☄️ Parallel and distributed training with spaCy and Ray
tokenizations
Robust and fast tokenization alignment library for Rust and Python https://tamuhey.github.io/tokenizations/