Argilla
                                            Argilla
                                        
                                    argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
spacy-wordnet
spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface
get_started_with_deep_learning_for_text_with_allennlp
Getting started with AllenNLP and PyTorch by training a tweet classifier
biome-text
Custom Natural Language Processing with big and small models 🌲🌱
adept-augmentations
A Python library aimed at dissecting and augmenting NER training data.
distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
awesome-llm-datasets
👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)
kcap17-tutorial
Material for tutorial "Hybrid techniques for knowledge-based NLP: Knowledge graphs meet machine learning and all their friends" at KCAP 2017, Austin (Texas)