data-generation topic
GRADE-RR
GRADE: Generating Animated Dynamic Environments for Robotics Research
FlexKBQA
FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering
mockingbird
Mockingbird is a mock streaming data generator
faker-cxx
C++ Faker library for generating fake (but realistic) data.
trainer
Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.
be_great
A novel approach for synthesizing tabular data using pretrained large language models
REaLTabFormer
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
bin2ml
A command line tool for extracting machine learning ready data from software binaries powered by Radare2
noisemix
NoiseMix - data generation for natural language
jazznet
jazznet dataset of piano patterns for music audio machine learning research