data-generation topic

List data-generation repositories

GRADE-RR

38
Stars
5
Forks
Watchers

GRADE: Generating Animated Dynamic Environments for Robotics Research

FlexKBQA

70
Stars
4
Forks
Watchers

FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering

mockingbird

88
Stars
10
Forks
Watchers

Mockingbird is a mock streaming data generator

faker-cxx

212
Stars
88
Forks
Watchers

C++ Faker library for generating fake (but realistic) data.

trainer

29
Stars
7
Forks
Watchers

Simple interface to synthesize complex and highly dimensional datasets using Gretel APIs.

be_great

240
Stars
37
Forks
Watchers

A novel approach for synthesizing tabular data using pretrained large language models

REaLTabFormer

187
Stars
23
Forks
Watchers

A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.

bin2ml

55
Stars
2
Forks
Watchers

A command line tool for extracting machine learning ready data from software binaries powered by Radare2

noisemix

41
Stars
7
Forks
Watchers

NoiseMix - data generation for natural language

jazznet

59
Stars
0
Forks
Watchers

jazznet dataset of piano patterns for music audio machine learning research