data-generation topic

List data-generation repositories

CodeMixed-Text-Generator

48 Stars, 12 Forks

This tool automates the generation of grammatically valid synthetic code-mixed data using linguistic theories such as Equivalence Constraint Theory and Matrix Language Theory.

ProGen

20 Stars, 0 Forks

[EMNLP-2022 Findings] Code for the paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.

GSR-Net

16 Stars, 0 Forks

Graph Super-Resolution Network using geometric deep learning.

SymGen

16 Stars, 1 Fork

[EMNLP'23] Code for the paper "Generating Data for Symbolic Language with Large Language Models".

Gen4Gen

96 Stars, 5 Forks

🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"

seed_factory

27 Stars, 0 Forks

A toolkit for test data generation

SynTable

15 Stars, 0 Forks

The official code implementation for "SynTable: A Synthetic Data Generation Pipeline for Unseen Object Amodal Instance Segmentation of Cluttered Tabletop Scenes".

GraphGen

571 Stars, 44 Forks, 571 Watchers

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation