ocr-vqgan topic

List ocr-vqgan repositories
trafficstars

ocr-vqgan

72
Stars
1
Forks
Watchers

OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGA...

figure-diffusion

37
Stars
4
Forks
37
Watchers

Generating figures from research papers, using textual captions from the paper.