ocr-vqgan topic
List
ocr-vqgan repositories
trafficstars
ocr-vqgan
72
Stars
1
Forks
Watchers
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGA...
figure-diffusion
37
Stars
4
Forks
37
Watchers
Generating figures from research papers, using textual captions from the paper.