crnn.pytorch

add gen_image, img2lmdb

Open ZhangXinNan opened this issue 7 years ago • 5 comments

  • add gen_image.py, which generates images for training
  • create_dataset.py is taken from crnn
  • create_dataset_main.py converts images to lmdb

ZhangXinNan, Jul 29 '17 06:07

Training examples mostly come from standard datasets, and there are various excellent text image synthesis engines. So I don't think we should have an image generator here.

Also, the images -> lmdb conversion comes from the original repository, so there is no need for a duplicate.

Thanks for your PR.

meijieru, Jul 31 '17 08:07

@ZhangXinNan have you tested whether the generated Chinese characters can be saved to lmdb? Currently some of us are hitting the error "TypeError: Won't implicitly convert Unicode to bytes; use .encode()".

jxlijunhao, Oct 20 '17 02:10

@jxlijunhao I ran into this problem too. It turned out to be caused by the Python version; switching to Python 2.7 fixed it.
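For anyone who has to stay on Python 3, a possible alternative is to do what the error message suggests and encode every key and value to bytes before writing. This is only a minimal sketch, not the code from the PR: `to_bytes`, `encode_cache` and `write_cache` are hypothetical names, and `env` is assumed to be an environment opened by the caller with `lmdb.open(...)`.

```python
def to_bytes(value, encoding="utf-8"):
    """Return value as bytes; str is encoded, bytes pass through unchanged."""
    if isinstance(value, bytes):
        return value
    if isinstance(value, str):
        return value.encode(encoding)
    raise TypeError("expected str or bytes, got %s" % type(value).__name__)


def encode_cache(cache):
    """Convert every key and value of a dict to bytes (hypothetical helper)."""
    return {to_bytes(k): to_bytes(v) for k, v in cache.items()}


def write_cache(env, cache):
    """Write a dict into lmdb.

    Assumption: env is an lmdb.Environment, e.g. from
    lmdb.open(output_path, map_size=...), created by the caller.
    """
    with env.begin(write=True) as txn:
        for key, value in encode_cache(cache).items():
            txn.put(key, value)
```

With this, a Chinese label such as `"中文"` is stored as its UTF-8 bytes, so it has to be decoded with `.decode("utf-8")` when it is read back.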

daoqinzi, Mar 08 '18 01:03

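@daoqinzi Is it possible to turn my own dataset, images containing regions with multiple Chinese characters, into lmdb-format files? How would I do that? Could you explain in detail?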

gittigxuy, Mar 08 '18 08:03

@gittigxuy You can first step through the code of @ZhangXinNan's gen_image and create_dataset_main; it becomes clearer once you have followed it through.

daoqinzi, Mar 09 '18 06:03
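As a rough illustration of the images -> lmdb step asked about above, the sketch below builds the key/value pairs for a custom dataset in memory. It is not taken from the PR. The key layout (`image-%09d`, `label-%09d`, `num-samples`) is an assumption based on the crnn create_dataset.py script and should be checked against the script you actually use, and `build_cache` is a hypothetical name.

```python
def build_cache(samples):
    """Build lmdb key/value pairs from (image_bytes, label) pairs.

    Assumption: keys follow the 'image-%09d' / 'label-%09d' / 'num-samples'
    layout, with indices starting at 1. Verify this against your
    create_dataset.py before relying on it.
    """
    cache = {}
    count = 0
    for image_bytes, label in samples:
        if not image_bytes:
            continue  # skip empty images instead of writing a broken record
        count += 1
        cache[("image-%09d" % count).encode("utf-8")] = image_bytes
        # Encode the label explicitly so Chinese text works on Python 3.
        cache[("label-%09d" % count).encode("utf-8")] = label.encode("utf-8")
    cache[b"num-samples"] = str(count).encode("utf-8")
    return cache
```

Each `image_bytes` would typically be the raw contents of one cropped text-line image read with `open(path, "rb").read()`, and `label` the full text in that crop, so a region with several Chinese characters is one sample with a multi-character label.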