
How do I use the SynthText dataset to train a text detection model? When I follow the docs, I get errors I cannot understand.

Open manjaro-git opened this issue 2 years ago • 3 comments

load index 24016 with error too many values to unpack (expected 2)
prepare index 809612 with error too many values to unpack (expected 2)
load index 409555 with error too many values to unpack (expected 2)
load index 711016 with error too many values to unpack (expected 2)
prepare index 361880 with error too many values to unpack (expected 2)
prepare index 333686 with error too many values to unpack (expected 2)
load index 460071 with error too many values to unpack (expected 2)
load index 7897 with error too many values to unpack (expected 2)
prepare index 665514 with error too many values to unpack (expected 2)
load index 64251 with error too many values to unpack (expected 2)
prepare index 622545 with error too many values to unpack (expected 2)
prepare index 470651 with error too many values to unpack (expected 2)
load index 177569 with error too many values to unpack (expected 2)
prepare index 305736 with error too many values to unpack (expected 2)
prepare index 750848 with error too many values to unpack (expected 2)
prepare index 272870 with error too many values to unpack (expected 2)
load index 484761 with error too many values to unpack (expected 2)
prepare index 409556 with error too many values to unpack (expected 2)
load index 333686 with error too many values to unpack (expected 2)
load index 809612 with error too many values to unpack (expected 2)
prepare index 24017 with error too many values to unpack (expected 2)
prepare index 240571 with error too many values to unpack (expected 2)
load index 665514 with error too many values to unpack (expected 2)
prepare index 711017 with error too many values to unpack (expected 2)
load index 361880 with error too many values to unpack (expected 2)
prepare index 64252 with error too many values to unpack (expected 2)
load index 470651 with error too many values to unpack (expected 2)
prepare index 460072 with error too many values to unpack (expected 2)
load index 272870 with error too many values to unpack (expected 2)
load index 305736 with error too many values to unpack (expected 2)
prepare index 7898 with error too many values to unpack (expected 2)
load index 750848 with error too many values to unpack (expected 2)
prepare index 177570 with error too many values to unpack (expected 2)

manjaro-git, Jul 14 '22 13:07

I have found that the data.lmdb format, whether provided by the docs or generated by tools/data/textdet/synthtext_converter, is not supported by the LmdbAnnFileBackend used in the official configs. According to the docs, mmocr provides a new file called reg2lmbd, but it doesn't support labels stored as a .mat file, which is what SynthText uses. So how can I convert the gt.mat of the SynthText dataset to the right format? This is very confusing!

manjaro-git, Jul 15 '22 02:07

Line 75 of LmdbAnnFileBackend is the cause of the error: filename, text = line.strip('/n').split(' '). I think the format of data.lmdb doesn't match the code. I also want to know what text is. What format is it supposed to have?
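For anyone hitting the same message, here is a minimal sketch of why that unpacking pattern raises it. It is not the mmocr code itself: the sample line is hypothetical (real data.lmdb entries may look different), the helper names are made up for illustration, and it strips '\n' rather than the '/n' quoted above. Splitting on every space returns more than two pieces whenever the part after the filename contains spaces, so the two-name unpack fails.

```python
def parse_strict(line):
    # Same shape as the quoted statement: split on every space,
    # then unpack into exactly two names.
    filename, text = line.strip('\n').split(' ')
    return filename, text


def parse_lenient(line):
    # Split on the first space only, so everything after the
    # filename stays together as a single string.
    filename, text = line.strip('\n').split(' ', 1)
    return filename, text


# Hypothetical annotation line (an assumption, not taken from a real data.lmdb).
sample = 'imgs/1.jpg {"height": 10, "width": 20}\n'

try:
    parse_strict(sample)
except ValueError as err:
    print('strict parse failed:', err)

print('lenient parse:', parse_lenient(sample))
```

Whether limiting the split is the right fix depends on what the backend expects text to contain; the sketch only shows where the error message comes from.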

manjaro-git, Jul 15 '22 03:07

Thanks for reporting the bug. It should have been fixed in #1159 and lmdb can be loaded with the provided config.

gaotongxiao, Jul 15 '22 12:07