Pretrained Models

Open avinabsaha opened this issue 4 years ago • 29 comments

@JasonBoy1 can you release pre-trained models?

avinabsaha avatar Sep 24 '20 04:09 avinabsaha

I deleted the models... Maybe I should retrain them soon.

WenjiaWang0312 avatar Sep 24 '20 17:09 WenjiaWang0312

@JasonBoy1 could you share models when they are available?

avinabsaha avatar Sep 26 '20 12:09 avinabsaha

Sure.

WenjiaWang0312 avatar Sep 27 '20 08:09 WenjiaWang0312

Just wondering if you have had a chance to build the model? Thanks in advance!

anuda avatar Oct 14 '20 18:10 anuda

Sorry, too busy recently.

WenjiaWang0312 avatar Oct 16 '20 13:10 WenjiaWang0312

I uploaded the best model weights after 42000 epochs of training on TextZoom data: https://drive.google.com/file/d/1j-g17V5kBmqS8giWNZuHe2d_GefAepCs/view?usp=sharing

I hope it is helpful.

yustiks avatar Jan 26 '21 13:01 yustiks

@yustiks thanks for sharing. Did you test the model? Does it achieve results similar to those in the paper? @JasonBoy1 could you please share your pretrained models if you have them? Thanks

dkaliroff avatar Jan 27 '21 16:01 dkaliroff

@yustiks please let us know about the performance of your trained model in comparison to the results mentioned in the paper.

avinabsaha avatar Jan 28 '21 18:01 avinabsaha

@dkaliroff @avinabsaha currently, I can't evaluate the model due to the error mentioned here: https://github.com/JasonBoy1/TextZoom/issues/28

but you can try to evaluate it yourself.

Visually, the results are good.

yustiks avatar Jan 28 '21 21:01 yustiks

@dkaliroff @avinabsaha for example, aster test prediction is:

{'accuracy': {'easy': 0.7332}, 'psnr_avg': 23.423454, 'ssim_avg': 0.866824, 'fps': 18.50144357823526}
{'accuracy': {'medium': 0.562}, 'psnr_avg': 18.642635, 'ssim_avg': 0.659524, 'fps': 16.28411871321792}
{'accuracy': {'hard': 0.3917}, 'psnr_avg': 19.63501, 'ssim_avg': 0.731286, 'fps': 17.888223484995095}

which is comparable with the paper results (Real): 'easy': 75.1%, 'medium': 56.3%, 'hard': 40.1%, 'average': 58.3%

yustiks avatar Jan 28 '21 22:01 yustiks
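To make the comparison in the comment above concrete, here is a minimal sketch that computes the per-subset gap between the reported accuracies and the paper's figures. The numbers are copied from the comment; the function name `accuracy_gaps` is hypothetical and not part of the TextZoom code.

```python
# Accuracies reported in the comment above for the shared weights,
# and the paper figures quoted in the same comment (as fractions).
reproduced = {"easy": 0.7332, "medium": 0.562, "hard": 0.3917}
paper = {"easy": 0.751, "medium": 0.563, "hard": 0.401}


def accuracy_gaps(reproduced, paper):
    """Return reproduced minus paper accuracy, in percentage points, per subset."""
    return {
        subset: round((reproduced[subset] - paper[subset]) * 100, 2)
        for subset in paper
    }


gaps = accuracy_gaps(reproduced, paper)
for subset, gap in gaps.items():
    print(f"{subset}: {gap:+.2f} percentage points")
```

On these numbers the shared weights trail the paper by roughly 1.8, 0.1, and 0.9 percentage points on the easy, medium, and hard subsets respectively.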

Thank you! I will give it a try.

anuda avatar Feb 04 '21 17:02 anuda

I have tried but I have CUDA issues. What environment did you use?

mpolvere96 avatar Feb 21 '21 13:02 mpolvere96

I have this error:

Traceback (most recent call last):
  File "main.py", line 46, in <module>
    main(config, args)
  File "main.py", line 19, in main
    Mission.train()
  File "/opt/notebooks/TextZoom/src/interfaces/super_resolution.py", line 33, in train
    model_dict = self.generator_init()
  File "/opt/notebooks/TextZoom/src/interfaces/base.py", line 149, in generator_init
    model = model.to(self.device)
  File "/root/.conda/envs/textZoom/lib/python3.6/site-packages/torch/nn/modules/module.py", line 381, in to
    return self._apply(convert)
  File "/root/.conda/envs/textZoom/lib/python3.6/site-packages/torch/nn/modules/module.py", line 187, in _apply
    module._apply(fn)
  File "/root/.conda/envs/textZoom/lib/python3.6/site-packages/torch/nn/modules/module.py", line 187, in _apply
    module._apply(fn)
  File "/root/.conda/envs/textZoom/lib/python3.6/site-packages/torch/nn/modules/module.py", line 187, in _apply
    module._apply(fn)
  File "/root/.conda/envs/textZoom/lib/python3.6/site-packages/torch/nn/modules/rnn.py", line 117, in _apply
    self.flatten_parameters()
  File "/root/.conda/envs/textZoom/lib/python3.6/site-packages/torch/nn/modules/rnn.py", line 113, in flatten_parameters
    self.batch_first, bool(self.bidirectional))
RuntimeError: cuDNN error: CUDNN_STATUS_EXECUTION_FAILED

mpolvere96 avatar Feb 21 '21 13:02 mpolvere96

It seems there is a problem with the CUDA drivers.

yustiks avatar Feb 21 '21 14:02 yustiks

Thank you! Which version of cuda and torch did you use?

mpolvere96 avatar Feb 21 '21 15:02 mpolvere96
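When comparing environments for a cuDNN failure like the one above, it usually helps to post the exact versions involved. The following is a small sketch, not part of the TextZoom repository, that collects them; the function name `describe_environment` is hypothetical, and it assumes only that PyTorch may or may not be installed in the current interpreter.

```python
import platform


def describe_environment():
    """Collect version details that are typically useful when debugging cuDNN errors."""
    info = {
        "python": platform.python_version(),
        "torch": None,
        "cuda": None,
        "cudnn": None,
        "gpu_available": False,
    }
    try:
        import torch
    except ImportError:
        # PyTorch is not installed here; report only the Python version.
        return info

    info["torch"] = torch.__version__
    info["cuda"] = torch.version.cuda  # None for CPU-only builds
    info["cudnn"] = torch.backends.cudnn.version()  # None if cuDNN is unavailable
    info["gpu_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

Pasting this output alongside the GPU model and driver version makes it easier for others to tell whether the PyTorch build matches the installed CUDA stack.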

Hi, bro. Thanks for your model, but the URL is not available now. Could you please provide a new link to it?

JerryLeolfl avatar Sep 07 '21 10:09 JerryLeolfl

@avinabsaha

@JasonBoy1 Thanks for your great work. Is the pretrained model available now?

JerryLeolfl avatar Sep 07 '21 10:09 JerryLeolfl

should work now

yustiks avatar Sep 07 '21 10:09 yustiks

I tried to test it on TextZoom, but the result is all noise. [image]

xianglei96 avatar Mar 10 '22 02:03 xianglei96

In this picture, what is the input, and what is the output?

yustiks avatar Mar 10 '22 02:03 yustiks

Your email has been received; I will reply as soon as possible.

xianglei96 avatar Mar 10 '22 02:03 xianglei96

[image]

xianglei96 avatar Mar 10 '22 02:03 xianglei96

Interesting. How did you use the weights to run the model?

yustiks avatar Mar 10 '22 04:03 yustiks

https://drive.google.com/file/d/1j-g17V5kBmqS8giWNZuHe2d_GefAepCs/view?usp=sharing

I used the model you provided, together with the dataset and code provided by the author. [image]

xianglei96 avatar Mar 10 '22 05:03 xianglei96

I had a weird problem. When I try to import your model, I see a problem with the size of the modules:

size mismatch for module.block1.0.weight: copying a param with shape torch.Size([64, 4, 9, 9]) from checkpoint, the shape in current model is torch.Size([64, 3, 9, 9]).
size mismatch for module.block8.1.weight: copying a param with shape torch.Size([4, 64, 9, 9]) from checkpoint, the shape in current model is torch.Size([3, 64, 9, 9]).
size mismatch for module.block8.1.bias: copying a param with shape torch.Size([4]) from checkpoint, the shape in current model is torch.Size([3]).

Have you ever encountered such a problem?

sfxjh avatar Apr 03 '22 16:04 sfxjh

Your email has been received; I will reply as soon as possible.

xianglei96 avatar Apr 03 '22 16:04 xianglei96

From what I can see, your input is just one image, but it should be a list of images. Try creating a new array, adding the image to it as an element, and then running the model.

yustiks avatar Apr 03 '22 16:04 yustiks
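A note on reading the size-mismatch messages quoted in this exchange: a PyTorch convolution weight is stored as (out_channels, in_channels, kernel_height, kernel_width), so the messages say the checkpoint's first layer expects 4 input channels and its last layer produces 4 output channels, while the current model uses 3 for both. That suggests the model was built with a different channel configuration than the one used for training (for example an extra mask channel), though that reading is an assumption and not confirmed in the thread. The sketch below lists such differences from two name-to-shape mappings; `find_shape_mismatches` is a hypothetical helper, and the shapes are copied from the error message.

```python
def find_shape_mismatches(checkpoint_shapes, model_shapes):
    """Return {name: (checkpoint_shape, model_shape)} for shared parameters whose shapes differ."""
    return {
        name: (tuple(ckpt_shape), tuple(model_shapes[name]))
        for name, ckpt_shape in checkpoint_shapes.items()
        if name in model_shapes and tuple(ckpt_shape) != tuple(model_shapes[name])
    }


# Shapes copied from the size-mismatch messages in the thread.
checkpoint_shapes = {
    "module.block1.0.weight": (64, 4, 9, 9),
    "module.block8.1.weight": (4, 64, 9, 9),
    "module.block8.1.bias": (4,),
}
model_shapes = {
    "module.block1.0.weight": (64, 3, 9, 9),
    "module.block8.1.weight": (3, 64, 9, 9),
    "module.block8.1.bias": (3,),
}

mismatches = find_shape_mismatches(checkpoint_shapes, model_shapes)
for name, (ckpt, model) in mismatches.items():
    print(f"{name}: checkpoint {ckpt} vs model {model}")
```

With PyTorch available, the two mappings could be built as `{k: tuple(v.shape) for k, v in state_dict.items()}` from the loaded checkpoint and from `model.state_dict()`, assuming the checkpoint file holds a plain state dict.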

What is your learning rate? If I set the batch size to 16, what should the learning rate be?

Everythingismetaphor avatar Sep 23 '23 14:09 Everythingismetaphor

Your email has been received; I will reply as soon as possible.

xianglei96 avatar Sep 23 '23 14:09 xianglei96