keras-io
keras-io copied to clipboard
examples/vision/image_enhancer.py
Thanks for adding the text copy! It looks like the text copy is going to need further work and proofreading. Is there someone else you'd be able to ask for help with it?
Thanks for adding the text copy! It looks like the text copy is going to need further work and proofreading. Is there someone else you'd be able to ask for help with it?
@fchollet If you can suggest an expert I can associate to make it better !
@fchollet The objective that holds this as an example is improvements in OCR engines accuracy. When direct image is passed through OCR(tesseract), it tends to read poorly, while passing through our model and then through OCR engine it leads to restore corrupt characters at significant level. So, if I can add that metrics like (character error rate/word error rate)it would be more better example. your thoughts?
@anish9 would you mind sharing a colab file of your contribution? It'd be useful.
@anish9 would you mind sharing a colab file of your contribution? It'd be useful.
@innat https://colab.research.google.com/drive/1KOEc1Zkeo9EUfeOILcBb-so9FV4v6LB3?usp=sharing
@anish9 would you mind sharing a colab file of your contribution? It'd be useful.
@innat colab.research.google.com/drive/1KOEc1Zkeo9EUfeOILcBb-so9FV4v6LB3?usp=sharing
@anish9 I couldn't successfully run your code on colab (GPU). It crashes in the middle of the first epoch. Can you please re-check the file?
@anish9 would you mind sharing a colab file of your contribution? It'd be useful.
@innat colab.research.google.com/drive/1KOEc1Zkeo9EUfeOILcBb-so9FV4v6LB3?usp=sharing
@anish9 I couldn't successfully run your code on colab (GPU). It crashes in the middle of the first epoch. Can you please re-check the file?
@innat, There was an issue with cache
in TF data pipeline which caused the RAM crash in colab, please remove the cache
check again : https://colab.research.google.com/drive/1KOEc1Zkeo9EUfeOILcBb-so9FV4v6LB3?usp=sharing
@innat please review the updated changes
@anish9 I tried to run the colab on GPU. But for it takes too much time to complete per epoch (~500s). Is it expected?
@anish9 I tried to run the colab on GPU. But for it takes too much time to complete per epoch (~500s). Is it expected?
@innat it varies based on GPU hardware, yeah in colab it's taking long ...
@anish9 Thanks for making this nice tutorial on such a challenging problem. I can relate. It would be a nice starter for the interested practitioners in the OCR domain. And thanks for giving me the opportunity to share some of our opinions regarding your submission. I believe it becomes much better in a sense by following most of the contribution guidelines. 👍
@fchollet could you please take look into this submission now?
@anish9 Thanks for making this nice tutorial on such a challenging problem. I can relate. It would be a nice starter for the interested practitioners in the OCR domain. And thanks for giving me the opportunity to share some of our opinions regarding your submission. I believe it becomes much better in a sense by following most of the contribution guidelines. +1
@innat, Thanks for spending your time on it, your feedbacks were helpful!
@fchollet, please review this example
@anish9 there is a batch of outstanding comments from the last round of reviews that still need to be addressed, hence why I haven't reviewed yet. Please take a look. In addition, the text copy would still need significant improvements -- can you find someone who could help you on that front?
@fchollet I would need someones help since i am relatively new to this,
@anish9 Let me know if you need any assistance here.
@anish9 Let me know if you need any assistance here.
@innat, yes it would be nice if you can help me in the docs.
@anish9 (cc. @fchollet )
I think you should not depend on pytesseract
and fastwer
to compute the word error rate. Instead, you should implement a similar thing in tensorflow. Some pointer. Otherwise, it might be a bad UX to restart the kernel on colab for these packages' installation and non-agnostic path dependencies of pytesseract in all OS. Please consider fixing this. I will send a colab file with the required modification then.
@innat I ain't sure about an alternative to tesseract, especially considering their performance for document OCR, the novel objective was not about performance but driving exposure on how the model is useful for users in one of the real-world use-case and definitely this is not the only use case.
You're right. But if I remember correctly, it's required to restart the kernel after installing these packages to load properly, which I think is not pleasant (bad UX) and may not follow the design guideline.
Also, I was not suggesting finding the equivalent of tesseract
in tensorflow/keras but the relevant functionality. You're working on this, so if you think some component (metric/loss/layer) is missing regarding OCR and very impactful in practice, I would highly encourage you to raise a ticket on GitHub for that (for example. in keras-cv
).
What do you think?
@innat yes, will be valuable for the community, we can raise a ticket!
@anish9 Thanks again for this PR. It looks like it's been more than a year since we've heard from you. Are you still working on this? Please let us know. Otherwise we'll close the request. Thanks!