Add PyTorch demo
This PR aims to add a PyTorch demo equivalent to the TensorFlow one. See https://github.com/mindee/doctr/issues/658.
Question
Should we also update something for the live demo? https://github.com/mindee/doctr#live-demo
Codecov Report
Merging #1008 (b93328e) into main (7c73cdf) will decrease coverage by 0.01%. The diff coverage is n/a.
```diff
@@            Coverage Diff             @@
##             main    #1008      +/-   ##
==========================================
- Coverage   94.93%   94.91%   -0.02%
==========================================
  Files         135      135
  Lines        5590     5590
==========================================
- Hits         5307     5306       -1
- Misses        283      284       +1
```
| Flag | Coverage Δ | |
|---|---|---|
| unittests | 94.91% <ø> (-0.02%) | :arrow_down: |
Flags with carried forward coverage won't be shown.
| Impacted Files | Coverage Δ | |
|---|---|---|
| doctr/transforms/functional/base.py | 95.65% <0.00%> (-1.45%) | :arrow_down: |
Hello everyone :wave:
I agree that having two different apps isn't necessarily the best option. Apart from the training scripts (we could actually do the same there, but given the user group, it's not a big deal), everything in docTR is handled with env variables to select the backend.
I would argue it's one of the core strengths of the project: the user doesn't see the difference but under the hood, a lot is done differently. I thought about it a month ago and I figured that the model loading and preprocessing+inference+postprocessing should be conditional on the backend. My suggestion is either:
- keep a single script, with conditional statements where relevant to perform the operations (sketched below)
- move the DL-backend-dependent ops to another module, and call it from `app.py`
The second might sound better, but having a single file for a Streamlit/Gradio app is a real advantage. However, this is only my opinion :man_shrugging: (the good news is that we can reuse everything you've done here @odulcy-mindee if we go for either of my suggestions)
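To make the first option concrete, here is a minimal sketch of a backend-conditional `app.py`, assuming the `USE_TF`/`USE_TORCH` env variables discussed in this thread are set before docTR is imported; the helper body is illustrative, not the actual demo code:

```python
# Minimal sketch of option 1: a single app.py with conditional statements.
# Assumes USE_TF / USE_TORCH are set in the environment before doctr is
# imported; the helper body below is illustrative, not the actual demo code.
import os

if os.environ.get("USE_TORCH") == "1":
    import torch

    def forward_image(predictor, image):
        # PyTorch path: run inference with gradient tracking disabled
        with torch.no_grad():
            return predictor([image])
else:
    def forward_image(predictor, image):
        # TensorFlow path (the default backend)
        return predictor([image])
```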
@odulcy-mindee @frgfm I think the main problem to solve is finding a good way to handle the different framework-specific dependencies (if we don't want to wait until the ONNX part is done); then we could go in a similar direction to what I had in mind for #663.
@frgfm @felixdittrich92 thanks for your review! Also thanks for linking the closed PR @felixdittrich92, I hadn't seen it before.
I agree with you and the philosophy of the repo: it would be better to put everything in one script named `app.py`. I'll do a small code refactoring of this demo app.
I'll try to add a drop-down menu to select the backend, but I'm not sure how it'll behave with Streamlit (a RAM increase, for instance). If it doesn't work, the user will have to set the env variables on the command line, as suggested.
I tried to add a drop-down menu to select TensorFlow or PyTorch, but it turned out to be a bit ugly to re-import functions from the selected backend at runtime.
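For the record, the abandoned runtime-switching attempt looked roughly like this (names and flow are illustrative); since docTR picks its backend from env variables at import time, a widget has to mutate the environment and reload already-imported modules:

```python
# Rough sketch of the abandoned drop-down approach (illustrative only).
# docTR reads USE_TF / USE_TORCH at import time, so switching from a widget
# means reloading modules, which is brittle and keeps both frameworks in RAM.
import importlib
import os

import streamlit as st

backend = st.sidebar.selectbox("Backend", ["TensorFlow", "PyTorch"])
os.environ["USE_TF"] = "1" if backend == "TensorFlow" else "0"
os.environ["USE_TORCH"] = "1" if backend == "PyTorch" else "0"

import doctr  # noqa: E402

importlib.reload(doctr)  # re-evaluate backend detection -- ugly and costly
```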
Without any specification, it will use the TensorFlow backend. You can also select the backend using an environment variable:

```shell
USE_TF=1 streamlit run demo/app.py
```

or

```shell
USE_TORCH=1 streamlit run demo/app.py
```
I also added a quick mention of which backend is in use in the sidebar.
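Roughly like this, assuming docTR's backend flag in `doctr.file_utils` (the exact widget call is an illustrative choice):

```python
# Illustrative sidebar note; assumes doctr.file_utils.is_torch_available
# reflects the backend selected via USE_TF / USE_TORCH.
import streamlit as st
from doctr.file_utils import is_torch_available

st.sidebar.write(f"Backend: {'PyTorch' if is_torch_available() else 'TensorFlow'}")
```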
Hi everyone :wave:
I agree that perhaps we should wait a bit to switch the demo to ONNX. I've done it for other projects and for inference it makes way more sense and deletes a LOT of dependencies, making the minimal environment considerably lighter!
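To make that concrete, here is a hedged sketch of an ONNX-only inference path with onnxruntime; the model path, input shape, and preprocessing are placeholders:

```python
# Hypothetical ONNX-only inference path: no TensorFlow or PyTorch needed,
# only onnxruntime and numpy. "model.onnx" and the input shape are placeholders.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
input_name = session.get_inputs()[0].name

# dummy preprocessed page: batch of one 1024x1024 RGB image
page = np.zeros((1, 1024, 1024, 3), dtype=np.float32)
outputs = session.run(None, {input_name: page})
```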
Otherwise, going for the double backend:
- on dev side, changing the env variable makes sense
- on HF spaces, that won't be possible though.
So either we:
- go for separate apps (that complicates maintenance, but it can be temporary)
- or we try to plan for a full ONNX + no PT/TF env (that means docTR itself won't be loaded)
Since the demo is a good way to quickly try out a model, I'd say that if we cannot afford to do ONNX right away, we need a way to use PyTorch. So should we go for env vars that select the backend (my favorite option), or separate apps?
My favorite way would definitely be ONNX, but I think that's still a bit too far away at the moment. I'm not sure that changing the environment variable at runtime is really that problematic (HF Space); maybe we should try it!? (If it works, I think it would be the best option for the moment.)
Hi @odulcy-mindee :wave: any updates? :) Otherwise, let's convert this to a draft.
Hello @felixdittrich92 @frgfm,
I think we can keep this implementation as it is: it has the benefit of showing how to run inference with both TensorFlow and PyTorch, thanks to the `load_predictor` and `forward_image` functions.
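For reference, a hedged sketch of what those two helpers could look like; the signatures and bodies here are illustrative, not the actual PR code (`ocr_predictor` is docTR's public predictor factory):

```python
# Illustrative shapes of the two backend-dependent helpers; the real demo
# code may differ. ocr_predictor works the same on either backend.
import numpy as np
from doctr.models import ocr_predictor

def load_predictor(det_arch: str, reco_arch: str):
    # Build an end-to-end OCR predictor for the selected architectures
    return ocr_predictor(det_arch, reco_arch, pretrained=True)

def forward_image(predictor, image: np.ndarray):
    # Run a single page through the predictor and return the document result
    return predictor([image])
```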
On Hugging Face, we won't have the ability to change the backend, but I don't think that's its purpose: it's more of a toy to play with and a way to see a concrete application of our demo app.
I'll rebase this branch on main to check that everything is up-to-date.
Let's do another PR for ONNX when it's ready!
Hi @odulcy-mindee :wave:,
So I think that the deployment as an HF Space has a much greater benefit/reach than being able to test the demo locally. But yes, for the moment I agree, until we are ONNX-ready (should be possible in 0.6.0). @frgfm wdyt?
@odulcy-mindee Can you resolve the conflicts please? :)
@felixdittrich92 I rebased on main, it should be good. Concerning the Hugging Face Space, I don't know how to update the demo :confused: We can swap the script on HF with the TF demo so we keep the same behavior as before.
I cannot help without access, so you should ask @charlesmindee or @mehdimindee (https://huggingface.co/mindee) :+1: