Text-Image-Augmentation
Text-Image-Augmentation copied to clipboard
About Joint Training
Thanks for your work, but I am wondering how to joint training, which is mentioned in the paper (Algorithm 1 Joint Learning Scheme).
I have two related questions as well:
- Is the "moving state" S the deltas between src_pts and dst_pts
(src_pts - dst_pts)
which contains2(N+1)
points? - Is the loss function for joint learning cross-entropy loss? If so, what is the representation of GT for this loss function? Any explanation/idea/suggestion/discussion is welcomed :)
@DongfeiJi This version is the default random augmentation. We guess it is sufficient for practical use.
@yanfjz
- The "moving state" denotes the moving directions.
- Yes, we use cross-entropy loss. The GT is the moving state that increases difficulty.
@Canjie-Luo
I read the paper Learn to Augment: Joint Data Augmentation and Network Optimization
for Text Recognition.
and I'm so interested in joint training (Algorithm 1 Joint Learning Scheme),can I ask for the whole code of paper?There is something I can't understand and I want to retrain it .I'll appreciate it very much .Here's my email [email protected]
@Canjie-Luo
I have some questions about the paper.
- Output size of the agent network is 2x(N+1)x2x2. Isn't just two coordinates enough to predict the direction of each point?
- In Algorithm 1, randomly select one point in S and switch to the opposite direction to make S prime. Does this mean choosing one moving state in a mini-batch?
I read the paper Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition. and I'm so interested in joint training (Algorithm 1 Joint Learning Scheme),can I ask for the whole code of paper? my email: [email protected]
@shubham303 Have you got the code?
I would be glad to get joint training code as well. my email: [email protected]
Thanks in advance!
@shubham303
I would really appreciate if you can share that code with me!
my email: [email protected]
Thanks :)
@matiascoronados I don't have the code.
Hey guys, due to the intellectual property protocol, I cannot release the code.