doctr
[transforms] Extend the list of supported data augmentations
As discussed in #654, the artefact detection needs to improve its robustness. In order to do so and prevent overfitting, I would suggest gradually extending the list of our supported transformations:
- [x] uniform photometric transformations (noise, blur, etc.) #758 #654 #768 #771
- [x] non-uniform photometric transformations (shadows, etc.) #811
- [ ] geometric transformations (flips, rotations, crops, etc.) #773 #799
Ideally, a given transform should be implemented in `doctr/transforms/modules` so that, with the corresponding backend, we can do:

    transfo = ....
    pil_img = ...
    augmented_img = transfo(pil_img)

for transformations that only change the image, and:

    transfo = ....
    pil_img = ...
    target = {...}
    augmented_img, augmented_target = transfo(pil_img, target)

for transformations that alter the target. That way, they will work nicely with our Dataloaders :+1:
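
To make the two call conventions concrete, here is a minimal sketch, not doctr's actual implementation. The class names `InvertColors` and `RandomHorizontalFlip`, the `"boxes"` target key, and the relative `(xmin, ymin, xmax, ymax)` box format are all assumptions made for illustration. The image is a nested list of pixel rows standing in for a PIL image so that the snippet stays dependency-free.

```python
import random
from typing import Any, Dict, List, Optional

# Stand-in for a PIL image: rows of 8-bit grayscale pixel values (assumption).
Image = List[List[int]]
Target = Dict[str, Any]


class InvertColors:
    """Hypothetical photometric transform: only the image changes."""

    def __call__(self, img: Image) -> Image:
        return [[255 - px for px in row] for row in img]


class RandomHorizontalFlip:
    """Hypothetical geometric transform: image and target change together."""

    def __init__(self, p: float = 0.5) -> None:
        self.p = p  # probability of applying the flip

    def __call__(self, img: Image, target: Optional[Target] = None):
        flip = random.random() < self.p
        out_img = [row[::-1] if flip else row[:] for row in img]
        if target is None:
            return out_img
        out_target = dict(target)
        if flip:
            # Mirror relative x coordinates; xmin and xmax swap roles.
            out_target["boxes"] = [
                (1 - xmax, ymin, 1 - xmin, ymax)
                for xmin, ymin, xmax, ymax in target["boxes"]
            ]
        return out_img, out_target


img = [[0, 10, 20], [30, 40, 50]]
target = {"boxes": [(0.1, 0.2, 0.4, 0.6)]}

augmented_img = InvertColors()(img)
augmented_img, augmented_target = RandomHorizontalFlip(p=1.0)(img, target)
```

Returning an `(image, target)` tuple only when a target is passed keeps image-only pipelines unchanged, while a data loader can call every target-aware transform the same way.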
@frgfm is only @SiddhantBahuguna's PR left, or do you have other augmentations in mind? :)
For now, I think the random perspective would be the last one required yeah :+1:
Hi @SiddhantBahuguna, any chance you could finish your draft PR in the near future? :) Or should we leave it free for someone else to take?
Greetings @felixdittrich92, I am really sorry for the delay. I will definitely get it done this week :) Sorry again!
Sounds nice :hugs: :+1:
@SiddhantBahuguna Do you think you could finish your PR so that we can close this? :)