pix2pix icon indicating copy to clipboard operation
pix2pix copied to clipboard

OT: Adding a Spatial Transformer Layer(s)?

Open Quasimondo opened this issue 8 years ago • 1 comments

Sorry if this is a bit off-topic but this seems to be the best place to ask:

Do you think that adding one or more spatial transformer layers to the pix2pix architecture could increase the spatial range of the model and reduce "mosaic" artifacts? Right now it seems that there is a certain limit on how far the network can "see" around each pixel - so if the structures in a pair are very different it will not learn something meaningful. If I understand STLs correctly they allow to add moving, scaling and rotation of the input so I would imagine that this could help extending the range or finding the related structures in the paired image.

Quasimondo avatar Jan 25 '17 09:01 Quasimondo

Interesting idea. If I understood you correctly, you mean adding a STL on top of the images before feeding them to the discriminator, right? This essentially introduces (learned) data augmentation to the training, which could potentially improve the results. Please let us know your findings if you decide to give it a try.

tinghuiz avatar Jan 25 '17 22:01 tinghuiz