SFM icon indicating copy to clipboard operation
SFM copied to clipboard

Artifacts for rgb images

Open JackeyDeng opened this issue 3 years ago • 4 comments

as shown below, left image is a rgb img input, right image is the result of channel-wise frequency mask. There are many artifacts, but this is plausible because different channel have different frequency, so how to remove those artifacts or can we just train with this kind of results.

JackeyDeng avatar Jun 02 '21 09:06 JackeyDeng

2021-06-02 16-32-40 的屏幕截图

JackeyDeng avatar Jun 02 '21 09:06 JackeyDeng

I agree that this could be due to a chromatic aberration across spectral channels, we studied this in a previous project. But I can see the image is getting more blurry so you are effectively applying a low pass filter with the SFM mask, normally the network will be able to ignore the artifacts we got here.

majedelhelou avatar Jun 02 '21 11:06 majedelhelou

Well, But those colorful noise is due to data out of bounds, this will lead to output's DCT totally different from expected. No frequency will be masked. 2021-06-03 13-53-53 的屏幕截图 top left is input img, lower left is the masked dct of input, top right is the image recovered from masked dct, there are many data out of bounds, lower right is the dct of result img. This can result in totally different dct map, can those rgb images be used to train?

JackeyDeng avatar Jun 03 '21 07:06 JackeyDeng

It is important to keep in mind that generally a binary frequency masking is matched with a spatial convolution, but all kernels are not necessarily physically realisable, so not over [0,255].

However, do not confuse high frequency components and high frequency content. We tested different configurations with masking noise or not, and changing the type of masks etc. What matters most is erasing the image frequency content. Even if there remains noise frequency content/artifacts. And in your example you can clearly see visually that the image high frequency content was reduced (the image becomes blurry). Having said that, it can be an interesting research direction to explore non-binary SFM masks, having a smoothly decreasing mask instead of a binary one can enable you to avoid these artifacts over your [0,255] range. You can even explore applying SFM in the spatial domain to only have realistic kernels. We went for the binary option for theoretical reasons, and also to have one simple solution across methods.

majedelhelou avatar Jun 05 '21 09:06 majedelhelou