3DDFA_V2 icon indicating copy to clipboard operation
3DDFA_V2 copied to clipboard

what makes such lightweight backbone works so well?

Open makpia opened this issue 4 years ago • 11 comments

compared to the previous version of your work, 3ddfa, 3ddfa_v2's structure is much simpler, but achieves better results. so i wonder if the meta-joint loss is the reason that enable mobilenet to outperform previous works. i would like to know your opinion on applying these methods(look ahead, combine different losses) to solving other tasks.

makpia avatar Oct 13 '20 05:10 makpia

If it's not for the data, it can even be done more finely, and this method is the best solution so far.

chengfeng0113 avatar Oct 13 '20 06:10 chengfeng0113

If it's not for the data, it can even be done more finely, and this method is the best solution so far.

so,how we can generate high quality data for better result?fix the landmarks error in 300w data(like eyes) or rebuild a 3dmm model to replace bfm model?

wjzlwz avatar Oct 21 '20 09:10 wjzlwz

If it's not for the data, it can even be done more finely, and this method is the best solution so far.

so,how we can generate high quality data for better result?fix the landmarks error in 300w data(like eyes) or rebuild a 3dmm model to replace bfm model?

You can improve from the following directions:

  1. Use better face shape models and expression models, such as BFM2019, but you need to find the keypoints of the face from many vertices. I use the BFM2009 model.
  2. Use better 2D keypoint models.
  3. Use a better 3DMM model, I use the 3DDFA model.
  4. Consider the face data balance of various Euler angles when making the face profile.
  5. To solve the problem of poor performance with eyes open and closed, you can consider using StyleGAN2 and InterFacegan to synthesize face data with eyes closed.

chengfeng0113 avatar Oct 26 '20 06:10 chengfeng0113

I used resnet50 instead of mobilenet_V1, but strangely, the results were worse. Why is such a simple model better?

lu17 avatar Dec 17 '20 02:12 lu17

@chengfeng0113 what is BFM model? I am newly attached to this field. And How should I do to systhesis short video like 3ddfa_v2 paper?

shoutOutYangJie avatar Dec 17 '20 13:12 shoutOutYangJie

@chengfeng0113 what is BFM model? I am newly attached to this field. And How should I do to systhesis short video like 3ddfa_v2 paper? Regarding the BFM model, you can refer to BFM. Oh ha, I also want to know how to synthesize short videos online, I can only use FaceProfiling to synthesize faces with continuous changes in Euler angles offline.

chengfeng0113 avatar Dec 18 '20 01:12 chengfeng0113

@chengfeng0113 so after face profiling, the image boundary appears "zero" pixel ? I notice that in 3ddfaV1, face profiling will make "zero" pixel at image boundary. But in 3ddfav2, as author shows, there is no "zero" pixel.

shoutOutYangJie avatar Dec 18 '20 02:12 shoutOutYangJie

@chengfeng0113 and do you know what is "tri"? I am not familiar with this field. thank you.

shoutOutYangJie avatar Dec 18 '20 02:12 shoutOutYangJie

do you know how to generate the 3D render picture with color? thanks. @chengfeng0113

ghost avatar Dec 31 '20 05:12 ghost

i have try resnet-22 for reconstruction, but it's worse than your gif which put on the github? especially the mouth always open. can you tell me why? if you did something different from you public data?

liyuanyaun avatar Sep 17 '21 01:09 liyuanyaun

i have try resnet-22 for reconstruction, but it's worse than your gif which put on the github? especially the mouth always open. can you tell me why? if you did something different from you public data?

@cleardusk

liyuanyaun avatar Sep 26 '21 07:09 liyuanyaun