YuanxunLu comments

Results 26 comments of


                                            YuanxunLu

about face tracking

Of course the latter one. LSTM is a kind of RNN network, and it should take sequential data as input. 240 frames equal to 4 seconds under the 60 FPS...

about face tracking

First, LSTM takes sequential data as input and its output is also sequential, therefore T frames input results in T frames output. Please carefully check the definition of LSTM networks....

about face tracking

Any parametric monocular face reconstruction method would be an alternative, like FaceScape, DECA, 3DDFA_v2, etc.

Hello, I have two questions. Can you help me answer them （3D face tracking and headpose）

3D facial tracking is used to obtain 3D facial landmarks and head poses. Deca is an alternative option to get these parameters. In fact, any parametric 3D facial reconstruction method...

Hello, I have two questions. Can you help me answer them （3D face tracking and headpose）

id & scale parameters are results of 3D facial tracking. To get 3D shoulder points, we first detect 2D shoulder points using LK flow and reconstruct 3D shoulder points by...

Hello, I have two questions. Can you help me answer them （3D face tracking and headpose）

1. velocity head pose is just the delta speed, velocity_t = pose_t+1 - pose_t. It is an implicit supervision term. You can define it as you will. 2. The 3D...

Hello, I have two questions. Can you help me answer them （3D face tracking and headpose）

These two files are 3d facial tracking results (3D landmarks, head poses, etc.). Check the inference code and replace them with your tracking results works.

More details about the clip separation for training

There're no overlaps between the two clips. I just use one clip for training. Check issue #27.

how to transform 2d shoulder points to 3d shoulder points?

Check the code and the citing paper.

Streaming audio data inference

Smoothing on the results helps.