Getting the same hidden-state values from Wav2Vec2 for my dataset

Open ujjawalcse opened this issue 2 years ago • 2 comments

Hey @EvelynFan, I tried to train the model on my custom dataset, but Wav2Vec2 produces the same hidden-state values for every audio frame. Here is the output for reference:

torch.Size([1, 88800])
hidden_states: tensor([[[-0.0847,  0.0599, -0.0042,  ...,  0.1818,  0.0301, -0.0014],
         [-0.0847,  0.0599, -0.0042,  ...,  0.1818,  0.0301, -0.0014],
         [-0.0847,  0.0599, -0.0042,  ...,  0.1818,  0.0301, -0.0014],
         ...,
         [-0.0847,  0.0599, -0.0042,  ...,  0.1818,  0.0301, -0.0014],
         [-0.0847,  0.0599, -0.0042,  ...,  0.1818,  0.0301, -0.0014],
         [-0.0847,  0.0599, -0.0042,  ...,  0.1818,  0.0301, -0.0014]]],
       device='cuda:0')

Can you suggest a way to fix this? Thanks.

ujjawalcse avatar Sep 29 '22 13:09 ujjawalcse

I have the same question.

xiaodongyichuan avatar Mar 08 '23 09:03 xiaodongyichuan

@xiaodongyichuan @ujjawalcse Did anyone fix this problem?

Shirley-0708 avatar Oct 23 '23 13:10 Shirley-0708