PHO-LID

seq

Open whh07141 opened this issue 2 years ago • 1 comments

Hi, I want to know how to set T'_i. I have already extracted the speech representations.

whh07141 avatar Apr 08 '24 10:04 whh07141

Hi,

You can add a pre-processing step before or during training (e.g., with torch reshape) that reshapes the features from (T, feat_dim) into (T/20, 20, feat_dim), rounding T/20 down and discarding the leftover frames. For example, (101, 20) => (5, 20, 20), where the last frame is dropped.
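A minimal sketch of that reshaping in PyTorch, in case it helps. The function name `chunk_features` and the parameter `chunk_len` are made up for this illustration and are not part of the repository. The chunk length of 20 comes from the example above, and the input is assumed to be a single utterance of shape (T, feat_dim).

```python
import torch


def chunk_features(feats: torch.Tensor, chunk_len: int = 20) -> torch.Tensor:
    """Reshape (T, feat_dim) into (T // chunk_len, chunk_len, feat_dim).

    Frames that do not fill a complete chunk are discarded.
    `chunk_features` and `chunk_len` are hypothetical names for this sketch.
    """
    num_frames, feat_dim = feats.shape
    num_chunks = num_frames // chunk_len   # number of complete chunks
    usable = num_chunks * chunk_len        # frames kept after trimming
    return feats[:usable].reshape(num_chunks, chunk_len, feat_dim)


# Example matching the shapes above: 101 frames with 20-dim features.
feats = torch.randn(101, 20)
chunks = chunk_features(feats)
print(chunks.shape)  # torch.Size([5, 20, 20])
```

Trimming before the reshape is what makes it valid for any T: `reshape` needs the number of frames to be an exact multiple of the chunk length.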

Hope this helps.

Lhx94As avatar Apr 26 '24 01:04 Lhx94As