Has this framework's output been compared with other features?

Open MisakaMikoto96 opened this issue 2 years ago • 1 comments

Has this framework's output been compared with other features such as wav2vec or HuBERT?

MisakaMikoto96, Apr 10 '23 08:04

Hi,

Not really. HTS-AT is the audio transformer we propose, and in this paper we only use it for audio classification and SED tasks. We do use the HTS-AT architecture in other tasks, though, such as contrastive language-audio pretraining (CLAP). We compare this audio representation with other TF-domain SoTA models. I think it can be compared with wav2vec, although we have not run such experiments.

RetroCirce, May 01 '23 23:05