pheme
pheme copied to clipboard
Alternative to Soundstorm (S2A) model
This paper : https://arxiv.org/pdf/2401.01099.pdf , suggest better masking strategy with Grouped Acoustic Token like HiFi-Codec which results far better quality that Soundstorm.