supervoice-voicebox
Drop the unmasked tokens
Nice work! May I ask whether the 0.9 probability of dropping unmasked tokens, so that the model conditions on audio only, is important? Could you share the details of the A/B study?