UMOE-Scaling-Unified-Multimodal-LLMs
UMOE-Scaling-Unified-Multimodal-LLMs copied to clipboard
Audio Understanding for Uni-MoE v2
I found that Uni-MoE v2 is not trained on audio understanding tasks and not utilizing the BEATs audio encoder.
Is Uni-MoE v2 not designed for understanding general audio events, like natural sounds?