h310558606

Results 12 issues of h310558606

Could you tell me when will you release the track-conditional museGan? Thanks a lot!

question

When I run 'python main.py --config configs/sep_vqvae.yaml --eval' using you pretrained model, I can only see the lower half body. ![body](https://user-images.githubusercontent.com/38098690/174282240-059defa0-9e04-4029-9057-7d30b30ea3d5.png) [

现在生成的人脸可以眨眼吗?这个实现的难度大吗?要怎样实现呢?

请问训练代码什么时候可以开源呢?

基于FunASR/egs进行模型训练,得到一个适用于小语种ASR的paraformer模型,然后将此模型训练得到的的模型文件和config文件替换掉damo/speech_paraformer_asr_nat-zh-cn-16k-aishell1-vocab4234-pytorch 下的模型文件与config文件,接下来使用如下代码进行推理: from modelscope.pipelines import pipeline from modelscope.utils.constant import Tasks inference_pipeline = pipeline( task=Tasks.auto_speech_recognition, model='models_from_modelscope/damo1/speech_paraformer_asr_nat-zh-cn-16k-aishell1-vocab4234-pytorch', ) rec_result = inference_pipeline(audio_in='https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_example_zh.wav') print(rec_result) **返回结果为空值!!!** ![image](https://github.com/alibaba-damo-academy/FunASR/assets/38098690/4d40d17f-c2bf-4289-9051-f3db8c8e6310) ![05(1)](https://github.com/alibaba-damo-academy/FunASR/assets/38098690/17ed9ad2-df1c-4313-a470-be5e15f31251) ![03](https://github.com/alibaba-damo-academy/FunASR/assets/38098690/730396ed-46e6-40c4-8149-856de9464955)

示例音频听起来效果很差,噪音很大,请问最近有改善吗?

Can the eyes blink in the result video?

Could you tell me when you plan to release the source code?

When I run recognize.py, I got the following error: WARNING:root:Using legacy_rel_pos and it will be deprecated in the future. WARNING:root:Using legacy_rel_selfattn and it will be deprecated in the future. /home/heyayun/anaconda3/envs/turkic/lib/python3.9/site-packages/torch/functional.py:641:...