h310558606
h310558606
Could you tell me when will you release the track-conditional museGan? Thanks a lot!
When I run 'python main.py --config configs/sep_vqvae.yaml --eval' using you pretrained model, I can only see the lower half body. data:image/s3,"s3://crabby-images/32ee1/32ee1caf691fc55171cc522c5f89596d9a803a4c" alt="body" [
眨眼的问题
现在生成的人脸可以眨眼吗?这个实现的难度大吗?要怎样实现呢?
训练代码
请问训练代码什么时候可以开源呢?
基于FunASR/egs进行模型训练,得到一个适用于小语种ASR的paraformer模型,然后将此模型训练得到的的模型文件和config文件替换掉damo/speech_paraformer_asr_nat-zh-cn-16k-aishell1-vocab4234-pytorch 下的模型文件与config文件,接下来使用如下代码进行推理: from modelscope.pipelines import pipeline from modelscope.utils.constant import Tasks inference_pipeline = pipeline( task=Tasks.auto_speech_recognition, model='models_from_modelscope/damo1/speech_paraformer_asr_nat-zh-cn-16k-aishell1-vocab4234-pytorch', ) rec_result = inference_pipeline(audio_in='https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_example_zh.wav') print(rec_result) **返回结果为空值!!!** data:image/s3,"s3://crabby-images/03f87/03f878da1cacbef1397fc3fcb9c575d3f0242882" alt="image" data:image/s3,"s3://crabby-images/f068c/f068cee45909645cc8ec8ac62bac09b2b44d502a" alt="05(1)" data:image/s3,"s3://crabby-images/8ddc1/8ddc1a55250dc66c73f01c78ea18d869f1ac7137" alt="03"
示例音频听起来效果很差,噪音很大,请问最近有改善吗?
Can the eyes blink in the result video?
Could you tell me when you plan to release the source code?
When I run recognize.py, I got the following error: WARNING:root:Using legacy_rel_pos and it will be deprecated in the future. WARNING:root:Using legacy_rel_selfattn and it will be deprecated in the future. /home/heyayun/anaconda3/envs/turkic/lib/python3.9/site-packages/torch/functional.py:641:...