刘一为

Results 12 comments of 刘一为

What configuration commands or configurations are used to train p7-1280 and p7-1920?

``` export NO_TCMALLOC="True" ``` Set this up and it's working fine for now

I also encountered a similar problem, for the recognition of single-line text, there will be such a situation of incomplete recognition. Using yolov8x-p6

> 当前的尝试涉及重新平衡班级比例。最初,细长物体仅占所有矩形的不到 20%。目前正在调整以制作约 80% 的所有类别的细长物体:**第二个模型的结果要好得多。请参阅下面的结果。** > > ![图像](https://private-user-images.githubusercontent.com/166127086/322754218-dfc52610-cc8e-4550-8af4-dba051beef49.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTMyODQ1OTAsIm5iZiI6MTcxMzI4NDI5MCwicGF0aCI6Ii8xNjYxMjcwODYvMzIyNzU0MjE4LWRmYzUyNjEwLWNjOGUtNDU1MC04YWY0LWRiYTA1MWJlZWY0OS5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNDE2JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDQxNlQxNjE4MTBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00YWMyODVlY2FmNDg2YTZmNmJjYzNkMDQzOTRlZjNmN2YwZWY5NDk0MjU1M2UwNWVmNGEyMWU4NjhhMGJlOGFlJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.lMeMbjak1zrp9NytyMkvg9cScoKxmfWvi1iljkgp3rc) ![图像](https://private-user-images.githubusercontent.com/166127086/322753499-64615a06-07f0-4a5d-9f5a-85d9095b338d.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTMyODQ1OTAsIm5iZiI6MTcxMzI4NDI5MCwicGF0aCI6Ii8xNjYxMjcwODYvMzIyNzUzNDk5LTY0NjE1YTA2LTA3ZjAtNGE1ZC05ZjVhLTg1ZDkwOTViMzM4ZC5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNDE2JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDQxNlQxNjE4MTBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0zMWU4ODA5ZDNhMDEzM2Y3ZDhmNTY3Njk5MmIxNzYzMmI5YmUxMjdlMmE2ZDE5NTJiODI4MmExODIwNWY4NWVjJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.8N9Jggl7EhvOqv8VKp-klNtUP6LSOqXDYzLNr_apJ_8) ![图像](https://private-user-images.githubusercontent.com/166127086/322754014-69d0f3ac-4040-42f1-8cad-6311fcdafd82.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTMyODQ1OTAsIm5iZiI6MTcxMzI4NDI5MCwicGF0aCI6Ii8xNjYxMjcwODYvMzIyNzU0MDE0LTY5ZDBmM2FjLTQwNDAtNDJmMS04Y2FkLTYzMTFmY2RhZmQ4Mi5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjQwNDE2JTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI0MDQxNlQxNjE4MTBaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT1kODU2NTBjMDZlMDE0ZDRiMTYwNzgyYzUyZmM5OTMwZTJlZDhlOTM4ODcyZGE1MDBhZTNjN2NmYTA0NDhiMmE3JlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCZhY3Rvcl9pZD0wJmtleV9pZD0wJnJlcG9faWQ9MCJ9.1CgeNWPcIdxOA0Ye6e3pt0dLk2Y8cOi0cEDhJF3_-Eo) I have also tried to train using only a long data set, but the final recognition results are similar to...

# testMNNFromOnnx.py的结果 Dir exist onnx/test.onnx inputVar.name:t shapes:[1, 17] tensor(int64) inputVar.name:language shapes:[1, 17] tensor(int64) inputVar.name:x shapes:[1, 17] tensor(int64) inputVar.name:sid shapes:[1] tensor(int64) inputVar.name:bert_0 shapes:[17, 1024] tensor(float) inputVar.name:bert_1 shapes:[17, 1024] tensor(float) inputVar.name:bert_2 shapes:[17,...

> 模型中有随机算子,结果是不能对齐的 这个有什么修改建议?

> 模型中有 random 算子,结果是不能对齐的 onnx模型推理出来的pcm是一段hello world语音,但mnn推理出来的并不能正常发音

``` bool save_audio_pcm(const std::vector &audio, const std::string &filePath) { std::ofstream file(filePath, std::ios::binary); if (!file.is_open()) { // 文件打开失败 return false; } for (float sample: audio) { if (sample > 1 ||...

> save_audio_pcm 的代码是? 或者完整可跑的工程可以发一下? 这里是否有后继的进展或还需要提供什么吗?

[code.zip](https://github.com/alibaba/MNN/files/15136977/code.zip) 这个是推理的代码,USE_MNN标记控制使用onnx还是mnn,调用入口方法 ``` auto mmm = sl_t2s::t2s_model_test(); mmm.init("onnx or mnn path"); mmm.exe_onnx("output pcm path"); ``` 最终输出结果pcm可使用python转成wav来播放 ``` import numpy as np import soundfile as sf pcm_data = np.fromfile('xxx.pcm', dtype=np.float32) sf.write('../out/xxx.wav',...