Yueyue Na comments

Results 9 comments of


Yueyue Na

solve gev problem and the scaling ambiguity

Hello, Thanks for watching! My provided code on github is only supported for 2 channel BSS, the demixing matrix can be acquired analytically by solving the 2 x 2 generalized...

Could you help to explain your heig2.m computation please?

Hi, thanks for watching. Heig2.m is used to solve the 2 x 2 generalized eigendecomposition problem for 2 x 2 Hermitian matrices. It only supports 2 x 2 Hermitian matrices,...

Could you help to explain your heig2.m computation please?

Hi, original IVA cannot solve the channel order ambiguity. To solve this problem, some extra information should be involved. E.g. a, keyword spotting (KWS) system can be concatenated to the...

改变demo script中的serdb值，AuxP的结果有较大差异

你好：感谢关注！问题①：在当前的demo中，不同serdb的AuxP的结果有一些差异，请问与什么因素有关？如何改进？那个程序只是一个demo，是在时域上的操作。实际应用中需要在频域上做AEC，即使用子带分解将数据变换到频域，之后在各个频段上分别做AEC，最后在将数据变换回时域。可以尝试在频域上试一下。按照我处理过的数据的经验，性能应该是信回比越高越好。问题②：用Aux-ICA模型做AEC，在阿里系的产品中有实际的工程应用吗？抱歉涉及产品的问题我不便回答。在 2021-07-22 17:21:58，"huankiki" ***@***.***> 写道： Hi, Yueyue Na， demo script中的serdb的原始设定值为-20，AuxP的结果很好。现象①：改变serdb的值，如-15dB/-10dB等，在保证nearend和mic信号不截幅的前提下，AuxP的结果并不好，尤其是从10s处的echo path change开始。现象②：serdb=-15，改变gamma的值，gamma=0.5（原始值为0.2），在保证nearend和mic信号不截幅的前提下，AuxP的结果变好，但当serdb=-10，gamma=0.5，AuxP的结果仍然很差。问题①：在当前的demo中，不同serdb的AuxP的结果有一些差异，请问与什么因素有关？如何改进？问题②：用Aux-ICA模型做AEC，在阿里系的产品中有实际的工程应用吗？期待你的解惑，谢谢！ — You are receiving this because you are...

改变demo script中的serdb值，AuxP的结果有较大差异

你好，感谢关注。应该没有错，是-20dB。如果SER20dB相当于回声很小了，基本上不做AEC也能听清目标语音内容。在 2024-04-25 15:42:02，"王贤锐（Henry）" ***@***.***> 写道： SER是不是写错了实验中是20dB — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you commented.Message ID: ***@***.***>

实录音频的效果

你好，感谢关注！ github上那个代码是用作理论说明的，做的是时域算法，离工程实际差距较远。我们工程上用的也是同样的理论，但是是频域算法，相当于每个频段都做相似的分离操作。抱歉工程代码目前还不能开源。低频部分效果较差的话也可以试试先做时频变换，再在每个频段分别做AEC。在 2021-02-25 11:05:45，"kbsuper6" 写道：纳老师您好，我用了实录的音频跑您这份代码，貌似效果不太稳定。我试过用频域的auxIVA去做回声消除，低频部分比较差。您那边有更接近工程实际的解决方法吗？谢谢。 — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or unsubscribe.

实录音频的效果

噢，刚才没注意您已经用了频域的方法了。可以试下增长滤波器长度，因为低频回声路径较长，可能要的滤波器阶数较高。另外由于时频变换中的循环卷积误差，AEC自适应部分也不能把回声消得特别干净，可以再尝试下在自适应部分后面加上后滤波的方法。在 2021-02-25 11:05:45，"kbsuper6" 写道：纳老师您好，我用了实录的音频跑您这份代码，貌似效果不太稳定。我试过用频域的auxIVA去做回声消除，低频部分比较差。您那边有更接近工程实际的解决方法吗？谢谢。 — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or unsubscribe.

实录音频的效果

1. 实录音频，有一定的震动，说明已经产生了非线性回声，即设备震动产生了额外的回声成分，这部分成分在参考里没有，所以无法通过线性AEC消除。可以考虑后滤波的方法，或者后面加IVA，将非线性回声也当作一种噪声源处理。 2. 我只试过在交互场景中使用IVA，交互场景可以通过唤醒词来区分目标声源。通话场景没接触过，猜测可能可以通过信噪比来选择，或者引入其它监督信息，例如假定位于设备正前方的为目标声源。可以参考这个文章Nesta, Francesco, and Zbyněk Koldovský. "Supervised independent vector analysis through pilot dependent components." 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2017. 3....

实录音频的效果

不好说，非线性的情况比较复杂，需要具体音频具体分析了。在 2021-12-03 16:24:08，"zuowanbushiwo" ***@***.***> 写道：我的实测场景效果也非常的差，我使用subband 方法，用的是Cosine 调制的滤波器组。是不是如果有非线性失真，会放大失真，造成更坏的结果。 — You are receiving this because you commented. Reply to this email directly, view it on GitHub, or unsubscribe. Triage...