Respeaker 4-mic array: LED is way more accurate than SSL

Open ruofeidu opened this issue 4 years ago • 1 comments

When comparing the raw SSL data and the live results from odas_web against the LED indicator, I found that the LED indicator is far more accurate than the visualization and the raw data. Sometimes one or two potential sound sources remain in the visualization and raw data even after the sound source has stopped speaking.

So my questions are:

  1. How can I replicate the accurate algorithm of the LED indicators on the 4-mic array board?
  2. How can I best process raw SSL data to get the dominant sound source? I found that the energy can stay above 0.998 even after the sound source has stopped speaking.

ruofeidu, Nov 10 '20 07:11

Hi, I'm pretty sure the LED indicators are so accurate because they do not track sound sources; instead they display instantaneous (every 8 ms) DOA information. In my experience the tracking function is far less accurate. If you pull the JSON file (see image), which is found in the configuration file, this is the information that is fed into the LED indicators. So it is available, but it does not have any audio stream associated with it. [image]
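
For the second question, a minimal sketch of picking the dominant potential source from a single SSL frame might look like the following. It assumes each frame is a JSON object with a `src` list whose entries carry `x`, `y`, `z` and an energy `E`; the function name `dominant_source` and the `min_energy` threshold are hypothetical and would need tuning. This is not the LED firmware's algorithm, only one way to reduce a frame to a single direction.

```python
import json
import math


def dominant_source(frame_json, min_energy=0.5):
    """Return (azimuth_deg, energy) of the strongest potential source, or None.

    Assumes the frame looks like {"timeStamp": ..., "src": [{"x", "y", "z", "E"}, ...]}.
    min_energy is an illustrative threshold, not a value taken from ODAS.
    """
    frame = json.loads(frame_json)
    sources = frame.get("src", [])
    if not sources:
        return None

    # Keep only the potential with the highest energy in this frame.
    best = max(sources, key=lambda s: s["E"])
    if best["E"] < min_energy:
        return None

    # Project the unit direction vector onto the x-y plane to get an azimuth in [0, 360).
    azimuth = math.degrees(math.atan2(best["y"], best["x"])) % 360.0
    return azimuth, best["E"]


# Hand-written sample frame for illustration only (not real ODAS output).
sample = json.dumps({
    "timeStamp": 1,
    "src": [
        {"x": 0.0, "y": 1.0, "z": 0.0, "E": 0.9},
        {"x": 1.0, "y": 0.0, "z": 0.0, "E": 0.2},
    ],
})
result = dominant_source(sample)
```

Because this works frame by frame, it does not by itself address energy values that stay high after speech ends; that would need an extra check, such as voice activity detection or comparing against a running noise estimate.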

jake266637, Jan 21 '21 12:01