Sungkyun Chang
@Mihonarium I am **not** already working on it. I thought your [notebook](https://gist.github.com/Mihonarium/e3fd355cb560b82373fd2186139f1bc2) was awesome. I would appreciate your help :) If you PR a draft of an interface for a new...
@mhmd-mst Hi, I'll answer briefly. 1. It should be fine; the warning probably came from using a different batch size for inference. 2. 3 audio files as DB? It is...
1. No, the batch size only affects the input/output shape of our NN fingerprinter. It can be set depending on your machine type (GPU/CPU) and its memory. It is...
@mhmd-mst 3 audio files, and 3 fps for each file. Your fps.shape = (9, 128). So, fps[0:3] is your first file. ```python test_ids = [0, 3, 6] # start_id of...
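To make the indexing above concrete, here is a minimal sketch of splitting a `(9, 128)` fingerprint array into per-file blocks. The array is filled with random placeholder values rather than real fingerprints, and the names `n_files`, `n_per_file`, and `per_file` are made up for illustration; they are not part of the repository.

```python
import numpy as np

# Placeholder data: random values stand in for real fingerprints.
rng = np.random.default_rng(0)
n_files, n_per_file, dim = 3, 3, 128  # assumed layout: 3 files x 3 fps each
fps = rng.standard_normal((n_files * n_per_file, dim)).astype(np.float32)

# Start index of each file's block of fingerprints.
test_ids = list(range(0, n_files * n_per_file, n_per_file))

# Hypothetical mapping: file index -> that file's fingerprints.
per_file = {i: fps[start:start + n_per_file] for i, start in enumerate(test_ids)}

print(fps.shape)          # shape of the full fingerprint array
print(test_ids)           # start index per file
print(per_file[0].shape)  # fingerprints belonging to the first file
```

This only works as written if every file contributes the same number of fingerprints; with files of different lengths, the start indices would have to be stored explicitly instead of being generated with `range`.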
It depends on the dimension of the fingerprint and how you define the `same` segment. In the training process, every segment is expressed differently from all other segments, and the larger the dimension,...
@Rodrigo29Almeida Try setting SAVE_IMG to False in the config file. But I don't quite understand it, because the training should return the `sim_mtx` when calculating the loss at the first...
@Rodrigo29Almeida As for that first error (though I'm not entirely sure until I try it myself), it could come from https://github.com/mimbres/neural-audio-fp/blob/main/model/fp/specaug_chain/layers/ncutout_tarray.py While dealing with binary masks in graph-mode, we should...
This is definitely a bug. Suddenly, starting today, the prefix is being ignored, and all graphs are being grouped in the Chart section!