asusdisciple issues

Results: 23 issues of asusdisciple

Inconsistent results with same sentences

I found that your splitter model gives very inconsistent results. Take, for example, the Amharic language (lang_code="am"). If I take these two sentences from the...

Performance Bug when calculating speaker embeddings

### Tested versions Appears in 3.1.0 ### System information Ubuntu 22, Lenovo P1 Gen 5 Workstation A4500 ### Issue description So I stumbled upon something, which I found very strange....

cannot_reproduce

torchaudio.load() reference not found in PyCharm

### 🐛 Describe the bug PyCharm Professional 2023.3 is unable to locate the reference to basic functions like torchaudio.save() or torchaudio.load(). I do not know why this happens, maybe the...

Not possible to build

This can't be built at all. Packages like torchaudio==0.5.1 cannot be found anymore, and there are Cython errors. Is there any chance this will be corrected? I can't get DiffVC to run in this...

Feature Request: Use extract_embedding with a batch_size parameter

It would be nice if you could call `extract_embedding` with a `batch_size` parameter. Accepting NumPy arrays or torch tensors, instead of only file-like types, would also be nice.

enhancement

Bad sentence splitting performance on FLORES-200 Hindi language

I tested the Indic NLP package to split sentences on the Hindi file in the FLORES-200 dataset. However, the performance is really bad, with an F1 score of 0.26....

Bug in tokenizer for Tibetan Language

At the moment you can't use the Tibetan language tokenizer. It gives the error message: `TypeError: "module" object is not callable`. The error is thrown here in sentence_split.py: ``` elif split_algo...

ValueError: array is too big when loading .flac file

I get this error: ``` line 1323, in _create_empty_array return np.empty(shape, dtype, order='C') ValueError: array is too big; `arr.size * arr.dtype.itemsize` is larger than the maximum possible size. ``` All...

Due to recent developments, maybe release training components/scripts?

I know the author did not want to share the training procedure for ethical reasons. But looking at recent developments (for example, VITS), isn't that policy a bit outdated?...

Regarding IMAGENET1K_V1 and IMAGENET1K_V2 weights

### 🐛 Describe the bug I found a very strange "bug" while I was trying to find similar instances in a vector database of pictures. The model I used is...