asusdisciple

Results 20 issues of asusdisciple

I found that your splitter model gives very inconsistent results. Take, for example, the Amharic language (lang_code="am"). If I take these two sentences from the...

### Tested versions Appears in 3.1.0 ### System information Ubuntu 22, Lenovo P1 Gen 5 Workstation A4500 ### Issue description So I stumbled upon something, which I found very strange....

cannot_reproduce

### 🐛 Describe the bug PyCharm Professional 2023.3 is unable to locate the reference to basic functions like torchaudio.save() or torchaudio.load(). I do not know why this happens, maybe the...

This can't be built at all. Packages like torchaudio==0.5.1 cannot be found anymore, and there are Cython errors. Is there any chance this will be corrected? I can't get DiffVC to run in this...

It would be nice if you could call `extract_embedding` with a `batch_size` parameter. Accepting NumPy arrays or torch tensors instead of only file-like objects would also be nice.
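Until such a parameter exists, batching can be approximated on the caller's side. The following is a minimal sketch, not the library's API: `iter_batches` and `extract_embeddings_batched` are hypothetical helper names, and `embed_batch` stands in for whatever callable maps a batch of inputs to a list of embeddings.

```python
from typing import Callable, Iterator, List, Sequence, TypeVar

T = TypeVar("T")
E = TypeVar("E")


def iter_batches(items: Sequence[T], batch_size: int) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of `items`, each with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def extract_embeddings_batched(
    items: Sequence[T],
    embed_batch: Callable[[Sequence[T]], List[E]],  # hypothetical: one call per batch
    batch_size: int = 32,
) -> List[E]:
    """Run `embed_batch` over `items` in batches and concatenate the results in order."""
    results: List[E] = []
    for batch in iter_batches(items, batch_size):
        results.extend(embed_batch(batch))
    return results
```

Because the helper only slices a sequence and forwards each slice, it does not care whether the items are file paths, NumPy arrays, or tensors; that decision stays with the supplied callable.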

enhancement

I tested the Indic NLP package to split sentences on the Hindi file in the FLORES-200 dataset. However, the performance is really bad, with an F1 score of 0.26....
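For context, one common way to score a sentence splitter is boundary-level F1: compare where the predicted sentences end with where the gold sentences end. The sketch below assumes that definition, which may differ from how the 0.26 figure in the issue was computed; the function names are hypothetical, and offsets are counted on whitespace-stripped text so that spacing differences between the two segmentations do not matter.

```python
from itertools import accumulate
from typing import Iterable, Set, Tuple


def boundary_offsets(sentences: Iterable[str]) -> Set[int]:
    """Return the end offset of each sentence, counted in non-whitespace characters."""
    lengths = (len("".join(s.split())) for s in sentences)
    return set(accumulate(lengths))


def boundary_prf(gold: Iterable[str], predicted: Iterable[str]) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 over sentence-end offsets.

    Assumes both segmentations cover the same underlying text. The final
    end-of-text boundary is included, so it always counts as a match.
    """
    gold_b = boundary_offsets(gold)
    pred_b = boundary_offsets(predicted)
    true_pos = len(gold_b & pred_b)
    precision = true_pos / len(pred_b) if pred_b else 0.0
    recall = true_pos / len(gold_b) if gold_b else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1
```

A splitter that merges two gold sentences into one loses recall but keeps precision, while one that over-splits loses precision instead, so reporting all three numbers shows which failure mode dominates.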

At the moment you can't use the Tibetan language tokenizer. It gives the error message: `TypeError: "module" object is not callable`. The error is thrown here in sentence_split.py: ``` elif split_algo...

I get this error:
```
line 1323, in _create_empty_array
    return np.empty(shape, dtype, order='C')
ValueError: array is too big; `arr.size * arr.dtype.itemsize` is larger than the maximum possible size.
```
All...

I know the author did not want to share the training procedure for ethical reasons. But looking at recent developments (for example, VITS), isn't that policy a bit outdated?...

### 🐛 Describe the bug I found a very strange "bug" while I was trying to find similar instances in a vector database of pictures. The model I used is...