matiaslindgren
matiaslindgren
Thanks! I think this repo works best as a self-contained, small example. Instead of gathering many different features here, it might be clearer to focus only on one thing. However,...
It would be nice to have some pre-trained models trained on different datasets at some point but I don't think it's going to happen any time soon. If your 50-language...
Does [this page](https://www.50languages.com/language-mp3.php) contain all the samples? If so, I'm unsure if the amount of data is enough. Deep-learning based language identification models usually need at least 5 hours (preferably...
Thanks for preparing the zip files. I downloaded the tar-file and listened to some of the Finnish and Swedish samples. Unfortunately, there are too few speakers (only 1 or 2)...
I tried adapting the suggestion from https://github.com/pandas-dev/pandas/issues/57052#issuecomment-1911986604 (`pandas._libs.parsers.raise_parser_error`) but there are quite a few failing tests. `SystemError: ... returned a result with an exception set` is the catch-all exception raised...
I implemented a new layer called `pymap_checked` in `pandas/_libs/khash` for the `PyObject` hash table. It will catch every exception thrown during khash computation for `PyObject`s. The next problem is fixing...