Bardo-Konrad issues

Results 21 issues of


                                            Bardo-Konrad

Issues replicating the examples

My predict.py: ``` from utils.config_manager import ConfigManager from utils.audio import Audio from scipy.io.wavfile import write config_loader = ConfigManager('ljspeech_autoregressive_transformer\standard', model_kind='autoregressive') audio = Audio(config_loader.config) model = config_loader.load_model() was = 'President Trump met...

How to use it with your own dataset?

- are `fsID `and `salience `necessary and if so, what do they mean? - where to set the filename of the csv in the `metadata `subfolder? - I get `ValueError:...

Docker

Please offer a docker image

[Feature request] Appropriate intonation using xtts_v2 und voice cloning

**🚀 Feature Description** Appropriate intonation using xtts_v2 und voice cloning **Solution** There is a certain structure to intonation that gives a natural flow, the same with using pauses. So the...

wontfix

feature request

[Bug] The voice-cloned speaker continues with garbage after to-be-spoken text was finished or mid-sentence

### Describe the bug Sometimes the speech pauses then the speaker continues but it's neither written nor is it any language, but it's clearly the same speaker. Unless you want...

bug

wontfix

Ollama logging for ConnectionResetError

I access ollama using the python library. It communicates well but after some exchanges I always get the following. It seems that I need to reset ollama via python or...

Parsing PDF takes forever

I get endless output like this ``` Parsing nodes: 100%|██████████████████████████████████████████████████████████████████████████| 1/1 [00:00

How does this compare to coqui?

Doesn't work for longer recordings using press_to_toggle

I used it for a 10 minute recording and it kept on repeating many sentences and did not detect the rest. Fortunately it saved it as wav in temp, so...

What are best practises to use equirectangular image frames from a 360 video as input?

My approach Use `aliceVision_split360Images.exe --splitMode equirectangular --equirectangularNbSplits 8 --equirectangularSplitResolution 768 --fov 90.0` Result is random. Some datasets are found using `exhaustive matching`, many others are not. Using it in `meshroom`...