joan126 comments

Results 25 comments of


joan126

Assertion failed: axesInput.allValuesKnown() && "Axes input for unsqueeze operation should be a constant tensor."

> `tf.expand_dims(tensor, axis=1)` suggests that the unsqueeze axis should be a constant `1`. > > What TF2ONNX version are you using? Are you able to share the model? tf.expand_dims(tensor, axis=1)...

Speaker adaptation - Fine tuning

> Hey @Rayhane-mamah and @begeekmyfriend! Have you tried to fine tune pretrained model on different voice? How much data did you use for it? How much steps did you train...

是否能用百条语料改变说话人的声音

> https://www.guiji.ai/site/article?id=251 这文章中提到freeze decoder的参数做说话人自适应，应该是freeze encoder吧...encoder只与文本向量有关，与说话人无关的。

Guided Attention Loss

guided attention loss use T,N should represent max_text_length and max_mel_frames of overall dataset or of one batch?

Guided Attention Loss

> No it is the average value of all samples. You have to estimate it ahead. In espnet [espnet](https://github.com/espnet/espnet/blob/c370ab2daeabe1d4ceb1747a795b491d3a412936/espnet/nets/pytorch_backend/e2e_tts_tacotron2.py#L25) and moliza/TTS repo, N,T represent max_text_length and max_mel_frame of a batch

which am is used in kaldi ?

> Neither, it just uses the GMM-HMM pipeline for training through LDA+SAT. The experiments that we did with nnet2 several years had very slight benefits if at all over the...

[BUG]: timed out when using 64 GPUs.

met same issue when run examples/language/gpt/gemini/run_gemini.sh , have you solved this ? @bestbzw

[BUG]: colossalai多机多卡训练，机器间通讯不了，或服务

怎么配置的？

[BUG]: colossalai多机多卡训练，机器间通讯不了，或服务

> 不报这个错了，，但是还是无法运行 socket 问题怎么解决的？我也遇到了

[BUG]: colossalai多机多卡训练，机器间通讯不了，或服务

> 我也遇到了之前有人说配置/etc/hosts 配置后，起作用了吗？我配置了也不行，不管是用torchrun还是colossalai run都不行