superhg

Results 8 issues of superhg

Hi, @yl4579 here is my training loss curve and eval loss curve, convert result is not good. The loss curve indicates overfitting? ![image](https://user-images.githubusercontent.com/6316449/178232871-6ee57f72-727c-445d-8aa8-5cdac1f22646.png) ![image](https://user-images.githubusercontent.com/6316449/178232940-3d120ed4-d4ec-4696-9241-aa6d7ab4efba.png)

the difference between any-to-many and any-to-any using multi-speakers is whether to use speaker-encoder?

hello,我对比了工程里的conformer实现代码,发现了一些与espnet对不上的,排除了一些版本的问题,发现有部分代码差别很大,如果直接用espnet最新的代码来训练conformer + ctc,有需要修改的地方吗?尤其是那个subsample这个地方

### run gemini example failed `when run gemini example demo, below error msg occurs: [W socket.cpp:601] [c10d] The client socket has failed to connect to [::ffff:10.19.49.102]:35027 (errno: 110 - Connection...

bug

finetune Belle数据集的时候遇到了一个问题: ` File "/tal-vePFS/LLM/hegang/workspace/ChatGLM-chinese-insturct/modeling_chatglm.py", line 836, in forward mask_position = seq.index(mask_token) ValueError: 150000 is not in list 1%| | 1448/203736 [48:57

这里数据集里面 self.num_samples = 1000 * self.ds_len 为什么乘1000? `class BlockDataset(data.Dataset): def __init__(self, ds, tokenizer, max_seq_len=1024, sample_across_doc=True, non_sentence_start=0.0, filter_english=False, **kwargs): """ sentence_start: the stripped article must start with a complete sentence """...

使用belle数据训练的时候,遇到这个错误,看了一下是训练文本文本中含有150000这个数字,出现了很多次。 ` File "/tal-vePFS/LLM/hegang/workspace/ChatGLM-chinese-insturct/modeling_chatglm.py", line 836, in forward mask_position = seq.index(mask_token) ValueError: 150000 is not in list `