Dinghao Zhou issues

Results: 41 issues of Dinghao Zhou

[LLM] support causallm model

Split out from this PR: https://github.com/wenet-e2e/wenet/pull/2460

fix comment in fsdp_tp_example.py

cla signed

fix rdzv_id

rdzv_id may not be equal across nodes in a multi-node setup when it is set using RANDOM
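
To illustrate the problem, here is a minimal sketch. It assumes RANDOM refers to the shell variable, which yields an independent value on each node when every node runs its own launch script. One possible workaround is to derive the id from values all nodes already share. The function name `stable_rdzv_id` and its arguments are hypothetical and are not taken from the PR.

```python
import hashlib


def stable_rdzv_id(job_name: str, master_addr: str, master_port: int) -> int:
    """Derive a rendezvous id from values that every node already shares.

    Hypothetical helper: the inputs are assumptions, not the PR's actual fix.
    """
    key = f"{job_name}:{master_addr}:{master_port}".encode("utf-8")
    digest = hashlib.sha256(key).hexdigest()
    # Reduce the hash to a small non-negative integer.
    return int(digest[:8], 16) % 100000


# Two nodes that compute the id independently still agree on it.
node0 = stable_rdzv_id("wenet_train", "10.0.0.1", 29400)
node1 = stable_rdzv_id("wenet_train", "10.0.0.1", 29400)
print(node0 == node1)
```

The resulting value could then be passed to the launcher as the rendezvous id on every node. A fixed constant shared through the job configuration would work just as well.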

cla signed

when to support gpu?

Great work! When will GPU support be released?

[transformer] add qk norm

Multimodal models and some large models use qk norm to stabilize training (Apple's dmel uses it too). It helps bestrq training and stabilizes gradients.

TODO:
- [ ] conformer result
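
As a rough illustration of why qk norm can stabilize training, the dependency-free sketch below normalizes the query and key vectors before the scaled dot product. It assumes an RMS-style normalization without a learned gain; the PR may use a different norm (for example LayerNorm) and operates on tensors rather than Python lists. The function names are hypothetical.

```python
import math


def rms_norm(vec, eps=1e-6):
    """Scale a vector to roughly unit root-mean-square (learned gain omitted)."""
    rms = math.sqrt(sum(x * x for x in vec) / len(vec) + eps)
    return [x / rms for x in vec]


def attention_logit(q, k, use_qk_norm):
    """Scaled dot-product logit for one query/key pair of one head."""
    if use_qk_norm:
        q, k = rms_norm(q), rms_norm(k)
    return sum(a * b for a, b in zip(q, k)) / math.sqrt(len(q))


# Large activations produce a huge logit unless q and k are normalized first.
q = [100.0, -200.0, 300.0, 50.0]
k = [80.0, -150.0, 400.0, 10.0]
plain = attention_logit(q, k, use_qk_norm=False)
normed = attention_logit(q, k, use_qk_norm=True)
print(plain, normed)
```

With normalization, the magnitude of each logit is bounded by the square root of the head dimension regardless of how large the activations grow, which keeps the softmax from saturating.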

[sensevoice] support sensevoice small arch

The sensevoice small arch is almost identical to the paraformer encoder, and the cmvn is the same as well. This adds support for sensevoice fine-tuning.

- [ ] convert script

[dataset] support shard by many jsonl files

- [ ] needs verification

```python
# usage1: shard across several jsonl files
json_files = ["1.jsonl", "2.jsonl", "3.jsonl"]
dataset = WenetRawDatasetSource(json_files, partition=True, shard_by_files=True)

# usage2: a single jsonl file
json_files = "all.jsonl"
dataset = WenetRawDatasetSource(json_files, partition=True, shard_by_files=True)
```
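
The idea of sharding by files can be sketched as below. This is not the PR's implementation: `shard_files` is a hypothetical helper, and the fallback for the case with fewer files than ranks is an assumption about how a single `all.jsonl` might be handled.

```python
def shard_files(json_files, rank, world_size):
    """Assign each rank every world_size-th file, starting at its own rank."""
    if isinstance(json_files, str):
        json_files = [json_files]
    if len(json_files) < world_size:
        # Assumption: with too few files, every rank keeps all of them and
        # the split has to happen later, at the line level.
        return list(json_files)
    return json_files[rank::world_size]


print(shard_files(["1.jsonl", "2.jsonl", "3.jsonl"], rank=0, world_size=2))
print(shard_files(["1.jsonl", "2.jsonl", "3.jsonl"], rank=1, world_size=2))
print(shard_files("all.jsonl", rank=1, world_size=2))
```

Striding by rank keeps the shards disjoint without any communication between workers, at the cost of uneven shard sizes when the file count is not a multiple of the world size.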

[ssl] add nestrq

TODO:
- [ ] make it work
- [ ] causal mask for conformer (needs an update to the encoder forward args)
- [x] subsequent token ids for loss
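
The two pieces named in the TODO list can be sketched as follows, assuming that nestrq trains the encoder to predict the id of the next quantized token and that each position may only attend to itself and earlier positions. The function names and the `ignore_id` value of -1 are assumptions for illustration, not code from the PR.

```python
def causal_mask(length):
    """mask[i][j] is True when position i may attend to position j (j <= i)."""
    return [[j <= i for j in range(length)] for i in range(length)]


def next_token_targets(token_ids, ignore_id=-1):
    """Target at position i is the token at i + 1; the last position has none."""
    return token_ids[1:] + [ignore_id]


for row in causal_mask(3):
    print(row)
print(next_token_targets([5, 7, 9]))
```

The final position gets `ignore_id` so that a loss function configured to skip that value does not penalize a prediction that has no target.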
