train-from-scratch topic

List train-from-scratch repositories

OpenBA-v2

18
Stars
0
Forks
Watchers

OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-15B.