ppl.pmx
ppl.pmx copied to clipboard
add llama pipeline parallel
add llama pipeline parallel, do not need to do extra split model. It split tp and pp while loading model