llama.cpp
llama.cpp copied to clipboard
[Draft] Tensor Parallel support to llama.cpp
- [x] I have read the contributing guidelines
- Self-reported review complexity:
- [ ] Low
- [ * ] Medium
- [ ] High Add tensor parallel support to llama.cpp, still draft code now.