Leyang Xue issues

Results 11 issues of


                                            Leyang Xue

Paxos vs Raft: Have we reached consensus on distributed consensus?

Howard, Heidi, and Richard Mortier. "Paxos vs Raft: Have we reached consensus on distributed consensus?." Proceedings of the 7th Workshop on Principles and Practice of Consistency for Distributed Data. 2020

area/distributed-systems

TODO-未读

type/paper

Dev

- release experts parallel version - correct README - support arctic and grok - remove installation dependency - remove circular dependency issue

Arctic Support

TODO for first release

- [x] API design - [x] Document for installation and PyPI - [x] performance table - [x] Support Mixtral multi-GPU - [ ] Load trace

Support Constrained Server Memory

Colab server T4 has 12GB DRAM, 16GB GPU, quantized mixtral has 26GB in size with single checkpoint, cannot bot be loaded into memory on creating the custom format for offloading

enhancement

Fix: Add formatting for commits

Format on PR to main branch - use [pre-commit hooks](https://github.com/pre-commit/pre-commit) - use DeepSpeed github [workflow](https://github.com/microsoft/DeepSpeed/blob/master/.github/workflows/formatting.yml) Developer need to run `pre-commit run --all-files` before PR

Leyang Xue

Paxos vs Raft: Have we reached consensus on distributed consensus?

Dev

Arctic Support

TODO for first release

Support Constrained Server Memory

Fix: Add formatting for commits

feat: performance improvement and Qwen3 support

[Feature Request] Add shared pinned memory pool for offloading enabled frameworks

[Feature Request] Improve cold start latency with ServerlessLLM sllm_store

feat: Merge kernels from vLLM and FlashInfer