
Question about overlapped serving blocks

Open jeremyzhangsq opened this issue 1 year ago • 0 comments

Consider a case in which a pre-trained model is hosted on only three servers: the first hosts blocks 1-4, the second hosts blocks 2-64, and the third hosts blocks 32-128.
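To make the scenario concrete, here is a minimal sketch of the layout above. It assumes the model has 128 blocks numbered 1-128 with inclusive ranges, as in the description. The helpers `uncovered_blocks` and `greedy_chain` are hypothetical names written for this illustration; they are not part of Petals and do not claim to reproduce how Petals actually routes requests.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # inclusive (first_block, last_block), as in the question


def uncovered_blocks(spans: List[Span], first: int, last: int) -> List[int]:
    """Return the blocks in first..last that no server hosts."""
    covered = set()
    for lo, hi in spans:
        covered.update(range(lo, hi + 1))
    return [b for b in range(first, last + 1) if b not in covered]


def greedy_chain(spans: List[Span], first: int, last: int) -> List[Tuple[int, int, int]]:
    """Pick (server_index, start, end) segments that cover first..last.

    Hypothetical strategy: at each position, use the server that hosts the
    current block and reaches furthest. Overlapping blocks are served by
    exactly one server in the resulting chain.
    """
    chain = []
    pos = first
    while pos <= last:
        candidates = [(hi, i) for i, (lo, hi) in enumerate(spans) if lo <= pos <= hi]
        if not candidates:
            raise ValueError(f"no server hosts block {pos}")
        hi, i = max(candidates)
        end = min(hi, last)
        chain.append((i, pos, end))
        pos = end + 1
    return chain


# The three servers from the description above.
servers = [(1, 4), (2, 64), (32, 128)]
gaps = uncovered_blocks(servers, 1, 128)
chain = greedy_chain(servers, 1, 128)
print("uncovered blocks:", gaps)
print("one possible chain:", chain)
```

In this sketch every block is hosted by at least one server, and the overlaps (blocks 2-4 and 32-64) only mean there is more than one valid way to split the chain. Whether the real system behaves the same way is exactly what I am asking.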

I would like to know whether the overlap between the served block ranges affects a client's fine-tuning or inference.

Thanks.

jeremyzhangsq, Aug 15 '24 06:08