openvino
openvino copied to clipboard
TP flow in CPU plugin
Details:
- two sub streams run in parallel when modelDistributionPolicy=TENSOR_PARALLEL in latency mode
Tickets:
- ticket-id