research

Results 9 issues of research

**Description** Low QPS with momentary traffic surges cause significant increases in inference TP99 latency. **Triton Information** 23.06 ![31](https://github.com/triton-inference-server/server/assets/30279992/6b16d1d7-41e2-4816-b444-1322df96c863) ![企业微信截图_f293c4aa-fc1e-4939-b7a9-02f102fb2be0](https://github.com/triton-inference-server/server/assets/30279992/0a721f81-aeb7-4d21-9caa-f0eb4a4c5efd)

question

一张T4卡跑的模型,

``` class Trainer: def __init__(self, global_rank, gpu_id: int, trainer_config: TrainerConfig, model: RecNet, optimizer, world_size: int, data_cfg: DataConfig): self.global_rank = global_rank self.config = trainer_config self.world_size = world_size self.dataloader = Data(data_cfg) self.epochs_run...

xlarun: command not found, I used the container you provided, but the command is not found.

``` public class FlatStorage implements Serializable { private MemoryBuffer buf; private Map featureMetadata; public FlatStorage(int bufferSize) { this.buf = MemoryUtils.buffer(bufferSize); this.featureMetadata = new HashMap(); } public void addFeature(String name, int...

question

### Question How much faster is Fury compared to Protobuf, and does the measurement include the time taken for Protobuf construction?

question

### Question How does Fury perform in terms of speed and size when serializing and deserializing Java arrays?

question