
JetStream is a throughput- and memory-optimized engine for LLM inference on XLA devices, starting with TPUs (with GPU support planned -- PRs welcome).

## JetStream issues (21 results, sorted by recently updated)

- Update the gRPC proto so that a request carries either a token id or text (one of the two), and a response carries a token id, text, or both. Currently, a request with either token id...
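
  The "one of" semantics above can be sketched as follows. This is a hypothetical illustration only; the message and field names below are not the actual JetStream proto, just a stand-in for a proto `oneof` where exactly one variant must be populated:

  ```python
  # Illustrative only: DecodeRequest and its fields are hypothetical,
  # mirroring a proto `oneof` (exactly one variant set per request).
  from dataclasses import dataclass
  from typing import Optional

  @dataclass
  class DecodeRequest:
      text: Optional[str] = None
      token_ids: Optional[list] = None

  def which_oneof(req: DecodeRequest) -> str:
      """Return which variant of the oneof is populated, rejecting
      requests that set both or neither."""
      if (req.text is None) == (req.token_ids is None):
          raise ValueError("exactly one of text or token_ids must be set")
      return "text" if req.text is not None else "token_ids"
  ```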

- Customer request: we write clients in multiple languages and cannot implement detokenization in each one, so the server needs to support server-side detokenization.
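
  Server-side detokenization in this sense can be sketched with a toy vocabulary; a real server would call the model's own tokenizer (e.g. a SentencePiece model) rather than this hypothetical lookup table:

  ```python
  # Toy sketch only: TOY_VOCAB stands in for the model's real tokenizer.
  TOY_VOCAB = {1: "Hello", 2: ",", 3: " world"}

  def detokenize(token_ids):
      """Map generated token ids back to text on the server, so clients
      in any language receive plain text instead of raw token ids."""
      return "".join(TOY_VOCAB.get(t, "") for t in token_ids)
  ```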

- Issue: JetStream currently makes a few assumptions that hinder its generalization: 1. the tokenizer is SentencePiece-based; 2. `pad_id` is 0; 3. after encoding, we pad to the nearest power...
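
  Assumptions 2 and 3 above can be sketched together. This is a minimal illustration of padding a token sequence to the next power-of-two length with a default `pad_id` of 0 (function names are illustrative, not JetStream's actual API):

  ```python
  def next_power_of_two(n: int) -> int:
      # Smallest power of two >= n (assumes n >= 1).
      return 1 << (n - 1).bit_length()

  def pad_tokens(token_ids, pad_id=0):
      """Pad a token sequence to the next power-of-two length,
      matching the behavior the issue describes (pad_id defaults to 0)."""
      target = next_power_of_two(len(token_ids))
      return list(token_ids) + [pad_id] * (target - len(token_ids))
  ```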

- Command: `python benchmarks/benchmark_serving.py --tokenizer /home//data/tokenizer.model --num-prompts 300 --dataset-path /home//data/ShareGPT_V3_unfiltered_cleaned_split.json --dataset sharegpt --save-request-outputs`

  Logs:

  > File "/home//JetStream/benchmarks/benchmark_serving.py", line 778, in
  >     main(parsed_args)
  > File "/home//JetStream/benchmarks/benchmark_serving.py", line 574, in main
  > ...

- JetStream is meant to be a framework-independent (JAX or PyTorch) inference stack, but the current code base is bound to jax arrays. Please support NumPy padding.
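
  The request above amounts to doing padding with plain NumPy instead of `jax.numpy`, so the core stack does not pull in a JAX dependency. A minimal sketch (the function name is illustrative, not JetStream's API):

  ```python
  import numpy as np

  def pad_batch_np(token_ids, target_len, pad_id=0):
      """Pad a 1-D token array to target_len using NumPy rather than
      jax.numpy, keeping JetStream core framework-independent."""
      ids = np.asarray(token_ids)
      return np.pad(ids, (0, target_len - ids.shape[0]),
                    constant_values=pad_id)
  ```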

- There is JAX-specific code scattered throughout JetStream; we should move the JAX-related code into the engine implementations and remove the JAX dependency from JetStream itself. In the end, JetStream is an orchestrator for PyTorch and...
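
  The separation this issue asks for can be sketched as an abstract engine interface: JetStream core talks only to the interface, while JAX (or PyTorch) code lives entirely inside concrete engine implementations. Class and method names here are hypothetical, not JetStream's actual engine API:

  ```python
  # Hypothetical sketch: a framework-neutral orchestrator boundary.
  import abc

  class Engine(abc.ABC):
      @abc.abstractmethod
      def prefill(self, token_ids: list) -> object:
          """Run prefill and return an opaque decode state."""

      @abc.abstractmethod
      def generate(self, state: object) -> int:
          """Produce the next token id from the decode state."""

  class EchoEngine(Engine):
      # Trivial stand-in implementation, used only for this example;
      # a real engine would wrap JAX or PyTorch model code here.
      def prefill(self, token_ids):
          return list(token_ids)

      def generate(self, state):
          return state[-1]
  ```

  With this split, the orchestrator never imports JAX: it only passes opaque state between `prefill` and `generate`.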