
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

JetStream issues and pull requests (35 results, sorted by recently updated):

https://github.com/AI-Hypercomputer/JetStream/blob/main/docs/observability-prometheus-metrics-in-jetstream-server.md only mentions the `prometheus_port` flag when JetStream runs with `maxengine_server`. However, no such option exists under https://github.com/AI-Hypercomputer/jetstream-pytorch. I wonder whether `jetstream-pytorch` also exposes Prometheus metrics related to inference...
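
Whether `jetstream-pytorch` exposes an equivalent endpoint is exactly the open question here, but if a server does publish Prometheus metrics, one quick way to verify is to scrape its `/metrics` endpoint directly. The host, port, and endpoint below are placeholders, not confirmed `jetstream-pytorch` behavior:

```python
# Sketch: list metric names exposed by a (hypothetical) Prometheus endpoint.
import urllib.request

def dump_metric_names(host: str = "localhost", port: int = 9090) -> None:
    url = f"http://{host}:{port}/metrics"
    with urllib.request.urlopen(url, timeout=5) as resp:
        text = resp.read().decode("utf-8")
    # Keep only metric names (skip '# HELP' / '# TYPE' comment lines).
    names = sorted({line.split("{")[0].split()[0]
                    for line in text.splitlines()
                    if line and not line.startswith("#")})
    print("\n".join(names))

if __name__ == "__main__":
    dump_metric_names()
```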

Could someone explain, or point to a doc that explains, how MoE is implemented in JetStream? Specifically: the all-to-all communications, static vs. dynamic, and the sparse matmuls. I would like to understand...
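
Not JetStream's actual implementation, but a minimal JAX sketch of the concepts being asked about: static-shaped top-1 routing expressed with one-hot dispatch masks, so the "sparse" expert matmul becomes a dense einsum with fixed shapes that XLA can compile. In an expert-parallel sharding, the `[experts, tokens, d_model]` dispatch tensor is where an all-to-all (e.g. `jax.lax.all_to_all`) would redistribute tokens across devices. All names and shapes here are illustrative:

```python
import jax
import jax.numpy as jnp

def moe_layer(tokens, gate_w, expert_w):
    """Top-1 MoE with static shapes: every expert 'sees' every token slot,
    but one-hot dispatch masks zero out tokens not routed to it."""
    num_experts = gate_w.shape[-1]
    logits = tokens @ gate_w                              # [T, E] router logits
    probs = jax.nn.softmax(logits, axis=-1)               # [T, E] gate probabilities
    expert_idx = jnp.argmax(probs, axis=-1)               # [T]    top-1 routing decision
    dispatch = jax.nn.one_hot(expert_idx, num_experts)    # [T, E] static-shaped mask

    # Dispatch: build a per-expert batch of tokens. Under expert parallelism,
    # this [E, T, D] tensor is what an all-to-all would move across devices.
    expert_in = jnp.einsum('te,td->etd', dispatch, tokens)

    # "Sparse" expert matmul expressed densely with fixed shapes.
    expert_out = jnp.einsum('etd,edf->etf', expert_in, expert_w)

    # Combine: weight each token's chosen expert output by its gate probability.
    gate = jnp.take_along_axis(probs, expert_idx[:, None], axis=-1)  # [T, 1]
    combine = dispatch * gate                                        # [T, E]
    return jnp.einsum('te,etf->tf', combine, expert_out)

# Illustrative shapes only.
key = jax.random.PRNGKey(0)
k1, k2, k3 = jax.random.split(key, 3)
tokens = jax.random.normal(k1, (8, 16))        # [tokens, d_model]
gate_w = jax.random.normal(k2, (16, 4))        # [d_model, num_experts]
expert_w = jax.random.normal(k3, (4, 16, 16))  # [num_experts, d_model, d_model]
print(moe_layer(tokens, gate_w, expert_w).shape)  # (8, 16)
```

The one-hot masks keep every shape static, which is the usual trade-off behind "static vs. dynamic" routing on XLA: wasted compute on padded slots in exchange for compilable, fixed-shape all-to-alls and matmuls.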

Any plan to support TPU v3? Kaggle has been offering TPU v3, and it is a good learning and testing ground for TPU-related releases.

The gcloud setup has been failing in the Dockerfile. Previously tried: `pip install gcloud-cli-sdk`. In this PR, added changes similar to MaxText: https://github.com/AI-Hypercomputer/maxtext/blob/4ac910d3435c75ce3f922459c71181068d1d5e4e/maxtext_gpu_dependencies.Dockerfile#L14C1-L22C51
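
Not necessarily what the referenced PR does, but one common way to install the gcloud CLI in a Debian/Ubuntu-based image (similar in spirit to the MaxText Dockerfile linked above) is via Google's apt repository rather than pip:

```dockerfile
# Sketch: install the gcloud CLI from Google's apt repo (assumes a Debian/Ubuntu base image).
RUN apt-get update && apt-get install -y apt-transport-https ca-certificates gnupg curl
RUN curl https://packages.cloud.google.com/apt/doc/apt-key.gpg \
      | gpg --dearmor -o /usr/share/keyrings/cloud.google.gpg \
 && echo "deb [signed-by=/usr/share/keyrings/cloud.google.gpg] https://packages.cloud.google.com/apt cloud-sdk main" \
      > /etc/apt/sources.list.d/google-cloud-sdk.list \
 && apt-get update && apt-get install -y google-cloud-cli
```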

[pull ready] The most exciting PR you'll receive this decade