fastertransformer_backend icon indicating copy to clipboard operation
fastertransformer_backend copied to clipboard

Request to support GCS file path

Open aasthajh opened this issue 3 years ago • 2 comments

Hi, I'm trying to deploy a Faster Transformer based LLM using Triton on a GCP instance. I was wondering if there's a way to provide the file path to the Google Cloud Storage bucket when passing the model check point in the config.pbtxt file?

parameters {
  key: "model_checkpoint_path"
  value: {
    string_value: "gs://triton_sample_models/model-ul2-ft/ul2/1/2-gpu"
  }
}

aasthajh avatar Oct 17 '22 20:10 aasthajh

Currently FasterTransformer Backend only supports reading model files and configuration files from the local disk to initialize. In later versions, we will gradually support reading configuration files and model files from kafka and the cloud.

byshiue avatar Oct 18 '22 02:10 byshiue

Sorry to bring up this issue again @byshiue Is there a way to provide model path in GCS bucket to parameter "model_checkpoint_path" in config.pbtxt file now? Thank you!

zinzinhust96 avatar Apr 19 '23 08:04 zinzinhust96