
Streaming via ssh across clusters

Open gaow0007 opened this issue 1 year ago • 2 comments

🚀 Feature Request

I want to use Streaming to access a dataset in a remote datacenter over ssh, with privacy-related access permissions.

Motivation

I want to store the dataset on a separate cluster and set per-user permissions for different images in the dataset.

[Optional] Implementation

Could you provide a potential API or some examples that might help with this feature?

Additional context

gaow0007 avatar Mar 15 '23 10:03 gaow0007

Hi @gaow0007, could you please explain your use case in a bit more detail? Currently, StreamingDataset supports AWS S3, GCS blob storage, and OCI blob storage, and downloads the dataset on the fly onto a remote instance. It authenticates automatically based on your cloud provider and the credentials you provide, and then downloads or uploads the shard files.

karan6181 avatar Apr 11 '23 23:04 karan6181

Sure. Suppose the dataset is split into different partitions: some users can access the entire dataset, while others can access only some of the partitions because of privacy restrictions.

Additionally, is it possible to provide an ssh example of using StreamingDataset, to facilitate local-mode debugging?

gaow0007 avatar Apr 12 '23 12:04 gaow0007
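
The partition-level access described above could be approximated outside the library. The following is a minimal sketch, assuming each partition is written as its own dataset directory under a common root; the access table, the `PartitionSource` type, and `sources_for` are hypothetical names used for illustration and are not part of the Streaming API.

```python
from dataclasses import dataclass

# Hypothetical access table: which partitions each user may read.
# The names and layout are illustrative, not part of the Streaming API.
ACCESS = {
    "alice": {"part-0", "part-1", "part-2"},  # full access
    "bob": {"part-0"},                        # partial access
}


@dataclass(frozen=True)
class PartitionSource:
    remote: str  # where the partition's shards live
    local: str   # per-partition cache directory on the training node


def sources_for(user, remote_root, local_root, access=ACCESS):
    """Return the partition sources a user may read, sorted by partition name."""
    allowed = access.get(user, set())  # unknown users get no partitions
    return [
        PartitionSource(f"{remote_root}/{name}", f"{local_root}/{name}")
        for name in sorted(allowed)
    ]


if __name__ == "__main__":
    for src in sources_for("bob", "s3://my-bucket/dataset", "/tmp/dataset"):
        print(src.remote, "->", src.local)
```

Each resulting remote/local pair could then be handed to the library as one sub-dataset, for example as a `Stream(remote=..., local=...)` entry in `StreamingDataset(streams=[...])`, if the installed version supports streams. A client-side filter like this is only a convenience; actual enforcement would have to come from the storage layer, such as bucket policies or filesystem permissions.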

Hey @gaow0007, sample-level permissioning for datasets is not currently planned for Streaming. Closing this issue since it's been inactive for a while.

snarayan21 avatar May 29 '24 19:05 snarayan21
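
For the local-mode debugging request, one possible workaround is to mirror the shard directory over ssh first and then read it from local disk. This is a rough sketch, assuming `rsync` and `ssh` are installed and the remote host is reachable; the host name, paths, and both helper functions are hypothetical, and this is not an ssh backend provided by Streaming.

```python
import subprocess


def rsync_over_ssh_cmd(host, remote_dir, local_dir, user=None):
    """Build an rsync command that mirrors remote_dir into local_dir over ssh."""
    target = f"{user}@{host}" if user else host
    # Trailing slashes make rsync copy the directory contents,
    # not the directory itself.
    return [
        "rsync", "-az", "-e", "ssh",
        f"{target}:{remote_dir.rstrip('/')}/",
        f"{local_dir.rstrip('/')}/",
    ]


def mirror(host, remote_dir, local_dir, user=None, dry_run=True):
    """Return the rsync command; run it only when dry_run is False."""
    cmd = rsync_over_ssh_cmd(host, remote_dir, local_dir, user)
    if not dry_run:
        subprocess.run(cmd, check=True)  # raises if rsync exits non-zero
    return cmd


if __name__ == "__main__":
    # Hypothetical host and paths; prints the command without running it.
    print(" ".join(mirror("datacenter-host", "/data/dataset", "/tmp/dataset")))
```

Once the shards are mirrored, the dataset can be opened from the local directory alone, for example `StreamingDataset(local="/tmp/dataset")` with no `remote` argument. Access control in this setup rests on the ssh account's filesystem permissions on the remote cluster.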