TensorFlow and LakeFS

Open willianbeneducci opened this issue 5 months ago • 6 comments

What is the motivation and/or use case?

There is a very nice tutorial on how to use Pandas with the integrated LakeFS file system to read a *.csv file directly from a repository. This is very useful.

I am looking for something similar for TensorFlow, particularly for working with images. I checked, and it seems to me that nothing has been implemented for this so far. Related: TensorFlow IO

This would be very interesting to have, since LakeFS works well with huge datasets. We could benefit from this "connection" and avoid downloading everything to a local or cloud machine to train models.

How can we implement this feature?

Ideally, this would be a plugin for TensorFlow or an addition to lakefs-spec. Thanks for all the good work so far!

willianbeneducci, Jan 24 '24 19:01