feast icon indicating copy to clipboard operation
feast copied to clipboard

Getting access to datasets in parquet format

Open vecorro opened this issue 3 years ago • 3 comments

Sometimes is hard to use GCP (budget/contract constraints for example). Would it be possible to get the parquet dataset files used in the tutorials so people can run them locally without the need of using GCP?

Thanks!

vecorro avatar Jun 08 '22 22:06 vecorro

Hi @vecorro, you are free to use Parquet. Just be aware of the fact that you won't get nearly the same performance without a scalable system like a cloud datawarehouse.

woop avatar Jun 13 '22 14:06 woop

Thanks, @woop I think that running the tutorials on Parquet should be ok as the data volume is small. I just wonder if there is a repo where all the Parquet files used in the tutorials can be pulled from.

vecorro avatar Jun 13 '22 14:06 vecorro

@vecorro Hi vecorro, specifically which tutorials are you interested in? For any feast init, the parquet file is generated from pandas dataframes in feast.

kevjumba avatar Aug 03 '22 22:08 kevjumba

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Dec 16 '22 03:12 stale[bot]