Add support for kaggle datasets as a remote storage

Open Muhammad-Abdelsattar opened this issue 2 years ago • 2 comments

Kaggle currently has an API for creating and updating datasets and models through kaggle-api. It gives each user more than 100GB of storage for private datasets and unlimited storage for public ones. I think supporting it would be extremely useful.

Muhammad-Abdelsattar avatar Oct 16 '23 13:10 Muhammad-Abdelsattar

Can I pick this one up?

pioneerHitesh avatar Oct 17 '23 08:10 pioneerHitesh

Unfortunately, we don't have the bandwidth to support additional remote types right now. If you are interested in using Kaggle for remote storage, it would help to upvote this issue and provide more context on how it would be useful.

dberenbaum avatar Oct 17 '23 20:10 dberenbaum

We don't have plans to add new remote implementations ourselves. Of course, contributions are always welcome.

In the future, we plan to offer a plugin system so that remotes can be created out of tree. Remotes like s3, azure, etc. are already out of tree, but they are hardcoded in the registry. We'll refactor this to make it easier to create new remote plugins.

I am closing this issue for now.

skshetry avatar Mar 25 '24 11:03 skshetry