guide: checkpoints for Tensorflow

Open iesahin opened this issue 4 years ago • 9 comments

I wrote something similar to iterative/dvclive#69 for all the ways of using checkpoints in get-started-checkpoints. I don't know whether the TF/Keras callbacks can be distributed with DVC or whether we should submit them to TF.contrib.

We need to be clear in the docs about how checkpoints are used and what the caveats are. A dedicated "Checkpoints for Tensorflow" document would be useful for explaining these and sharing the callbacks.

Originally posted by @iesahin in https://github.com/iterative/example-repos-dev/issues/47#issuecomment-849517197

iesahin, May 27 '21 10:05

Maybe better as a blog post?

jorgeorpinel, May 27 '21 17:05

I think the usage of checkpoints in different ML libraries deserves a User Guide chapter. We need guides for Tensorflow, xgboost, PyTorch and non-Python usage. These are harder to wrap one's head around than, say, configurations for different cloud providers.

iesahin, May 27 '21 17:05

Agree about non-Python usage in general.

Idk about the specific libs. It may imply maintaining docs about 3rd party tools that may change at any point in time. It's already a risk we have with some guides e.g. https://neptune.ai/blog/best-7-data-version-control-tools-that-improve-your-workflow-with-machine-learning-projects or even https://dvc.org/doc/cml/start-github but those to some extent are harder to avoid since the integrations are built into DVC/CML.

That said, I could see a single guide that just mentions several ML libs, with very simple code samples (that will hardly break with changes in those libs) and/or links to their docs.

jorgeorpinel, May 27 '21 18:05

One reason dvclive is a separate library is so that we can have dependencies on ML frameworks there without weighing down the core dvc library. There are a couple of related dvclive issues: https://github.com/iterative/dvclive/issues/5 and https://github.com/iterative/dvclive/issues/70.

dberenbaum, May 27 '21 19:05

I think, even if we don't bundle these integrations into dvclive for maintenance reasons, there should be pages about how to use dvc(live) with tf/keras/pytorch/xgboost/R/Caffe... so that search engine bots find these words close together and update their embeddings :)

I think most of the implementations are straightforward. As discussed in iterative/dvclive#5, they can be left to the user but we need to provide documentation for users to write their own.
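
As a rough illustration of how small such a user-written callback can be, here is a minimal, framework-agnostic sketch. Everything in it is hypothetical: `CheckpointCallback`, its `signal` argument and the `train` loop are placeholders, not part of DVC, dvclive or Keras. In a real Keras setup the class would presumably subclass `tf.keras.callbacks.Callback`, and `signal` would be whatever call tells DVC that a checkpoint is ready.

```python
import json
import os
import tempfile


class CheckpointCallback:
    """Hypothetical Keras-style callback: saves state each epoch, then signals."""

    def __init__(self, path, signal=None):
        self.path = path
        # `signal` is a placeholder for whatever notifies DVC about a checkpoint.
        self.signal = signal or (lambda: None)

    def on_epoch_end(self, epoch, logs=None):
        state = {"epoch": epoch, "metrics": dict(logs or {})}
        # Write to a temporary file first, then rename it into place,
        # so an interrupted run never leaves a half-written checkpoint.
        tmp = self.path + ".tmp"
        with open(tmp, "w") as f:
            json.dump(state, f)
        os.replace(tmp, self.path)
        self.signal()


def train(epochs, callback):
    """Stand-in training loop; a real one would pass the callback to the framework."""
    for epoch in range(epochs):
        loss = 1.0 / (epoch + 1)  # fake metric, for illustration only
        callback.on_epoch_end(epoch, {"loss": loss})
```

The point of the sketch is that the framework-specific part is only the hook name (`on_epoch_end` here); saving the state and signalling are the same regardless of library.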

iesahin, May 31 '21 12:05

cc @pared

dberenbaum, Jun 01 '21 15:06

> I think, even if we don't bundle these integrations into dvclive for maintenance reasons, there should be pages about how to use dvc(live) with tf/keras/pytorch/xgboost/R/Caffe... so that search engine bots find these words close together and update their embeddings :)
>
> I think most of the implementations are straightforward. As discussed in iterative/dvclive#5, they can be left to the user but we need to provide documentation for users to write their own.

Related: https://github.com/iterative/dvc.org/issues/2552

daavoo, Jun 11 '21 10:06

I think the URL is incorrect, I got a 404 @daavoo :)

iesahin, Jun 13 '21 10:06

> I think the URL is incorrect, I got a 404 @daavoo :)

Indeed, the issue has been transferred:

https://github.com/iterative/dvclive/issues/87

daavoo, Jun 14 '21 09:06