keepsake issues

Proper display of nested dict in param.

1

![Screenshot 2021-03-17 at 12 51 28](https://user-images.githubusercontent.com/12573521/111470202-803ab880-871f-11eb-8b4e-da6e06ec6ee4.png) Support proper display of nested params file since many application provided a nested dictionary of parameters. For example: ``` Params data._target_: project.data.MNISTDataModule data.batch_size: 32...

lkhphuc

help wanted

type/enhancement

Fixing matplotlib deprecation warning in plot() #562

1

enochkan

matplotlib deprecation warning in plot()

3

Getting this warning in tests: ``` tests/test_plot.py::test_num_plots /Users/ben/p/keepsake/python/keepsake/experiment.py:446: MatplotlibDeprecationWarning: Adding an axes using the same arguments as a previous axes currently reuses the earlier instance. In a future version, a...

bfirsh

type/bug

help wanted

Resume experiments

1

# Why? Sometimes the training process dies, and you want to resume from a checkpoint, keeping your partially trained model weights and metric history. # How? Editing an existing experiment...

bfirsh

type/roadmap

PyTorch Lightning tests shouldn't hit network

4

Currently the PyTorch Lightning tests download mnist and train a real model. We shouldn't do that to just test that the callback works -- they should run a fake training...

bfirsh

good first issue

help wanted

type/chore

Replicable runtime environments

2

# Why One part of reproducibility is ensuring the same code, training data, and hyperparams. But another is ensuring the same runtime environment -- system dependencies, Python dependencies, CUDA versions,...

bfirsh

type/roadmap

Make it clear that checkpoints don't de-duplicate

You should save weights as `weights.pth` and overwrite, not `weights_1.pth`, `weights_2.pth`, etc. This is a mistake we often see and it isn't clear from the documentation that you should overwrite.

bfirsh

area/documentation

If file doesn't have write permissions, it breaks repository

3

If a file with permissions `-r--------` is saved to the repository, then you can no longer check out because Replicate doesn't have permission to write to the file. Pretty minor...

bfirsh

type/bug

good first issue

help wanted

priority/low

Use venv for Python development

12

Addresses https://github.com/replicate/replicate/issues/424

hemildesai

PL callback slows down training

Bug report from pchalasani on Discord: "for some reason adding the PL-callback to my RNN model slowed the training down quite a lot, plus there was some unexpected output about...

andreasjansson

type/bug

needs investigation

keepsake
keepsake copied to clipboard

Metadata

Proper display of nested dict in param.

Fixing matplotlib deprecation warning in plot() #562

matplotlib deprecation warning in plot()

Resume experiments

PyTorch Lightning tests shouldn't hit network

Replicable runtime environments

Make it clear that checkpoints don't de-duplicate

If file doesn't have write permissions, it breaks repository

Use venv for Python development

PL callback slows down training

← Metadata

Owner

Metadata

keepsake keepsake copied to clipboard

Metadata

← Metadata

Owner

Metadata

keepsake
keepsake copied to clipboard