ecosystem
ecosystem copied to clipboard
Add parameter server train & side-car eval on k8s
ResNet56 model (with custom training loop) variables are created on parameter server jobs, and updated by workers. Evaluation is done using a dedicated job which uses the checkpoints saved during the training (side-car evaluation).
The model is trained on CIFAR10 dataset.