Tristan issues

Results: 10 issues of Tristan

RelaxedOneHotCategorical not working with bfloat16 mixed precision

Hi, I was trying to train a model using a RelaxedOneHotCategorical wrapped in a DistributionLambda layer on TPU. Unfortunately, when using the bfloat16 TPU mixed precision option, I run into...

Zeta logo and description refresh

Choice of loss function

In the StyleGAN2 paper they state that they use a "non-saturating logistic loss". Is there any particular reason you opted for hinge loss in this implementation? For reference I believe the...
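For readers comparing the two objectives named in this issue, here is a minimal sketch of the standard textbook forms of the non-saturating logistic loss and the hinge loss, written on raw discriminator logits. The function names are hypothetical and this is not taken from the repository under discussion.

```python
import math

def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def d_loss_logistic(real_logit, fake_logit):
    # Discriminator: push real logits up and fake logits down.
    return softplus(-real_logit) + softplus(fake_logit)

def g_loss_nonsaturating(fake_logit):
    # Generator: maximise log D(G(z)), i.e. minimise softplus(-logit).
    return softplus(-fake_logit)

def d_loss_hinge(real_logit, fake_logit):
    # Discriminator: zero loss once real >= 1 and fake <= -1.
    return max(0.0, 1.0 - real_logit) + max(0.0, 1.0 + fake_logit)

def g_loss_hinge(fake_logit):
    # Generator: raise the discriminator's score on fakes.
    return -fake_logit
```

One practical difference: the hinge discriminator loss stops producing gradients for samples already beyond the margin, while the logistic loss keeps a small gradient everywhere.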

Dataloader optimisation

- mapping transform BEFORE windowing
- var seq lens and goal tiling are computationally expensive (we ought to do on device) so I've removed from dataloader
- multi-threaded tfrecords mapper...

bug

NVAE seems to be SOTA for VAE-based models. Unlike VQ-VAE, it retains the original VAE objective and is continuous. PyTorch implementation here: https://github.com/NVlabs/NVAE Main improvements are: Multi-scale architecture, separable convolution,...

Fix VAE training stability

Currently @sholtodouglas uses dynamic gradient clipping (the gradient is clipped if its norm exceeds 4x the previous norm). This still seems to cause some issues with loss spikes. Other options are static clipping, gradient penalties,...
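The dynamic clipping rule described above could look roughly like the sketch below. It assumes the gradient is rescaled down to the threshold (4x the previous norm) and that the clipped norm is what gets tracked for the next step; the issue text does not confirm either detail, and the function names are hypothetical.

```python
import math

def global_norm(grads):
    # L2 norm over a flat list of gradient values.
    return math.sqrt(sum(g * g for g in grads))

def dynamic_clip(grads, prev_norm, factor=4.0):
    """Clip grads if their norm exceeds factor * prev_norm.

    Returns (possibly rescaled grads, norm to remember for the next step).
    Assumption: prev_norm=None means no history yet, so nothing is clipped.
    """
    norm = global_norm(grads)
    if prev_norm is None or norm <= factor * prev_norm:
        return list(grads), norm
    threshold = factor * prev_norm
    scale = threshold / norm
    return [g * scale for g in grads], threshold
```

Because the threshold follows the previous norm, it can drift upward by up to 4x per step, which may be one reason spikes still get through; static clipping uses a fixed threshold instead.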

Change actions to be relative & in end-effector space (not joint space)

We hypothesize that relative actions will be easier to learn since the model does not have to learn or account for the DC component of the signal. In the literature...
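As an illustration of what "relative" could mean here, the sketch below converts a sequence of absolute end-effector targets into per-step deltas and back. It assumes each action is taken relative to the previous target, with a known starting pose; the issue may instead intend actions relative to the current observed state. The helper names are hypothetical.

```python
def to_relative(absolute, start):
    # Each relative action is the difference from the previous target.
    relative = []
    prev = start
    for target in absolute:
        relative.append([t - p for t, p in zip(target, prev)])
        prev = target
    return relative

def to_absolute(relative, start):
    # Invert to_relative by accumulating deltas from the starting pose.
    absolute = []
    prev = list(start)
    for delta in relative:
        prev = [p + d for p, d in zip(prev, delta)]
        absolute.append(prev)
    return absolute
```

The deltas remove the constant offset (the "DC component") from the signal, so the model only has to predict changes.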

enhancement

Change rotation representation from Quaternion to 5D representation

Using higher-dimensional embeddings for the rotation matrix seems to perform considerably better than quaternion or Euler angle representations due to NNs learning continuous vector spaces and rotation being continuous...

invalid

JS example throwing errors

Was just wondering if the JS example for interfacing with a Serum market needs updating. I've tried running it and run into the same error that the `Market.load()` expects a...

Conv1d for long sequence learning

Adding a conv1d layer, which we hope can help with the large sequence lengths of DNA data. Has support for: - Causal convolutions (with causal padding which prevents it from looking...
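A minimal pure-Python sketch of a causal 1D convolution in the usual sense, where the output at step t depends only on inputs at steps up to t. This is illustrative only and is not the layer added in this change; the function name and the `dilation` parameter are assumptions.

```python
def causal_conv1d(x, kernel, dilation=1):
    """Single-channel causal convolution over a list of floats.

    Left-pads with zeros so the output has the same length as x and
    kernel[-1] multiplies the current time step.
    """
    pad = (len(kernel) - 1) * dilation
    padded = [0.0] * pad + list(x)
    out = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            # padded[t + pad] is x[t], so no index ever exceeds step t.
            acc += w * padded[t + j * dilation]
        out.append(acc)
    return out
```

For example, `causal_conv1d([1.0, 2.0, 3.0, 4.0], [1.0, 1.0])` sums each element with its predecessor, and changing the last input leaves all earlier outputs unchanged.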