Vincent Hellendoorn

Results 6 issues of Vincent Hellendoorn

The unparser for formatted values incorrectly referred to `_fstring_JoinedStr`, which expects a `values` property that `FormattedValue`s don't have. This PR updates to the correct call.

When using a model trained with AliBi positional embedding for inference, the cached matrix gets invalidated and recomputed after every generated token, which is very expensive. This PR offers an...

The previous version constraints only successfully built PLUR in a Python 3.9 environment. The below refines some of these versions to work with Python 3.8, and in particular, to be...

Enables training with larger effective batch sizes by taking multiple steps between gradient updates. I've always found this useful since batch size correlates strongly with performance even for small models...

**Is your feature request related to a problem? Please describe.** Training on multiple nodes without SLURM or Kubernetes is challenging right now because each node stores only part of the...

feature request
deprioritized

Hi, is there any way to feed sequence data to node-cntk? I have a pre-trained model that I can feed one-hot data, but the sequence axis seems to throw it...