swift-models icon indicating copy to clipboard operation
swift-models copied to clipboard

Models and examples built with Swift for TensorFlow

Results 46 swift-models issues
Sort by recently updated
recently updated
newest added

The DQN implementation within the Gym examples additionally needs Matplotlib and Numpy to work correctly, and we should create a section in the Readme there that describes this model and...

good first issue

The CoLA dataset is reliant on a [firebase url with included token](https://github.com/tensorflow/swift-models/blob/master/Datasets/CoLA/CoLA.swift#L70-L71). Identify the source and determine whether we should host these files from GCS instead. /cc @BradLarson @eaplatanios

help wanted
good first issue

With the release of the 0.9 toolchain containing X10, we now have a higher-performance alternative to the default eager execution mode. In the interest of having swift-models demonstrate best practices,...

Currently, `Alphabet`'s [`dictionary`](https://github.com/tensorflow/swift-models/blob/master/Support/Text/WordSeg/Alphabet.swift#L25) maps from `String` rather than `Character` to support tokens of length > 1 character. Using `Character` instead of `String` would work if we used special Unicode characters...

Currently, the `WordSeg` dataset uses characters from all datasets to create an instance of `Alphabet`, but only the training set should be used. This potentially involves architectural changes to `CharacterSequence`,...

Examples/GPT2-Inference occasionally crashes unexpectedly. Figure out why!

We recently generated encoded version of WikiText2 and got both that and original uncoded files into GCS bucket. (https://github.com/tensorflow/swift-models/pull/490). To keep aligned, we wanted to do the same for WikiText103.

All of our model inference unit tests use the default eager mode for execution. With X10 now available as an additional backend, we might want to add a parallel set...

Is there any resource/notebook on Using CheckpointWriter api inside a training loop for any model in swift?

There's a very cool post [here](https://blaisecruz.com/pytorch-on-tpus/) (HT @BradLarson) with benchmarks for PyTorch XLA. It would be great to see how x10 compares by adding a DistilGPT2 variant and `DistilGPT2-WikiText2` example.

help wanted