swift-models
swift-models copied to clipboard
Add a DistilGPT2 variant
There's a very cool post here (HT @BradLarson) with benchmarks for PyTorch XLA. It would be great to see how x10 compares by adding a DistilGPT2 variant and DistilGPT2-WikiText2 example.