transformers icon indicating copy to clipboard operation
transformers copied to clipboard

[Benchmarks] index

Open stas00 opened this issue 3 years ago • 0 comments

This issue is to document the important transformers benchmarks in one place, so that they are easy to find.

To add a new benchmark entry post it in an Issue (separately or as a comment in an existing issue) and then link from here. If you have edit rights please add a link directly to this post, otherwise please add a note in the comments and I will update this post.

Please do not post actual benchmarks in the comments of this Issue. This is only an index.

Thank you!

Fastest speed combinations

Precision: fp16 vs bf16 vs tf32 vs fp32

Batch size / gradient accumulation steps

Gradient checkpointing

Optimizers:

  • Adam torch vs. apex vs HF vs adafactor: RTX-3090, A100
  • re-run the above a year later with the same list of optimizers, plus BNB's 8bit optimizer PCIe 80GB A100

Network / Interconnects:

stas00 avatar Dec 31 '21 06:12 stas00