curated-transformers
Option to only return the last hidden layer output from models
In many applications we only need the output of the last layer, and dropping references to the intermediate layer outputs can save some memory during inference.
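
A minimal sketch of the idea, in plain Python rather than the library's actual code: the helper `run_layers` and its `last_only` flag are hypothetical names used only for illustration, and the toy functions stand in for transformer layers. When `last_only` is set, only the newest output stays referenced, so earlier outputs can be freed as soon as the next layer has consumed them.

```python
from typing import Callable, List, Sequence, Union

Vector = List[float]
Layer = Callable[[Vector], Vector]


def run_layers(
    layers: Sequence[Layer], inputs: Vector, last_only: bool = False
) -> Union[Vector, List[Vector]]:
    """Apply layers in order (hypothetical helper, not the library's API).

    With last_only=False, every layer output is collected in a list.
    With last_only=True, only the most recent output stays referenced,
    so earlier outputs become eligible for garbage collection.
    """
    hidden = inputs
    all_outputs: List[Vector] = []
    for layer in layers:
        hidden = layer(hidden)
        if not last_only:
            all_outputs.append(hidden)
    return hidden if last_only else all_outputs


# Toy "layers" standing in for transformer blocks.
def double(v: Vector) -> Vector:
    return [2.0 * x for x in v]


def add_one(v: Vector) -> Vector:
    return [x + 1.0 for x in v]


layers = [double, add_one, double]
every_layer = run_layers(layers, [1.0, 2.0])                 # one output per layer
final_only = run_layers(layers, [1.0, 2.0], last_only=True)  # last output only
```

In a real model the saving would scale with the number of layers and the size of each hidden-state tensor, and it would only materialise when nothing else (for example a gradient graph) still holds the intermediate outputs.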