predict_pv_yield

New paper: "Improving Vision Transformer Efficiency and Accuracy by Learning to Tokenize"

Open JackKelly opened this issue 2 years ago • 0 comments

Detailed Description

Very interesting new paper from Google:

Basically: A way to dynamically pick which "patches" of images / videos to attend to.

This could be a really nice way of including much larger input images (which is important for longer-time-horizon nowcasting).
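To make the idea concrete, here is a rough sketch of "learning to tokenize": score every patch for each of a small number of output tokens, then build each token as a weighted sum of the patches. This is only an illustration under assumptions, not the paper's architecture: `learn_tokens` and `w_select` are hypothetical names, the softmax over patches is a stand-in for whatever selection mechanism the paper uses, and the weights are random rather than learned.

```python
import numpy as np


def learn_tokens(patches, w_select):
    """Summarise N patch embeddings into S tokens via soft patch selection.

    patches:  (N, D) array, one embedding per image patch.
    w_select: (D, S) array, hypothetical weights that score each patch
              for each of the S output tokens (learned in a real model).
    """
    scores = patches @ w_select                              # (N, S)
    scores = scores - scores.max(axis=0, keepdims=True)      # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=0, keepdims=True)   # softmax over patches
    tokens = weights.T @ patches                             # (S, D)
    return tokens, weights


rng = np.random.default_rng(0)
patches = rng.normal(size=(1024, 32))   # e.g. a 32x32 grid of patches, 32-dim each
w_select = rng.normal(size=(32, 8))     # random here, purely for illustration
tokens, weights = learn_tokens(patches, w_select)
print(tokens.shape)                     # (8, 32)
```

The appeal for larger inputs: the later self-attention layers would operate on 8 tokens instead of 1024 patches, so their cost no longer grows quadratically with image size.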

JackKelly, Dec 07 '21 20:12