
Question about image_pos_token

Open · 0nandon opened this issue 6 months ago • 1 comment

Thanks for the nice work.

According to this code, it seems that the image token is also fed into image_pos_token, which differs from the original paper.

If my understanding is right, the paper transforms the camera extrinsic and intrinsic parameters into image_pos_token by passing them through an MLP layer. I would be grateful if you could point me to the code that computes image_pos_token.
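
To make my reading of the paper concrete, here is a minimal sketch of what I expected to find. It is only an illustration under my own assumptions: the function names, the `(fx, fy, cx, cy)` intrinsic layout, the flattened 4x4 extrinsic, the two-layer ReLU MLP, and the layer sizes are all hypothetical and are not taken from the repository or the paper. NumPy is used only to keep the example self-contained.

```python
import numpy as np


def init_mlp(in_dim, hidden_dim, out_dim, seed=0):
    """Randomly initialise a two-layer MLP (hypothetical sizes, not from the repo)."""
    rng = np.random.default_rng(seed)
    return {
        "w1": rng.standard_normal((in_dim, hidden_dim)) / np.sqrt(in_dim),
        "b1": np.zeros(hidden_dim),
        "w2": rng.standard_normal((hidden_dim, out_dim)) / np.sqrt(hidden_dim),
        "b2": np.zeros(out_dim),
    }


def camera_to_pos_token(extrinsic, intrinsic, params):
    """Map camera parameters to one embedding vector per view.

    extrinsic: (B, 4, 4) camera pose matrices
    intrinsic: (B, 4) as (fx, fy, cx, cy); this layout is an assumption
    returns:   (B, out_dim)
    """
    batch = extrinsic.shape[0]
    # Flatten the pose and append the intrinsics: (B, 16 + 4) = (B, 20).
    cam = np.concatenate([extrinsic.reshape(batch, 16), intrinsic], axis=1)
    hidden = np.maximum(cam @ params["w1"] + params["b1"], 0.0)  # ReLU
    return hidden @ params["w2"] + params["b2"]


# Two identical views with an identity pose and normalised intrinsics.
extrinsic = np.tile(np.eye(4), (2, 1, 1))
intrinsic = np.array([[1.0, 1.0, 0.5, 0.5]] * 2)
params = init_mlp(in_dim=20, hidden_dim=64, out_dim=32)
token = camera_to_pos_token(extrinsic, intrinsic, params)
print(token.shape)  # (2, 32)
```

In this sketch the token depends only on the camera parameters, with no image features involved, which is the behaviour I understood from the paper.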


0nandon, Aug 18 '24 09:08