LMDrive icon indicating copy to clipboard operation
LMDrive copied to clipboard

About how specifically LMDrive works

Open PhilWallace opened this issue 9 months ago • 1 comments

Thanks for the great work!

I am new to LLM-based ADS, and I have some questions about how LMDrive works. As stated in the paper, LMDrive is developed based on Q-former. And how Q-former works is as below:

image

As I took it, the key idea of LMDrive is to use the idea of Q-former for LLM-based driving. Efforts done in the paper is to training the Q-Former, merging multi-sensor data, and so on. Is this understanding correct?

Additionally, what is the specifically output of LMDrive? It seems to output future waypoints instead of direct control signals.

Many thanks for your attention!

PhilWallace avatar May 28 '24 14:05 PhilWallace