UI-TARS icon indicating copy to clipboard operation
UI-TARS copied to clipboard

<point> vs |box_start|, |box_end|

Open tcnguyen opened this issue 6 months ago • 1 comments

Hi team,

I saw 2 different prompts using different tokens

click(start_box='<|box_start|>(x1,y1)<|box_end|>')

click(point='<point>x1 y1</point>')

Which one should I use for UITARS 1.5-7B https://huggingface.co/ByteDance-Seed/UI-TARS-1.5-7B?

Do you recommend UITARS 1.5-7B over UI-TARS-7B like UI-TARS-7B-DPO?

Thank you.

tcnguyen avatar May 27 '25 15:05 tcnguyen

"click(start_box='<|box_start|>(x1,y1)<|box_end|>')" is for UI-TARS-1.5-7B.

Yes, UI-TARS-1.5-7B is better than UI-TARS-7B.

JjjFangg avatar May 28 '25 12:05 JjjFangg

@JjjFangg many thanks!

tcnguyen avatar Jun 20 '25 09:06 tcnguyen