chuheww comments

14 comments by chuheww

The v1.5 7B model scores far below the v1 2B model in the element_ocr scenario. Is this expected?

The v1.5 7B model scores far below the v1 2B model in the element_ocr scenario. Is this expected?

> If the 2B model mentioned here is UI-TARS-2B-SFT, you can try the following prompt:
>
> `` Output only the coordinate of one point in your response. What element matches the following task: **User Instruction**
>
>...

The v1.5 7B model scores far below the v1 2B model in the element_ocr scenario. Is this expected?

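> Could you share your inference parameters? We recommend evaluating with greedy decoding.

Hello, and thank you for your reply. I am a beginner, so my earlier description may have made the question harder to answer. I am pasting my test code directly and would appreciate any suggestions for changes. Initialization:

    def __init__(
        self,
        model_path="./UI-TARS-2B-SFT",
        device_map="auto",
    ):
        self.tokenizer = AutoTokenizer.from_pretrained(
            model_path, trust_remote_code=True, use_fast=True
        )
        self.processor = AutoProcessor.from_pretrained(
            model_path, trust_remote_code=True, use_fast=True
        )
        self.model = AutoModelForVision2Seq.from_pretrained(
            model_path,...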
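
For reference, the greedy decoding suggested in the quoted reply can be sketched as a set of generation arguments. This is a minimal sketch, not the maintainers' evaluation setup: `greedy_generation_kwargs` is a hypothetical helper name, and the `max_new_tokens` default is an arbitrary assumption. The keys `do_sample`, `num_beams`, and `max_new_tokens` are standard Hugging Face `generate` parameters.

```python
def greedy_generation_kwargs(max_new_tokens=128):
    """Return keyword arguments that request deterministic (greedy) decoding.

    Hypothetical helper; the default token budget is an assumption.
    """
    return {
        "do_sample": False,  # no sampling: always pick the most likely token
        "num_beams": 1,      # a single beam, so this is not beam search
        "max_new_tokens": max_new_tokens,
    }
```

With a model and inputs prepared as in the snippet above, these arguments would typically be passed as `self.model.generate(**inputs, **greedy_generation_kwargs())`.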

Element coordinate recognition accuracy has dropped severely in version 1.5

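> > Use the latest /UI-TARS-desktop-v0.1.0.
>
> Does that fix it? I followed the coordinate handling in the README when calling the 1.5 7B model and the results are still incorrect. Does the client process coordinates differently?

Hello, has this issue been resolved? The 1.5 7B model performs very poorly for me.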
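
One common source of this kind of mismatch is comparing coordinates in two different image spaces. The sketch below assumes, without confirmation from this thread, that the model returns absolute pixel coordinates relative to the resized image it actually saw, so the point must be scaled back to the original screenshot. `rescale_point` is a hypothetical helper, and the resized dimensions are assumed to come from whatever preprocessing was applied.

```python
def rescale_point(x, y, resized_size, original_size):
    """Map a point from the resized image back to the original screenshot.

    Assumption: (x, y) are absolute pixels in the resized image.
    Sizes are (width, height) tuples.
    """
    resized_w, resized_h = resized_size
    orig_w, orig_h = original_size
    # Scale each axis independently by the ratio of original to resized size.
    return x * orig_w / resized_w, y * orig_h / resized_h
```

For example, `rescale_point(500, 250, (1000, 500), (2000, 1000))` maps the point to `(1000.0, 500.0)`. If the checkpoint instead emits normalized coordinates, a different conversion applies.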