Junyang Wang comments

Results 125 comments of


                                            Junyang Wang

请问是否提供与演示视频相同的可视化界面

> > I have already set up a visual interface, and even on cloud devices, so you don’t need to use your own equipment. If you're interested, you can leave...

请问这个报错怎么解决呢？

可以参考[链接](https://github.com/modelscope/modelscope/issues/931)

How to improve the execution speed of OCR, grounding-dino, and chatgpt-4o models to transition mobile-agent from laboratory research to engineering use?

Hello. As you said, both the OCR model and GroundingDino can be loaded via GPU. For the OCR model, you need to install the corresponding version of tensorflow-gpu. There is...

modelscope里面默认使用cuda? raise AssertionError("Torch not compiled with CUDA enabled") AssertionError: Torch not compiled with CUDA enabled

如果没有GPU，可以自行更换其他OCR模型，按照输出格式修改 MobileAgent/text_location.py

能否透露一下V3版本的开源模型

> 请问一下能不能指导一下你们V3准备用的开源模型，我使用了llava-next-72b-hf这个72B的模型，但是基本没有效果，和GPT4o没法比，你们有什么好的想法能交流一下吗 v3目前还暂不能开源和公布技术细节。如果想要提升效果，可以从指令遵循能力、外部知识注入和遵循能力、多图理解能力等角度对模型做提升，这些能力对于手机场景是十分关键的