web-llm
web-llm copied to clipboard
model request: Llama-3-8B-Web
It surpasses GPT-4V (zero-shot *) by over 18% on the WebLINX benchmark, achieving an overall score of 28.8% on the out-of-domain test splits.