
[Chatbot Arena] Add GLM-4 variants: AirX, Air and Flash

Open EwoutH opened this issue 1 year ago • 0 comments

Currently GLM-4-0520 is available on the leaderboard and performs really well. However, Zhipu AI also offers other variants that are 10x, 100x and 1000x cheaper (GLM-4-AirX, GLM-4-Air and GLM-4-Flash, respectively). It would be very interesting to see how they perform.

Zhipu AI is also the only LLM provider whose lineup covers a 1000x price range. This could be one of the most interesting data points for seeing how LLM performance scales with price on an (assumed) similar platform.

See https://open.bigmodel.cn/pricing

| Model | Overview | Price per 1K tokens (CNY) | Est. price per 1M tokens (USD, at ≈7.25 CNY/USD) |
| --- | --- | --- | --- |
| GLM-4-0520 | Our most advanced and intelligent model to date, with an 18.6% improvement in instruction compliance, 128k context, released on 2024-06-05. | ¥0.1 | ≈ $13.8 |
| GLM-4V | Supports visual QA, image captioning, visual positioning, and complex object detection among other image understanding tasks, with 2k context. | ¥0.05 | ≈ $6.9 |
| GLM-4-AirX | High-performance version of GLM-4-Air, same effectiveness, 2.6 times faster inference speed. | ¥0.01 | ≈ $1.38 |
| GLM-4-Air | Best cost-performance model, similar overall performance to GLM-4, with 128k context, fast and affordable. | ¥0.001 | ≈ $0.138 |
| GLM-4-Flash | Suitable for simple tasks, fastest speed, most affordable version, with 128k context. | ¥0.0001 | ≈ $0.0138 |
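
For anyone who wants to redo the price arithmetic, here is a minimal Python sketch. It assumes the ¥ prices are in Chinese yuan (CNY) and uses an exchange rate of roughly 7.25 CNY per USD. That rate is my assumption and is not taken from the pricing page, so treat the dollar figures as rough estimates. The helper names `usd_per_million` and `price_ratio` are made up for illustration.

```python
# Listed prices in yuan per 1K tokens, copied from the pricing table.
PRICE_CNY_PER_1K = {
    "GLM-4-0520": 0.1,
    "GLM-4V": 0.05,
    "GLM-4-AirX": 0.01,
    "GLM-4-Air": 0.001,
    "GLM-4-Flash": 0.0001,
}

# Assumption: approximate exchange rate, not stated on the pricing page.
CNY_PER_USD = 7.25


def usd_per_million(price_cny_per_1k, cny_per_usd=CNY_PER_USD):
    """Convert a price in CNY per 1K tokens to USD per 1M tokens."""
    return price_cny_per_1k * 1000 / cny_per_usd


def price_ratio(model, baseline="GLM-4-0520"):
    """Return how many times cheaper `model` is than `baseline`."""
    return PRICE_CNY_PER_1K[baseline] / PRICE_CNY_PER_1K[model]


if __name__ == "__main__":
    for name, price in PRICE_CNY_PER_1K.items():
        print(
            f"{name:12s} ~${usd_per_million(price):.3g} per 1M tokens, "
            f"{price_ratio(name):.0f}x cheaper than GLM-4-0520"
        )
```

The ratios depend only on the listed ¥ prices, so the 10x, 100x and 1000x steps hold regardless of which exchange rate is used.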

On Chatbot Arena leaderboard:

  • [x] GLM-4-0520
  • [ ] GLM-4V
  • [ ] GLM-4-AirX
  • [ ] GLM-4-Air
  • [ ] GLM-4-Flash

EwoutH, Jun 25 '24 11:06