gemini-cli icon indicating copy to clipboard operation
gemini-cli copied to clipboard

Increased number of 503 errors - "The model is overloaded. Please try again later."

Open agmsb opened this issue 3 months ago • 61 comments

What happened?

When using gemini-cli in non-interactive mode, observing an increased number of the following error:

"error":{"message":"{\n  \"error\": {\n    \"code\": 503,\n    \"message\": \"The model is overloaded. Please try again later.\",\n    \"status\": \"UNAVAILABLE\",

We may be seeing similar reports in #7222, #7205, and #7222.

What did you expect to happen?

Clear handling of this error (do we retry, which model is the error coming from, etc.)

Client information

version: 0.2.1 OS: linux sandbox: none model: 2.5 pro

Login information

GEMINI API KEY

Anything else we need to know?

No response

agmsb avatar Aug 27 '25 16:08 agmsb

Found possible duplicate issues:

  • #4360
  • #7205

If you believe this is not a duplicate, please remove the status/possible-duplicate label.

gemini-cli[bot] avatar Aug 27 '25 16:08 gemini-cli[bot]

I am also experiencing the same issue. It can't complete even one request.

Lokesh-pawar avatar Aug 27 '25 18:08 Lokesh-pawar

Wonder if this is maybe caused by an uptick in model load of users due to 🍌

jackwotherspoon avatar Aug 27 '25 19:08 jackwotherspoon

I have the same problem with 0.2.1 version, impossible work with 2.5-pro, I have to change to flash due to this problem

mariete77 avatar Aug 28 '25 07:08 mariete77

I have encountered the same problem, and it happens quite frequently.

chiuchihhao avatar Aug 28 '25 09:08 chiuchihhao

This should be a critsit, it's becoming extremely frustrating.

zackoch avatar Aug 29 '25 18:08 zackoch

I have the same problem with 2.5-pro it's always overloaded !

MohammedElattar avatar Sep 04 '25 12:09 MohammedElattar

I have the same problem with 2.5-pro it's always overloaded !

Try updating to preview 4. Didn't have issues during a long sesh yesterday

zackoch avatar Sep 04 '25 13:09 zackoch

same issue

vgdh avatar Sep 17 '25 13:09 vgdh

same

akrisk avatar Sep 17 '25 15:09 akrisk

same issue

VEKINForTesting avatar Sep 18 '25 09:09 VEKINForTesting

same issue

Pra3t0r5 avatar Sep 18 '25 12:09 Pra3t0r5

Update to 0.5.3 doesn't fix that. First prompt of the day:

Image

andreaganduglia avatar Sep 18 '25 12:09 andreaganduglia

same issue

Image

beinimaliesi avatar Sep 22 '25 07:09 beinimaliesi

I have the same issue and found also this issue: python genai issue

makoit avatar Sep 22 '25 12:09 makoit

Same issue here on v0.5.5, and the fact that's being around for a while doesn't look good. Prior to API key usage I never experienced it, while at the moment I see 61 of these errors using API key.

Image

Image

Gonoshift avatar Sep 22 '25 13:09 Gonoshift

Same here, using API key connected to project with billing account, both latest stable and nightly Gemini CLI. Works flawlessly with the free tier personal auth, until quota is used up of course.

torvald avatar Sep 23 '25 07:09 torvald

The very first query of the day.

Image

andreaganduglia avatar Sep 25 '25 13:09 andreaganduglia

got new api error :

✕ [API Error: {"error":{"message":"{\n "error": {\n "code": 429,\n "message": "You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits.",\n "status": "RESOURCE_EXHAUSTED"\n }\n}\n","code":429,"status":"Too Many Requests"}}] Please wait and try again later. To increase your limits, request a quota increase through AI Studio, or switch to another /auth method

login method : gemini api key

arsalan-zeus avatar Sep 27 '25 08:09 arsalan-zeus

How does gemini-2.5-flash model compare to pro? Is it suitable for long running coding tasks? I'm constantly getting [API Error: The service is currently unavailable. (Status: UNAVAILABLE)] and it's getting frustrating

divyranjan17 avatar Sep 29 '25 03:09 divyranjan17

For tracking since this issue primarily focuses on the 503 errors and the fact that we haven't seen 503 errors a while, we will continue to keep an eye on this for the time being.

srithreepo avatar Oct 02 '25 22:10 srithreepo

Installed just today and the very first query with gemini-2.5-pro returned:

✕ [API Error: got status: UNAVAILABLE. {"error":{"code":503,"message":"The model is 
    overloaded. Please try again later.","status":"UNAVAILABLE"}}]

$ date -u
Fri  3 Oct 2025 11:55:49 UTC

The next query went through just fine.

ckujau avatar Oct 03 '25 11:10 ckujau

Is there any way to avoid this issue right now, while it's present? E.g., does switching auth from API key to OAuth login with Google Account fix it?

bradyneal avatar Oct 03 '25 20:10 bradyneal

Facing the same issue

gvsakhil avatar Oct 06 '25 08:10 gvsakhil

Same issue, any definitive solution?

FroJhoDev avatar Oct 06 '25 14:10 FroJhoDev

At this point this service is borderline a scam. It consistently drops every tuesday, wednesday, thursday and saturday. And on the days it doesn't it can still give this error from time to time, so even then it's not perfect.

It's a sinking ship unless they come up with something to finally fix it.

Holneer avatar Oct 08 '25 16:10 Holneer

Let's ask something simple to Gemini...

Image

andreaganduglia avatar Oct 13 '25 14:10 andreaganduglia

Lots of 503 400 etc lately. Esp the last week. And random quota is used. When it isn't.

eonist avatar Oct 19 '25 17:10 eonist

Episode 503 – The Error Strikes Back.

Image Image Image

andreaganduglia avatar Oct 22 '25 09:10 andreaganduglia

Is anyone using Gemini in production given that this error keeps happening? How do you handle it? Fallback to OpenAI?

bradyneal avatar Oct 22 '25 23:10 bradyneal