jeromeroussin

10 comments by jeromeroussin

I am making calls like this to the proxy with openai version < 1.0:

```python
embeddings = await openai.Embedding.acreate(model=model, input=['random sentence here'], user='foo')
```
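For reference, a minimal sketch of what I believe the equivalent call looks like with openai >= 1.0 pointed at the same proxy. The `base_url`, `api_key`, and the `embed` wrapper are illustrative placeholders, not from our actual setup:

```python
from openai import AsyncOpenAI

# Placeholder proxy address and key, not from our deployment
client = AsyncOpenAI(base_url="http://localhost:4000", api_key="sk-placeholder")

async def embed(model: str) -> list[float]:
    # openai >= 1.0 style: client.embeddings.create replaces openai.Embedding.acreate
    response = await client.embeddings.create(
        model=model,
        input=["random sentence here"],
        user="foo",
    )
    return response.data[0].embedding
```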

I am also interested in this.

Here is the timeout stacktrace for one of those 6000s (non-streaming) timeouts, if that helps:

```
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/httpx/_transports/default.py", line 101, in map_httpcore_exceptions
    yield
  File "/usr/local/lib/python3.12/site-packages/httpx/_transports/default.py",...
```

Addressed in a later bug report: https://github.com/BerriAI/litellm/issues/7001

The issue still exists in 1.57.3: the `timeout` value set on our models is not being respected, and we are still seeing timeouts at the default 6000s value.
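For reference, a minimal sketch of the kind of per-model timeout setting we have in the proxy config; the model name, `api_base`, and the 120s value are illustrative, not our exact config:

```yaml
model_list:
  - model_name: gpt-4o
    litellm_params:
      model: azure/gpt-4o
      api_base: https://example.openai.azure.com
      timeout: 120  # per-model timeout we expect to be enforced
# In practice, requests only fail at the proxy-wide 6000s default instead.
```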

A stacktrace of a 6000s timeout:

```
Traceback (most recent call last):
  File "/usr/local/lib/python3.12/site-packages/httpx/_transports/default.py", line 72, in map_httpcore_exceptions
    yield
  File "/usr/local/lib/python3.12/site-packages/httpx/_transports/default.py", line 377, in handle_async_request
    resp = await self._pool.handle_async_request(req)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^...
```

I second the ask for callbacks to be supported on Assistant-related routes. Observability is half of litellm's value for us (authentication being the other half)...
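For illustration, this is roughly how we wire up callbacks today for the chat/completions routes in the proxy config (a sketch; the callback name is just an example). The ask is for these same hooks to fire on the Assistants API routes:

```yaml
litellm_settings:
  # These fire on chat/completions and embeddings routes today;
  # the ask is for the same hooks to fire on Assistants routes as well.
  success_callback: ["langfuse"]
  failure_callback: ["langfuse"]
```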

@krrishdholakia The ask now is to extend https://github.com/BerriAI/litellm/pull/16643 to support `azure` as a provider.