agents issues

Add retrieval config support for google LLM

This should close #4407 Tested with the following setup. ```python } session = AgentSession( ... llm=google.LLM( model="gemini-2.5-flash", vertexai=True, location="global", # retrieval_config=types.RetrievalConfig( # lat_lng=types.LatLng(latitude=53.350140, longitude=-6.266155) # ), # or retrieval_config={ "lat_lng":...

chenghao-mou

Add extra comments about Google model deprecation

Tested the ambiguous ones: "gemini-2.0-flash-exp" and "gemini-live-2.5-flash-preview-native-audio". Both still work with the right setting, but are not mentioned in any official docs or change logs.

chenghao-mou

Update STT tests and add batch recognition flag

1

- Add batch recognition flag in STT capabilities - Added manual workflow to test a PR/branch/revision - Updated tests to support all STT vendors except two of them: ```python #...

chenghao-mou

allow pushing frames to VAD when agent speech is uninterruptible

4

This should close #4413 What happened: - VAD received audio frames, changing user stage to speaking; - Uninterruptible speech created, discarding audio frames for both STT and VAD. User state...

chenghao-mou

The `gemini-2.5-flash-native-audio-preview-12-2025` model cannot be used with modalities text for hybrid architecture with a separate TTS plugin

3

### Bug Description ### Error message ``` websockets.exceptions.ConnectionClosedError: received 1007 (invalid frame payload data) Cannot extract voices from a non-audio request. ``` ### Code to reproduce ```python from livekit.agents import...

sagorbrur

bug

Inference: Improved support for mid session TTS updates

adrian-cowham

Switch VideoAvatars for AgentSession with multiple agents

4

### Bug Description When using VideoAvatars with multiple agents under a single AgentSession, the audio input/output (QueueAudioOutput) is initialized once when the avatar starts. Currently, there’s no way to reset...

Viktoriagrg

bug

Agent stops responding if interrupted during a tool call with self.session.say

6

### Bug Description If the agent has a `self.session.say` inside of a tool call, and you interrupt right before it. The agent will be stuck and unable to respond. All...

aumeshm

bug

Unable to access current span on `add_shutdown_callback` or on `on_session_end`

1

### Bug Description https://docs.livekit.io/deploy/observability/data/#save-conversation-history-example https://docs.livekit.io/deploy/observability/data/#session-reports I want to be able to add the whole session report as a span attribute to the root span. I am unable to do it...

debajyoti-truefoundry

bug

Adding InyaAI plugin

2

Gnani-AI-Mintlify

agents
agents copied to clipboard

Metadata

Add retrieval config support for google LLM

Add extra comments about Google model deprecation

Update STT tests and add batch recognition flag

allow pushing frames to VAD when agent speech is uninterruptible

The `gemini-2.5-flash-native-audio-preview-12-2025` model cannot be used with modalities text for hybrid architecture with a separate TTS plugin

Inference: Improved support for mid session TTS updates

Switch VideoAvatars for AgentSession with multiple agents

Agent stops responding if interrupted during a tool call with self.session.say

Unable to access current span on `add_shutdown_callback` or on `on_session_end`

Adding InyaAI plugin

← Metadata

Owner

Metadata

agents agents copied to clipboard

Metadata

← Metadata

Owner

Metadata

agents
agents copied to clipboard