Implement Assistants Streaming
What:
- [ ] Bug Fix
- [X] New Feature
Description:
This PR implements full support for the Assistants streaming API: https://platform.openai.com/docs/api-reference/assistants-streaming/events
- `ThreadMessageDelta` (including nested classes)
- `ThreadRunStepDelta` (including nested classes)
- `StreamedThreadRunResponseFactory` (maps the events to the correct classes)
- `EventStreamResponse` (iterator modified to work with the events format)
- `EventStreamResponseItem` (contains the event type and the data, mirroring how the API returns them)
Related:
https://github.com/openai-php/client/issues/357
I'm fairly certain this is fully feature-complete with everything new in the API. Let me know if you spot anything that might be missing, or any updates that need to be made.
@knash94 Thanks for reviewing it and resolving the type issue. I'm not doing any work with images, so I wouldn't have spotted that one.
I've added the `createAndRunStreamed` function and updated the contracts to match the new functionality.
Let me know if any more issues crop up from your testing.
Hey all, is there anybody available to review this PR and maybe merge and release it so we can use the streaming functionality?
Thanks for your efforts.
@eng-ahmad-sameer, you can use it by aliasing the openai-php/client version in your composer.json:

```json
"repositories": [
    {
        "type": "vcs",
        "url": "https://github.com/EthanBarlo/OpenAi-php-client.git"
    }
],
"require": {
    "openai-php/client": "dev-Implement-Assistants-Streaming as 0.8.4"
}
```
@EthanBarlo I was testing it out and in most cases it works perfectly. I found two main issues:
- Lack of type annotations for stream events. I needed to add them myself so my IDE could give me at least some hints about what is going on.
- The example with tool output is not working. As soon as you assign a new `$stream` variable, the `foreach` loop ends. I was able to run it using a `while` loop and the generator instance directly:
```php
$stream = $client->threads()->runs()->createStreamed(
    threadId: 'thread_tKFLqzRN9n7MnyKKvc1Q7868',
    parameters: [
        'assistant_id' => 'asst_gxzBkD1wkKEloYqZ410pT5pd',
    ],
);

$iterator = $stream->getIterator();

while ($iterator->valid()) {
    $item = $iterator->current();
    $iterator->next();

    if ($item->event === 'thread.run.requires_action') {
        $stream = $client->threads()->runs()->submitToolOutputsStreamed(
            threadId: $item->data->threadId,
            runId: $item->data->id,
            parameters: [
                'tool_outputs' => [[
                    'tool_call_id' => 'call_KSg14X7kZF2WDzlPhpQ168Mj',
                    'output' => '12',
                ]],
            ],
        );

        // Swap in the new stream's iterator; the loop picks it up next pass.
        $iterator = $stream->getIterator();
    }
}
```
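The `foreach` behavior described here follows from how PHP binds the traversable once when the loop starts: reassigning the variable inside the loop does not change what is being iterated. A minimal, library-free sketch of the difference (the generators below are stand-ins for the client's streams, not the actual classes):

```php
<?php

// Stand-in for a streamed response: a plain generator of items.
function makeStream(array $items): Generator
{
    foreach ($items as $item) {
        yield $item;
    }
}

// Case 1: foreach captures the generator object when the loop starts,
// so reassigning $stream inside the loop has no effect on iteration.
$stream = makeStream(['a', 'b', 'c']);
$seenForeach = [];
foreach ($stream as $item) {
    $seenForeach[] = $item;
    if ($item === 'b') {
        $stream = makeStream(['x', 'y']); // ignored by this foreach
    }
}

// Case 2: driving the iterator manually lets us swap streams mid-iteration.
$stream = makeStream(['a', 'b', 'c']);
$iterator = $stream;
$seenManual = [];
while ($iterator->valid()) {
    $item = $iterator->current();
    $iterator->next();
    $seenManual[] = $item;
    if ($item === 'b') {
        $iterator = makeStream(['x', 'y']); // picked up on the next loop pass
    }
}

print_r($seenForeach); // a, b, c
print_r($seenManual);  // a, b, x, y
```

This is why the `while`/`getIterator()` pattern works for the tool-output handoff while a plain `foreach` does not.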
Hey all, Is there anybody available to review this PR and maybe merge and release it so we can use the streaming functionality?
Thx for your efforts.
🥇@gehrisandro and 🥈@nunomaduro are the people
@punyflash Thanks for pointing out these issues.
- Lack of type annotations for stream events.
Yeah, this is something worth adding. I don't actually know how to set those up, so if you could point me to something or request the necessary changes, that'd be helpful.
- The example with tool output is not working.
Yeah, that example does need to be updated. Here's an updated example, closer to how I've been using it. I'll probably update the example in the readme with this.
```php
$stream = $client->threads()->runs()->createStreamed(
    threadId: 'thread_tKFLqzRN9n7MnyKKvc1Q7868',
    parameters: [
        'assistant_id' => 'asst_gxzBkD1wkKEloYqZ410pT5pd',
    ],
);

do {
    foreach ($stream as $response) {
        // $response->event: 'thread.run.created' | 'thread.run.in_progress' | ...
        // $response->data: ThreadResponse | ThreadRunResponse | ThreadRunStepResponse |
        //   ThreadRunStepDeltaResponse | ThreadMessageResponse | ThreadMessageDeltaResponse
        switch ($response->event) {
            case 'thread.run.created':
            case 'thread.run.queued':
            case 'thread.run.completed':
            case 'thread.run.cancelling':
                $run = $response->data;
                break;
            case 'thread.run.expired':
            case 'thread.run.cancelled':
            case 'thread.run.failed':
                $run = $response->data;
                break 3; // leave the switch, the foreach, and the do-while
            case 'thread.run.requires_action':
                // Overwrite the stream with the new stream started by submitting the tool outputs
                $stream = $client->threads()->runs()->submitToolOutputsStreamed(
                    threadId: $run->threadId,
                    runId: $run->id,
                    parameters: [
                        'tool_outputs' => [
                            [
                                'tool_call_id' => 'call_KSg14X7kZF2WDzlPhpQ168Mj',
                                'output' => '12',
                            ],
                        ],
                    ],
                );
                break;
        }
    }
} while ($run->status != 'completed');
```
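On the type-annotation point raised above, one stopgap until the package ships richer PHPDoc is to annotate the stream item shape yourself with `@var`, which is what gives the IDE its hints. A minimal, library-free sketch (`FakeStreamItem` is a hypothetical stand-in for the PR's `EventStreamResponseItem`, not the real class):

```php
<?php

// Hypothetical stand-in for an event stream item: a string event name
// plus the decoded data object, mirroring the PR's EventStreamResponseItem.
final class FakeStreamItem
{
    public function __construct(
        public string $event,
        public object $data,
    ) {}
}

/** @var iterable<FakeStreamItem> $stream  The @var annotation is what the IDE reads. */
$stream = [
    new FakeStreamItem('thread.run.created', (object) ['id' => 'run_123']),
];

foreach ($stream as $response) {
    // With the annotation above, the IDE knows $response->event is a string
    // and $response->data is an object, so autocompletion works.
    echo $response->event, PHP_EOL; // thread.run.created
}
```

In the real package the annotations would live on `EventStreamResponse`'s `getIterator()` return type rather than at each call site, but the local `@var` works today without any package changes.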
You can check these changes for adding faking capabilities; they can be implemented similarly in this branch: https://github.com/petermjr/openai-php/compare/v0.8.4...v0.8.5
Hi,
Thanks for the great and full-featured SDK. Is there an estimated timeline for when this feature might be integrated and when we might expect the next release?
Regards
@subet Unfortunately I have no idea for a timeline on this. In the meantime you can use the new functionality with this https://github.com/openai-php/client/pull/367#issuecomment-2046120856
I'm quite confident that all the functionality has been implemented in the same style as the rest of the API. We have at least four people using it on their projects without issues, so I do think it's close to being ready to go.
The next stage is for @gehrisandro and @nunomaduro to review the PR.
Hi @EthanBarlo
Thank you very much for your PR! I am going to review it now.
Hi, I have been actively tracking the progress of this PR, but lately it looks like there has been no movement since the tests were run. Is this what is blocking it?
Hi there, I have not checked the code yet.
My question: is it cost-optimized?
I'm new to OpenAI's GPT models and don't know much about their pricing, but they say there is an output cost per token.
The current way this library fetches the response is by listing all the messages: `$messages = $client->threads()->messages()->list('...');`. Does your code fetch the last message like this?
Because if output tokens were charged for fetching messages, it would be bad to fetch all the messages again and again.
@Guervyl Output tokens refer to text generated by the AI models, so retrieving previous messages and old text generations does not cost anything.
Thanks, @EthanBarlo, for your PR!
Changed some stuff to work similarly to the other streaming endpoints.
Will make a new release this week.
Thank you guys! Amazing work.
Please note that the current version points at the `OpenAI-Beta: assistants=v1` API; it will crash if you're using new v2 parameters such as `file_search`.