lsy1973

Results 3 comments of lsy1973

save problem, any solve? * image width didn't work, will have some offset

Try this code. I modified get_video_chunk_content to get_video_audio_chunk_content, which now accepts video and audio inputs separately. I asked the model what animal was in the video, and it answered correctly....

> Your question is "what animal was in the video" in audio format, that is the audio file is your question? Can I specify the question (prompt) in text format?...