Alpha-Co-Vision
Alpha-Co-Vision copied to clipboard
A real-time video caption to conversation bot that captures frames generates captions and creates conversational responses using a Large Language Models base to create interactive video descriptions.
When I run the script, the camera just flashed for one second, and I saw this error. ``` loc("varianceEps"("(mpsFileLoc): /AppleInternal/Library/BuildRoots/97f6331a-ba75-11ed-a4bc-863efbbaf80d/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShadersGraph/mpsgraph/MetalPerformanceShadersGraph/Core/Files/MPSGraphUtilities.mm":228:0)): error: input types 'tensor' and 'tensor' are not broadcast compatible...
will continue optimize the code - Add the config.json - Add the openai gpt-3.5-turbo integration
- Integrate with Edge-TTS - More optimized configs in config.json