multimodal-ai topic
oreilly-multimodal-ai
Learn how multimodal AI merges text, image, and audio for smarter models
neocortex-unity-sdk
Neocortex Unity SDK for Smart NPCs and Virtual Assistants
prompt-to-puzzle
A web app that dynamically generates playable 'Spot the Difference' games from a single text prompt using a multimodal pipeline with Google's Gemini and Imagen models.
Building-Business-Ready-Generative-AI-Systems
This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers...
Snappy
🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐
multimodal-ai
Enterprise-ready solution leveraging multimodal Generative AI (Gen AI) to enhance existing or new applications beyond text—implementing RAG, image classification, video analysis, and advanced image em...
SmartRAG
⚡ Production-ready .NET Standard 2.1 RAG library with 🤖 multi-AI provider support, 🏢 enterprise vector storage, 📄 intelligent document processing, and 🗄️ multi-database query coordination. 🌍 Cros...