multimodal-ai topic


oreilly-multimodal-ai

Stars: 26, Forks: 13, Watchers: 26

Learn how multimodal AI merges text, image, and audio for smarter models

neocortex-unity-sdk

Stars: 25, Forks: 1, Watchers: 25

Neocortex Unity SDK for Smart NPCs and Virtual Assistants

prompt-to-puzzle

Stars: 45, Forks: 1, Watchers: 45

A web app that dynamically generates playable 'Spot the Difference' games from a single text prompt using a multimodal pipeline with Google's Gemini and Imagen models.

Building-Business-Ready-Generative-AI-Systems

Stars: 121, Forks: 37, Watchers: 121

This GitHub repository contains the complete code for building Business-Ready Generative AI Systems (GenAISys) from scratch. It guides you through architecting and implementing advanced AI controllers...

Snappy

Stars: 72, Forks: 14, Watchers: 72

🐊 Snappy's unique approach unifies vision-language late interaction with structured OCR for region-level knowledge retrieval. Like the project? Drop a star! ⭐

multimodal-ai

Stars: 21, Forks: 13, Watchers: 21

Enterprise-ready solution leveraging multimodal Generative AI (Gen AI) to enhance existing or new applications beyond text—implementing RAG, image classification, video analysis, and advanced image em...

SmartRAG

Stars: 15, Forks: 4, Watchers: 15

⚡ Production-ready .NET Standard 2.1 RAG library with 🤖 multi-AI provider support, 🏢 enterprise vector storage, 📄 intelligent document processing, and 🗄️ multi-database query coordination. 🌍 Cros...