Deep-Learning-Applications-with-GUI
Deep-Learning-Applications-with-GUI copied to clipboard
List of FOSS Deep Learning Applications with GUI and/or CLI
Deep-Learning-Applications-with-GUI
List of FOSS Deep Learning (Desktop) Applications with GUI and/or CLI. Most Deep Learning models require a complicated setup and are not ready to use out of the box. At best there is a Colab or Jupiter notebook available, which is good for testing, but not for production use.
The software in this list is ready to use (CLI or GUI).
:camera_flash: NeRF ๐ Text ๐จ Image :notes: Audio ๐๏ธ Video frame interpolation ๐ผ๏ธ๐๏ธ Video upscaler ๐ผ๏ธ Image/Video upscaling ๐งฉ Ready to use Plugins
:camera_flash: 3D / NeRF: Neural Radiance Fields
https://github.com/NVlabs/instant-ngp#interactive-training-and-rendering
๐ Text
Jan
Jan is an open-source ChatGPT alternative that runs 100% offline on your computer.
https://github.com/janhq/jan
text-generation-webui
A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.
Its goal is to become the AUTOMATIC1111/stable-diffusion-webui of text generation.
https://github.com/oobabooga/text-generation-webui
chatai
https://github.com/Capsize-Games/chatai
Audio to text
https://github.com/Const-me/Whisper
https://github.com/kaixxx/noScribe
https://github.com/CheshireCC/fatser-whisper-GUI (highly recommended)
https://github.com/Purfview/whisper-standalone-win (ready to use .exe for cli use)
https://github.com/FL33TW00D/whisper-turbo (wip)
subtitle
https://github.com/URUWorks/TeroSubtitler (whisper support)
video-subtitle-generator
https://github.com/YaoFANGUK/video-subtitle-generator
Translation
https://github.com/argosopentech/argos-translate#installation https://github.com/ymoslem/DesktopTranslator
Chat-Style LLM (Alpaca/Llama) (Chat GTP Clone)
https://github.com/antimatter15/alpaca.cpp https://github.com/tatsu-lab/stanford_alpaca https://github.com/tloen/alpaca-lora
dalai
Run LLaMA and Alpaca on your computer.
https://github.com/cocktailpeanut/dalai
GTP4All
Installs a native chat-client with auto-update functionality that runs on your desktop with the GPT4All-J model baked into it.
https://github.com/nomic-ai/gpt4all
Gpt4All Web UI
This is a Flask web application that provides a chat UI for interacting with llamacpp based chatbots such as GPT4all, vicuna etc...
https://github.com/nomic-ai/gpt4all-ui
AutoGPT GUI
A graphical user interface to AutoGPT
https://github.com/thecookingsenpai/autogpt-gui
๐จ Image
Fooocus
Fooocus is an image generating software.
https://github.com/lllyasviel/Fooocus
Style2Paints
Style2paints V4 is the current best AI driven lineart colorization tool.
https://github.com/lllyasviel/style2paints
Source code not available for the latest release.
JavaScript, Python, Tensorflow
StableStudio
StableStudio, the open-source version of DreamStudio!
https://github.com/Stability-AI/StableStudio
NMKD Stable Diffusion GUI - AI Image Generator
https://github.com/n00mkrad/text2image-gui
https://nmkd.itch.io/t2i-gui
stable-diffusion-webui-forge
https://github.com/lllyasviel/stable-diffusion-webui-forge
stable-diffusion-ui
Easiest 1-click way to install and use Stable Diffusion on your own computer. Provides a browser UI for generating images from text prompts and images. Just enter your text prompt, and see the generated image
https://github.com/cmdr2/stable-diffusion-ui
MochiDiffusion
This app uses Apple's Core ML Stable Diffusion implementation to achieve maximum performance and speed on Apple Silicon based Macs while reducing memory requirements.
https://github.com/godly-devotion/MochiDiffusion
ComfyUI
This ui will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart based interface. For some workflow examples and see what ComfyUI can do you can check out:
https://github.com/comfyanonymous/ComfyUI
Stable Diffusion on AMD GPU
https://github.com/azuritecoin/OnnxDiffusersUI (deprecated) https://github.com/lshqqytiger/stable-diffusion-webui-directml
Diffusion Bee - Stable Diffusion GUI App for MacOS
https://github.com/divamgupta/diffusionbee-stable-diffusion-ui
WebUI
ready to use colab based notebooks with web ui
https://github.com/camenduru/stable-diffusion-webui-colab
InvokeAI
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
https://github.com/invoke-ai/InvokeAI
airunner
https://github.com/Capsize-Games/airunner $
Lama Cleaner
A free and open-source inpainting tool powered by SOTA AI model.
https://github.com/Sanster/lama-cleaner
https://panicbyte.itch.io/lama-cleaner
carefree-creator
AI magics meet Infinite draw board.
https://github.com/carefree0910/carefree-creator
photoshot
An open-source AI avatar generator web app
https://github.com/shinework/photoshot
DeOldify.NET
DeOldify.NET: cross-platform application for coloring black and white photos"
https://github.com/ColorfulSoft/DeOldify.NET
Queryable
Offline photo search App for iOS
https://github.com/mazzzystar/Queryable
X-AnyLabeling
Effortless data labeling with AI support from Segment Anything and other awesome models.
https://github.com/CVHub520/X-AnyLabeling
:notes: Audio
clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio
https://github.com/jianchang512/clone-voice
tacotron2-tts-GUI
GUI wrapper for synthesize. Allows CPU-only synthesis via a toggleable switch. Portable exe file is available (that runs on CPU only).
https://github.com/lokkelvin2/tacotron2-tts-GUI
riffusion
Riffusion is an app for real-time music generation with stable diffusion.
https://github.com/riffusion/riffusion-app
whisper-ui
Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification.
This is a simple Streamlit UI for OpenAI's Whisper speech-to-text model. It let's you download and transcribe media from YouTube videos, playlists, or local files. You can then browse, filter, and search through your saved audio files.
https://github.com/hayabhay/whisper-ui for https://github.com/openai/whisper
https://grisk.itch.io/whisper-gui
buzz
GUI for whisper
https://github.com/chidiwilliams/buzz
Ultimate Vocal Remover GUI v5.5.1
This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
https://github.com/Anjok07/ultimatevocalremovergui
SpleeterGUI
SpleeterGUI - Music source separation desktop app
Windows Desktop Front end for Spleeter - AI source separation
https://github.com/boy1dr/SpleeterGui
SoftVC VITS Singing Voice Conversion Fork
https://github.com/34j/so-vits-svc-fork
subsync subtitle alignment
This is an automatic movie subtitle synchronization tool. Synchronization is done by listening to the audio track, translating it if necessary.
https://github.com/sc0ty/subsync/
๐๏ธ Video frame interpolation
Video Dubbing
https://github.com/jianchang512/pyvideotrans
DAIN-App
The backbone of this project is a algorithm called DAIN. (https://github.com/baowenbo/DAIN)
https://github.com/BurguerJohn/Dain-App
https://grisk.itch.io/dain-app?download
DAIN Vulkan GUI
AI-Powered video interpolater (eg. 30fps -> 60fps) for Vulkan devices. Based on dain-ncnn-vulkan and ffmpeg
https://github.com/Mar2ck/DAIN-Vulkan-GUI
dain-ncnn-vulkan
ncnn implementation of DAIN, Depth-Aware Video Frame Interpolation.
https://github.com/nihui/dain-ncnn-vulkan
Dain-App
https://github.com/BurguerJohn/Dain-App
cain-ncnn-vulkan
Video Frame Interpolation
https://github.com/nihui/cain-ncnn-vulkan
Flowframes
Flowframes is a simple but powerful app that utilizes AI frameworks to interpolate videos in order to increase their framerate with little to no noticable quality loss. (DAIN, CAIN, RIFE)
https://github.com/n00mkrad/flowframes
https://nmkd.itch.io/flowframes
๐ผ๏ธ๐๏ธ Video upscaler
video2x
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR. Started in Hack the Valley 2, 2018. https://github.com/k4yt3x/video2x
https://github.com/Communist02/AnimePlayer
๐๏ธ Video Deep Face
https://github.com/iperov/DeepFaceLive
๐ผ๏ธ Image/Video upscaling
Final2x
This is a powerful tool that allows for image super-resolution to arbitrary sizes using multiple models, designed to enhance the resolution and quality of images, making them clearer and more detailed. Presently, it supports several models, such as RealCUGAN, RealESRGAN, Waifu2x, and SRMD.
https://github.com/Tohrusky/Final2x
Upscayl
Upscayl is a cross-platform application built with the Linux-first philosophy. This means that we prioritize Linux builds over others but that doesn't mean we'll break things for other OSes :) Makes use of Real-ESRGAN
https://github.com/upscayl/upscayl
Anime4kSharp
https://github.com/shadow578/Anime4kSharp
dandere2x-tremx
video compression https://github.com/Tremeschin/dandere2x-tremx
Cupscale
Image Upscaling GUI based on ESRGAN - WORK IN PROGRESS
https://github.com/n00mkrad/cupscale
IEU.Winforms
Windows only GUI for ESRGAN with additional features https://github.com/ptrsuder/IEU.Winforms
NVIDIA-Ansel-AI-Enhancer
NVIDIA Upscaler, utilizes NVIDIAยดs Ansel RTX. Requires latest drivers and a RTX GPU. Free, but not open source.
https://github.com/dynamiquel/NVIDIA-Ansel-AI-Enhancer
RealSR
Real-World Super-Resolution via Kernel Estimation and Noise Injection
https://github.com/jixiaozhong/RealSR
(https://github.com/nihui/RealSR)
realsr-ncnn-vulkan
https://github.com/nihui/realsr-ncnn-vulkan
Shell upscaler
https://github.com/n00mkrad/shell-upscaler
Acts as a GUI for esrgan-launcher.
srmd-ncnn-vulkan
https://github.com/nihui/srmd-ncnn-vulkan
VapourSynth-Super-Resolution-Helper
https://github.com/AlphaAtlas/VapourSynth-Super-Resolution-Helper
Waifu2x-Extension-GUI
Photo/Video/GIF enlargement using machine learning
https://github.com/AaronFeng753/Waifu2x-Extension-GUI
C++, python, qt, caffe, ncnn_vulkan
AGPL-3.0 License
waifu2x-ncnn-vulkan
https://github.com/nihui/waifu2x-ncnn-vulkan
GAN / Style Transfer
TorchRayLib
TorchRayLib++: A CMake based AI & generative art platform, integrating the rayib GUI and the PyTorch C++ Deep Learning Library.
https://github.com/QuantScientist/TorchRayLib
more
https://github.com/Seprentes/ScaleUp https://github.com/HolyWu/vs-realesrgan https://github.com/tumuyan/RealSR-NCNN-Android https://github.com/TransparentLC/realesrgan-gui/blob/master/README.en-US.md https://github.com/Lucchetto/SuperImage https://github.com/Tohrusky/Final2x
:toolbox: DL Training GUI
AIDeveloper
GUI-based software for training, evaluating and applying deep neural nets for image classification
https://github.com/maikherbig/AIDeveloper
๐งฉ Ready to use Plugins
vstSpleeter
audio source separation library vst plugin
https://github.com/gvne/vstSpleeter
various apps (untested)
Paper2GUI is an AI desktop Application toolbox for everyone no matter your background with it being free and ready to use out of the box, already supporting 10+ AI models, covering the fields of speech synthesis, video frame filling, video overscoring, target detection, image stylization, OCR recognition, etc. Supporting Windows, Mac OS, Linux systems.
https://github.com/Baiyuetribe/paper2gui/blob/main/README_en.md
Free: https://www.nvidia.com/en-us/studio/canvas/