Wonder3D: Single Image to 3D using Cross-Domain Diffusion |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-to-3D stable-diffusion |
Latent Consistency Models: Synthesizing High-Resolution Images with Few-step Inference |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image image-to-image video-to-video stable-diffusion |
SadTalker: A realistic and stylized talking head video generation method from a single image and audio |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-to-video video-generation |
WizardCoder-Python-34B on Colab's free tier - GGUF |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation GGUF GGML WizardCoder |
SeamlessM4T: Multimodal model for speech translation |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
translation speech-to-speech speech-to-text text-to-speech |
Alternatives to Stable Diffusion "WebUI" on Google Colab's Free Tier |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
SDXL 1.0 with T2I Adapter, ControlNet, Inpainting and train LoRA |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion LoRA inpainting |
llama-cpp-python with Llama 2 13B-chat |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation GGML Llama 2 |
AutoGPTQ with WizardCoder 15B |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation GPTQ WizardCoder |
SDXL 0.9 |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
Massively Multilingual Speech (MMS) |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
speech-to-text text-to-speech spoken-language-identification |
Segmentation Demos, Metaseg, SegGPT, Prismer |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-segmentation video-segmentation |
ControlNet |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image inpainting outpainting stable-diffusion |
Track Anything Colab |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
video-segmentation video-inpainting stable-diffusion |
Subtitles transcription and translation |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
speech-to-text translation |
LoRA Stable Diffusion with diffusers |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
LoRA stable-diffusion text-to-image |
DreamBooth Stable Diffusion |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
DreamBooth text-to-image stable-diffusion |
Real-ESRGAN and GFPGAN |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration super-resolution |
Latent Upscaler |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration super-resolution |
CodeFormer: Video Image Restoration |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration video-restoration |
VToonify |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
style-transfer |
Inverse Cross Attention SD |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion inpainting |
CrossAttention SD Koiboi |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
MAXIM |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-denoising low-light-enhancement image-retouching dehazing-indoors dehazing-outdoors image-deraining image-deblurring |
N2V for Image Denoising of Single-Channel Images |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-denoising |
OCR with Pytesseract and OpenCV |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
OCR |
Donut |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
OCR document-parsing |
InstructPix2Pix with Diffusers |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
Deforum |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
stable-diffusion text-to-video interpolation |
In-painting, Image-to-Image, Depth-to-Image, SD Diffusers |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
Stable Diffusion Custom Model |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
TPU Stable Diffusion Fast Jax |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
AltDiffusion M9 Multilingual CLIP |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
Fine-tuned VAE Decoder Compare Stable Diffusion |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
Petals: Guanaco-65B, LLaMA-65B, BLOOM and BLOOMZ in Colab |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation |
Infinity Colab |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
inpainting outpainting text-to-image stable-diffusion |
Music Video Killed The Radio Star, Diffusion |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-video stable-diffusion |
Interpreting Stable Diffusion Using Cross Attention, DAAM, Diffusers |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
DeepFloyd IF Free Tier Google Colab |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-to-image stable-diffusion |
Segment Anything |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-segmentation |
Microsoft Bringing Old Photos Back to Life |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration |
UniColor: A Unified Framework for Multi-Modal Colorization with Transformer |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration |
DISCO: Disentangled Image Colorization via Global Anchors |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration |
DeOldify Colab |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
image-restoration |
DeOldify VideoColorizer Colab |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
video-restoration |
Deep Exemplar-based Video Colorization |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
video-restoration |
BLOOM BigScience with bitsandbytes int8 |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation |
OPT Open Pre-trained Transformers |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation |
LLaMA 4-bit Quantization |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation GPTQ |
ChatPDF h2ogpt |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation |
Diarization Whisper and Pyannote Transcripts with Speaker Names |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
speech-to-text diarization |
NER Biomedical with Stanza and Summarization |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation |
Wav2Vec2 Transcription |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
speech-to-text |
Fix Grammar, Punctuation Correction, Language Detection |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
text-generation |
FILM: Video Generation |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
interpolation |
PyTTI Tools and FILM |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
interpolation |
SAHI and Detectron2 |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
object-detection |
SAHI Inference for YOLOv5 |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
object-detection |
DE-DETR, Panoptic, and Others |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
object-detection image-segmentation |
Detectron2 Tutorial |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
object-detection segmentation |
Detic_clip and Detectron2 |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
object-detection segmentation |
U-2-Net Demonstration Colab 2 |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
background-removal |
U-2-Net Step by Step Demonstration |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
background-removal |
DIS: Dichotomous Image Segmentation |
![Open](https://github.com/R3gm/InsightSolver-Colab/raw/main/assets/colab.svg) |
background-removal |