EVERYTHING AI FREE DOWNLOADS - By DICE
SCRIPT VERSION 1.5
Face Fusion 2.6.0
Next generation face swapper and enhancer
https://github.com/facefusion/facefusion-pinokio
SCRIPT VERSION 1.5
Hallo
[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
https://github.com/fudan-generative-vision/hallo
SCRIPT VERSION 1.5
Flash Diffusion
Accelerating any conditional diffusion model for few steps image generation
https://gojasper.github.io/flash-diffusion-project/
SCRIPT VERSION 1.5
Chat-With-Mlx
[Mac Only] An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.
https://github.com/qnguyen3/chat-with-mlx
PCM
Phased Consistency Model - generate high quality images with 2 steps
https://huggingface.co/spaces/radames/Phased-Consistency-Model-PCM
SCRIPT VERSION 1.5
Stable Audio
An Open Source Model for Audio Samples and Sound Design
https://github.com/Stability-AI/stable-audio-tools
SCRIPT VERSION 1.5
SillyTavern
a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters.
SCRIPT VERSION 1.5
AITown
Build and customize your own version of AI town - a virtual town where AI characters live, chat and socialize
https://github.com/a16z-infra/ai-town
Augmentoolkit
Turn any raw text into a high-quality dataset for AI finetuning
https://github.com/e-p-armstrong/augmentoolkit
LoRA the Explorer
Stable Diffusion LoRA Playground HuggingFace:
https://huggingface.co/spaces/multimodalart/LoraTheExplorer
LaVie
Text-to-Video (T2V) generation framework from Vchitect
https://github.com/Vchitect/LaVie
SCRIPT VERSION 1.3
Dust3r
Geometric 3D Vision Made Easy
https://dust3r.europe.naverlabs.com/
SCRIPT VERSION 1.5
LlamaFactory
Unify Efficient Fine-Tuning of 100+ LLMs
https://github.com/hiyouga/LLaMA-Factory
SCRIPT VERSION 1.5
Invoke
The Gen AI Platform for Pro Studios
https://github.com/invoke-ai/InvokeAI
SCRIPT VERSION 1.5
Openui
Describe UI and see it rendered live. Ask for changes and convert HTML to React, Svelte, Web Components, etc. Like vercel v0, but open source
https://github.com/wandb/openui
XTTS
Clone voices into different languages using just a quick 3-second audio clip. A local version of
https://huggingface.co/spaces/coqui/xtts
RVC
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
LCM
Fast Image generator using Latent consistency models
https://replicate.com/blog/run-latent-consistency-model-on-mac
SCRIPT VERSION 1.3
Whisper-WebUI
A Web UI for easy subtitle generation using the Whisper model
https://github.com/jhj0517/Whisper-WebUI
Realtime BakLLaVA
llama.cpp with the BakLLaVA model describes what it sees
https://github.com/Fuzzy-Search/realtime-bakllava
Realtime StableDiffusion
Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server
https://github.com/radames/Real-Time-Latent-Consistency-Model
SCRIPT VERSION 1
StreamDiffusion
[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation
https://github.com/cumulo-autumn/StreamDiffusion
SCRIPT VERSION 1
Moore-AnimateAnyone
[NVIDIA GPU ONLY] Unofficial Implementation of Animate Anyone
https://github.com/MooreThreads/Moore-AnimateAnyone
SCRIPT VERSION 1
Moore-AnimateAnyone-Mini
[NVIDIA ONLY] Efficient Implementation of Animate Anyone (13G VRAM + 2G model size)
https://github.com/sdbds/Moore-AnimateAnyone-for-windows
SCRIPT VERSION 1
PhotoMaker
Customizing Realistic Human Photos via Stacked ID Embedding
https://github.com/TencentARC/PhotoMaker
SCRIPT VERSION 1.1
BRIA RMBG
Background removal model developed by BRIA.AI, trained on a carefully selected dataset and available as an open-source model for non-commercial use
https://huggingface.co/spaces/briaai/BRIA-RMBG-1.4
SCRIPT VERSION 1.2
Gligen
An intuitive GUI for GLIGEN that uses ComfyUI in the backend
https://github.com/mut-ex/gligen-gui
SCRIPT VERSION 1.2
MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean
https://github.com/myshell-ai/MeloTTS
Chatbot-Ollama
open source chat UI for Ollama
https://github.com/ivanfioravanti/chatbot-ollama
SCRIPT VERSION 1.2
Differential-diffusion-ui
Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region
https://differential-diffusion.github.io/
SCRIPT VERSION 1.2
Supir
[NVIDIA ONLY] Text-driven, intelligent restoration, blending AI technology with creativity to give every image a brand new life
SCRIPT VERSION 1.5
ZeST
ZeST: Zero-Shot Material Transfer from a Single Image. Local port of
https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)
SCRIPT VERSION 1.5
StoryDiffusion Comics
create a story by generating consistent images
https://github.com/HVision-NKU/StoryDiffusion
SCRIPT VERSION 1.2
Lobe Chat
An open-source, modern-design ChatGPT/LLMs UI/Framework. Supports speech-synthesis, multi-modal, and extensible (function call) plugin system.
https://github.com/lobehub/lobe-chat
SCRIPT VERSION 1.5
Parler-tts
a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).
https://huggingface.co/spaces/parler-tts/parler_tts_mini
SCRIPT VERSION 1.5
Instantstyle
Upload an image and generate new images in that image's style. Instant generation with no LoRA required
https://huggingface.co/spaces/InstantX/InstantStyle
SCRIPT VERSION 1.5
Openvoice2
Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS
https://x.com/myshell_ai/status/1783161876052066793
SCRIPT VERSION 1.5
IDM-VTON
Improving Diffusion Models for Authentic Virtual Try-on in the Wild
https://huggingface.co/spaces/yisol/IDM-VTON
SCRIPT VERSION 1.5
Devika
Agentic AI Software Engineer
https://github.com/stitionai/devika
SCRIPT VERSION 1.2
Open WebUI
User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs
https://github.com/open-webui/open-webui
SCRIPT VERSION 1.5
CosXL
Edit images with just a prompt. An unofficial demo for CosXL and CosXL Edit from Stability AI
https://huggingface.co/spaces/multimodalart/cosxl
SCRIPT VERSION 1.5
Face-to-all
Diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a local version of
https://huggingface.co/spaces/multimodalart/face-to-all
SCRIPT VERSION 1.5
CustomNet
A unified encoder-based framework for object customization in text-to-image diffusion models
https://huggingface.co/spaces/TencentARC/CustomNet
SCRIPT VERSION 1.5
Brushnet
A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
https://huggingface.co/spaces/TencentARC/BrushNet
SCRIPT VERSION 1.5
Arc2Face
A Foundation Model of Human Faces
https://huggingface.co/spaces/FoivosPar/Arc2Face
SCRIPT VERSION 1.2
TripoSR
a state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, developed in collaboration between Tripo AI and Stability AI.
https://huggingface.co/spaces/stabilityai/TripoSR
SCRIPT VERSION 1.2
ZETA
Zero-Shot Text-Based Audio Editing Using DDPM Inversion
https://huggingface.co/spaces/hilamanor/audioEditing
SCRIPT VERSION 1.2
Remove-video-bg
Video background removal tool
https://huggingface.co/spaces/amirgame197/Remove-Video-Background
SCRIPT VERSION 1.1
[NVIDIA GPU ONLY] LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation
https://huggingface.co/spaces/ashawkey/LGM
SCRIPT VERSION 1
vid2pose
Video to Openpose & DWPose (All OS supported)
https://github.com/sdbds/vid2pose
SCRIPT VERSION 1
IP-Adapter-FaceID
Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model
https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID
SCRIPT VERSION 1
Dreamtalk
When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
https://github.com/ali-vilab/dreamtalk
SCRIPT VERSION 1
Video2Openpose
Turn any video into Openpose video
https://huggingface.co/spaces/fffiloni/video2openpose2
MagicAnimate Mini
[NVIDIA GPU Only] An optimized version of MagicAnimate
https://github.com/sdbds/magic-animate-for-windows
MagicAnimate
[NVIDIA GPU Only] Temporally Consistent Human Image Animation using Diffusion Model
https://showlab.github.io/magicanimate/
AudioSep
Separate Anything You Describe
https://huggingface.co/spaces/Audio-AGI/AudioSep
Tokenflow
Temporally consistent video editing. A local version of
https://huggingface.co/spaces/weizmannscience/tokenflow
ModelScope Image2Video (Nvidia GPU only)
Turn any image into a video! (Web UI created by fffiloni:
https://huggingface.co/spaces/fffiloni/MS-Image2Video)
Text Generation WebUI
A Gradio web UI for Large Language Models
https://github.com/oobabooga/text-generation-webui
SCRIPT VERSION 1
MAGNeT
MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions
https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md
SCRIPT VERSION 1
VideoCrafter 2
[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs, but slowly] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models
https://github.com/AILab-CVC/VideoCrafter
SCRIPT VERSION 1.1
Bark Voice Cloning
Upload a clean 20-second WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of