EVERYTHING AI FREE DOWNLOADS


Updated:

EVERYTHING AI FREE DOWNLOADS - By DICE

SCRIPT VERSION 1.5

Face Fusion 2.6.0

Next generation face swapper and enhancer

https://github.com/facefusion/facefusion-pinokio

SCRIPT VERSION 1.5

Hallo

[NVIDIA Only] Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

https://github.com/fudan-generative-vision/hallo

SCRIPT VERSION 1.5

Flash Diffusion

Accelerating any conditional diffusion model for few steps image generation

https://gojasper.github.io/flash-diffusion-project/

SCRIPT VERSION 1.5

Chat-With-Mlx

[Mac Onlyl] An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

https://github.com/qnguyen3/chat-with-mlx

PCM

Phased Consistency Model - generate high quality images with 2 steps

https://huggingface.co/spaces/radames/Phased-Consistency-Model-PCM

SCRIPT VERSION 1.5

Stable Audio

An Open Source Model for Audio Samples and Sound Design

https://github.com/Stability-AI/stable-audio-tools

SCRIPT VERSION 1.5

SillyTavern

a local-install interface that allows you to interact with text generation AIs (LLMs) to chat and roleplay with custom characters.

https://docs.sillytavern.app/

SCRIPT VERSION 1.5

AITown

Build and customize your own version of AI town - a virtual town where AI characters live, chat and socialize

https://github.com/a16z-infra/ai-town

Augmentoolkit

Turn any raw text into a high-quality dataset for AI finetuning

https://github.com/e-p-armstrong/augmentoolkit

LoRA the Explorer

Stable Diffusion LoRA Playground HuggingFace:

https://huggingface.co/spaces/multimodalart/LoraTheExplorer

lavie

Text-to-Video (T2V) generation framework from Vchitect

https://github.com/Vchitect/LaVie

SCRIPT VERSION 1.3

Dust3r

Geometric 3D Vision Made Easy

https://dust3r.europe.naverlabs.com/

SCRIPT VERSION 1.5

LlamaFactory

Unify Efficient Fine-Tuning of 100+ LLMs

https://github.com/hiyouga/LLaMA-Factory

SCRIPT VERSION 1.5

Invoke

The Gen AI Platform for Pro Studios

https://github.com/invoke-ai/InvokeAI

SCRIPT VERSION 1.5

Openui

Describe UI and see it rendered live. Ask for changes and convert HTML to React, Svelte, Web Components, etc. Like vercel v0, but open source

https://github.com/wandb/openui

XTTS

clone voices into different languages by using just a quick 3-second audio clip. (a local version of

https://huggingface.co/spaces/coqui/xtts

RVC

1 Click Installer for Retrieval-based-Voice-Conversion-WebUI

https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI

LCM

Fast Image generator using Latent consistency models

https://replicate.com/blog/run-latent-consistency-model-on-mac

SCRIPT VERSION 1.3

Whisper-WebUI

A Web UI for easy subtitle using whisper model

https://github.com/jhj0517/Whisper-WebUI

Realtime BakLLaVA

llama.cpp with BakLLaVA model describes what does it see

https://github.com/Fuzzy-Search/realtime-bakllava

Realtime StableDiffusion

Demo showcasing ~real-time Latent Consistency Model pipeline with Diffusers and a MJPEG stream server

https://github.com/radames/Real-Time-Latent-Consistency-Model

SCRIPT VERSION 1

StreamDiffusion

[NVIDIA ONLY] A Pipeline-Level Solution for Real-Time Interactive Generation

https://github.com/cumulo-autumn/StreamDiffusion

SCRIPT VERSION 1

Moore-AnimateAnyone

[NVIDIA GPU ONLY] Unofficial Implementation of Animate Anyone

https://github.com/MooreThreads/Moore-AnimateAnyone

SCRIPT VERSION 1

Moore-AnimateAnyone-Mini

[NVIDIA ONLY] Efficient Implementation of Animate Anyone (13G VRAM + 2G model size)

https://github.com/sdbds/Moore-AnimateAnyone-for-windows

SCRIPT VERSION 1

PhotoMaker

Customizing Realistic Human Photos via Stacked ID Embedding

https://github.com/TencentARC/PhotoMaker

SCRIPT VERSION 1.1

BRIA RMBG

Background removal model developed by BRIA.AI, trained on a carefully selected dataset and is available as an open-source model for non-commercial use

https://huggingface.co/spaces/briaai/BRIA-RMBG-1.4

SCRIPT VERSION 1.2

Gligen

An intuitive GUI for GLIGEN that uses ComfyUI in the backend

https://github.com/mut-ex/gligen-gui

SCRIPT VERSION 1.2

MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean

https://github.com/myshell-ai/MeloTTS

Chatbot-Ollama

open source chat UI for Ollama

https://github.com/ivanfioravanti/chatbot-ollama

SCRIPT VERSION 1.2

Differential-diffusion-ui

Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region

https://differential-diffusion.github.io/

SCRIPT VERSION 1.2

Supir

[NVIDIA ONLY] Text-driven, intelligent restoration, blending AI technology with creativity to give every image a brand new life

https://supir.xpixel.group

SCRIPT VERSION 1.5

ZeST

ZeST: Zero-Shot Material Transfer from a Single Image. Local port of

https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)

SCRIPT VERSION 1.5

StoryDiffusion Comics

create a story by generating consistent images

https://github.com/HVision-NKU/StoryDiffusion

SCRIPT VERSION 1.2

Lobe Chat

An open-source, modern-design ChatGPT/LLMs UI/Framework. Supports speech-synthesis, multi-modal, and extensible (function call) plugin system.

https://github.com/lobehub/lobe-chat

SCRIPT VERSION 1.5

Parler-tts

a lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).

https://huggingface.co/spaces/parler-tts/parler_tts_mini

SCRIPT VERSION 1.5

Instantstyle

Upload the picture of an image, and generate images with that image style. Instant generation with no LoRA required

https://huggingface.co/spaces/InstantX/InstantStyle

SCRIPT VERSION 1.5

Openvoice2

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS

https://x.com/myshell_ai/status/1783161876052066793

SCRIPT VERSION 1.5

IDM-VTON

Improving Diffusion Models for Authentic Virtual Try-on in the Wild

https://huggingface.co/spaces/yisol/IDM-VTON

SCRIPT VERSION 1.5

Devika

Agentic AI Software Engineer

https://github.com/stitionai/devika

SCRIPT VERSION 1.2

Open WebUI

User-friendly WebUI for LLMs, supported LLM runners include Ollama and OpenAI-compatible APIs

https://github.com/open-webui/open-webui

SCRIPT VERSION 1.5

CosXL

Edit images with just prompt, an unofficial demo for CosXL and CosXL Edit from Stability AI,

https://huggingface.co/spaces/multimodalart/cosxl

SCRIPT VERSION 1.5

Face-to-all

Diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI) - a localized Version of

https://huggingface.co/spaces/multimodalart/face-to-all

SCRIPT VERSION 1.5

CustomNet

A unified encoder-based framework for object customization in text-to-image diffusion models

https://huggingface.co/spaces/TencentARC/CustomNet

SCRIPT VERSION 1.5

Brushnet

A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

https://huggingface.co/spaces/TencentARC/BrushNet

SCRIPT VERSION 1.5

Arc2Face

A Foundation Model of Human Faces

https://huggingface.co/spaces/FoivosPar/Arc2Face

SCRIPT VERSION 1.2

TripoSR

a state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, developed in collaboration between Tripo AI and Stability AI.

https://huggingface.co/spaces/stabilityai/TripoSR

SCRIPT VERSION 1.2

ZETA

Zero-Shot Text-Based Audio Editing Using DDPM Inversion

https://huggingface.co/spaces/hilamanor/audioEditing

SCRIPT VERSION 1.2

Remove-video-bg

Video background removal tool

https://huggingface.co/spaces/amirgame197/Remove-Video-Background

SCRIPT VERSION 1.1

[NVIDIA GPU ONLY] LGM

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

https://huggingface.co/spaces/ashawkey/LGM

SCRIPT VERSION 1

vid2pose

Video to Openpose & DWPose (All OS supported)

https://github.com/sdbds/vid2pose

SCRIPT VERSION 1

IP-Adapter-FaceID

Enter a face image and transform it to any other image. Demo for the h94/IP-Adapter-FaceID model

https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID

SCRIPT VERSION 1

Dreamtalk

When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

https://github.com/ali-vilab/dreamtalk

SCRIPT VERSION 1

Video2Openpose

Turn any video into Openpose video

https://huggingface.co/spaces/fffiloni/video2openpose2

MagicAnimate Mini

[NVIDIA GPU Only] An optimized version of MagicAnimate

https://github.com/sdbds/magic-animate-for-windows

MagicAnimate

[NVIDIA GPU Only] Temporally Consistent Human Image Animation using Diffusion Model

https://showlab.github.io/magicanimate/

AudioSep

Separate Anything You Describe

https://huggingface.co/spaces/Audio-AGI/AudioSep

Tokenflow

Temporally consistent video editing. A local version of

https://huggingface.co/spaces/weizmannscience/tokenflow

ModelScope Image2Video (Nvidia GPU only)

Turn any image into a video! (Web UI created by fffiloni:

https://huggingface.co/spaces/fffiloni/MS-Image2Video)

Text Generation WebUI

A Gradio web UI for Large Language Models

https://github.com/oobabooga/text-generation-webui

SCRIPT VERSION 1

MAGNeT

MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions

https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md

SCRIPT VERSION 1

VideoCrafter 2

[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs but slow] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models

https://github.com/AILab-CVC/VideoCrafter

SCRIPT VERSION 1.1

Bark Voice Cloning

Upload a clean 20 seconds WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of

https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning

6
0

Comments