EVERYTHING AI FREE DOWNLOADS

A background removal model developed by BRIA.AI, trained on a carefully selected dataset and available as an open-source model for non-commercial use

https://huggingface.co/spaces/briaai/BRIA-RMBG-1.4

SCRIPT VERSION 1.2

Gligen

An intuitive GUI for GLIGEN that uses ComfyUI in the backend

https://github.com/mut-ex/gligen-gui

SCRIPT VERSION 1.2

MeloTTS

High-quality multilingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean

https://github.com/myshell-ai/MeloTTS

Chatbot-Ollama

Open-source chat UI for Ollama

https://github.com/ivanfioravanti/chatbot-ollama

SCRIPT VERSION 1.2

Differential-diffusion-ui

Differential Diffusion modifies an image according to a text prompt, and according to a map that specifies the amount of change in each region

https://differential-diffusion.github.io/

SCRIPT VERSION 1.2

Supir

[NVIDIA ONLY] Text-driven, intelligent restoration, blending AI technology with creativity to give every image a brand new life

https://supir.xpixel.group

SCRIPT VERSION 1.5

ZeST

ZeST: Zero-Shot Material Transfer from a Single Image. Local port of

https://huggingface.co/spaces/fffiloni/ZeST (Project: https://ttchengab.github.io/zest/)

SCRIPT VERSION 1.5

StoryDiffusion Comics

Create a story by generating consistent images

https://github.com/HVision-NKU/StoryDiffusion

SCRIPT VERSION 1.2

Lobe Chat

An open-source, modern-design ChatGPT/LLMs UI/Framework. Supports speech synthesis, multimodal input, and an extensible (function call) plugin system.

https://github.com/lobehub/lobe-chat

SCRIPT VERSION 1.5

Parler-tts

A lightweight text-to-speech (TTS) model that can generate high-quality speech with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).

https://huggingface.co/spaces/parler-tts/parler_tts_mini

SCRIPT VERSION 1.5

Instantstyle

Upload a reference image and generate new images in that image's style. Instant generation with no LoRA required

https://huggingface.co/spaces/InstantX/InstantStyle

SCRIPT VERSION 1.5

Openvoice2

Openvoice 2 Web UI - A local web UI for Openvoice2, a multilingual voice cloning TTS

https://x.com/myshell_ai/status/1783161876052066793

SCRIPT VERSION 1.5

IDM-VTON

Improving Diffusion Models for Authentic Virtual Try-on in the Wild

https://huggingface.co/spaces/yisol/IDM-VTON

SCRIPT VERSION 1.5

Devika

Agentic AI Software Engineer

https://github.com/stitionai/devika

SCRIPT VERSION 1.2

Open WebUI

User-friendly WebUI for LLMs. Supported LLM runners include Ollama and OpenAI-compatible APIs

https://github.com/open-webui/open-webui

SCRIPT VERSION 1.5

CosXL

Edit images with just a prompt. An unofficial demo for CosXL and CosXL Edit from Stability AI

https://huggingface.co/spaces/multimodalart/cosxl

SCRIPT VERSION 1.5

Face-to-all

Diffusers InstantID + ControlNet inspired by face-to-many from fofr (https://x.com/fofrAI). A local version of

https://huggingface.co/spaces/multimodalart/face-to-all

SCRIPT VERSION 1.5

CustomNet

A unified encoder-based framework for object customization in text-to-image diffusion models

https://huggingface.co/spaces/TencentARC/CustomNet

SCRIPT VERSION 1.5

Brushnet

A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion

https://huggingface.co/spaces/TencentARC/BrushNet

SCRIPT VERSION 1.5

Arc2Face

A Foundation Model of Human Faces

https://huggingface.co/spaces/FoivosPar/Arc2Face

SCRIPT VERSION 1.2

TripoSR

A state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, developed in collaboration between Tripo AI and Stability AI.

https://huggingface.co/spaces/stabilityai/TripoSR

SCRIPT VERSION 1.2

ZETA

Zero-Shot Text-Based Audio Editing Using DDPM Inversion

https://huggingface.co/spaces/hilamanor/audioEditing

SCRIPT VERSION 1.2

Remove-video-bg

Video background removal tool

https://huggingface.co/spaces/amirgame197/Remove-Video-Background

SCRIPT VERSION 1.1

[NVIDIA GPU ONLY] LGM

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

https://huggingface.co/spaces/ashawkey/LGM

SCRIPT VERSION 1

vid2pose

Video to Openpose & DWPose (All OS supported)

https://github.com/sdbds/vid2pose

SCRIPT VERSION 1

IP-Adapter-FaceID

Enter a face image and transform it into any other image. Demo for the h94/IP-Adapter-FaceID model

https://huggingface.co/spaces/multimodalart/Ip-Adapter-FaceID

SCRIPT VERSION 1

Dreamtalk

When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

https://github.com/ali-vilab/dreamtalk

SCRIPT VERSION 1

Video2Openpose

Turn any video into Openpose video

https://huggingface.co/spaces/fffiloni/video2openpose2

MagicAnimate Mini

[NVIDIA GPU Only] An optimized version of MagicAnimate

https://github.com/sdbds/magic-animate-for-windows

MagicAnimate

[NVIDIA GPU Only] Temporally Consistent Human Image Animation using Diffusion Model

https://showlab.github.io/magicanimate/

AudioSep

Separate Anything You Describe

https://huggingface.co/spaces/Audio-AGI/AudioSep

Tokenflow

Temporally consistent video editing. A local version of

https://huggingface.co/spaces/weizmannscience/tokenflow

ModelScope Image2Video (Nvidia GPU only)

Turn any image into a video! (Web UI created by fffiloni:

https://huggingface.co/spaces/fffiloni/MS-Image2Video)

Text Generation WebUI

A Gradio web UI for Large Language Models

https://github.com/oobabooga/text-generation-webui

SCRIPT VERSION 1

MAGNeT

MAGNeT is a text-to-music and text-to-sound model capable of generating high-quality audio samples conditioned on text descriptions

https://github.com/facebookresearch/audiocraft/blob/main/docs/MAGNET.md

SCRIPT VERSION 1

VideoCrafter 2

[Runs fast on NVIDIA GPUs. Works on M1/M2/M3 Macs but slow] VideoCrafter is an open-source video generation and editing toolbox for crafting video content. It currently includes the Text2Video and Image2Video models

https://github.com/AILab-CVC/VideoCrafter

SCRIPT VERSION 1.1

Bark Voice Cloning

Upload a clean 20-second WAV file of the vocal persona you want to mimic, type your text-to-speech prompt and hit submit! A local version of

https://huggingface.co/spaces/fffiloni/instant-TTS-Bark-cloning
