Clone voices into different languages using just a 3-second audio clip (a local version of https://huggingface.co/spaces/coqui/xtts).

Fully local DeepSite, with a backend that runs a Hugging Face model locally.

[MAC ONLY] A powerful and user-friendly web interface for FLUX, powered by MLX and Gradio via MFLUX

Text-to-Speech using IndicF5 for Indian languages
Sana is a text-to-image framework that can efficiently generate images
Real Time Speech Transcription
Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
DeepHermes, but without the need for a system prompt. It responds autonomously based on its OWN judgment. https://github.com/cocktailpeanut/deeperhermes
Install the AnimateDiff extension for Automatic1111 and its models with one click
Style Aligned Image Generation via Shared Attention https://style-aligned-gen.github.io/
Turn any video into Openpose video https://huggingface.co/spaces/fffiloni/video2openpose2
Florence-2 Image Captioning
BEN2 for background removal
Image Upscale is an AI-powered application designed to enhance and upscale images using advanced techniques like Stable Diffusion and Tile ControlNet. It provides high-quality image enhancement with options for HDR effects and customizable settings.
[NVIDIA ONLY] Temporally Consistent Human Image Animation using Diffusion Model https://showlab.github.io/magicanimate/
The best vocal remover application on the internet, and it's totally free and open source!