Orpheus TTS is an open-source text-to-speech system built on the Llama-3b backbone. Orpheus demonstrates the emergent capabilities of using LLMs for speech synthesis https://github.com/canopyai/Orpheus-TTS
MatAnyone AI is a tool for editing videos by separating objects from their backgrounds. It is an AI to remove the background from videos effectively. Stable Video Matting with Consistent Memory Propagation: https://github.com/pq-yang/MatAnyone.git
Generate synchronized audio from video and/or text inputs https://github.com/hkchengrex/MMAudio
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching https://huggingface.co/spaces/mrfakename/E2-F5-TTS
# SillyTavern Character Generator A pinokio script for https://github.com/Tremontaine/character-card-generator When used with KoboldCPP use http://localhost:5001/v1 Where 5001 is the port reported by KoboldCPP when starting Text API Key needs to be filled with anything. (If left empty will give a error so just add anything to it)
High quality LipSync Application with a simple UI
Image colorization
1 Click Installer for Retrieval-based-Voice-Conversion-WebUI (https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
Generate music in different genres using text and audio prompts.
Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech application
Automatically remove watermarks from videos generated by Sora AI.
A FastAPI wrapper for KokoroTTS. Integrates with Open-WebUI and other API-driven AI applications.
Build your own voice for StyleTTS2
Generate realistic and expressive speech with natural language voice design.
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics,
A simple, high-quality voice conversion tool focused on ease of use and performance.
A mass video player for easy browsing of large video datasets
