Ovi is a veo-3 like, video+audio generation model that simultaneously generates both video and audio content from text or text+image inputs.