Use case

AI Voiceover API

POST text to the published webhook, get back a studio-quality voiceover. Integrate AI narration into any app, store, or CMS without a TTS SDK.

Who this is for

Product teams, app developers, e-commerce platforms

The problem

Adding voiceovers to an app means picking a TTS provider, writing SDK integration code, handling retries, and managing API keys across environments. For a feature that ships audio, the plumbing takes longer than the product work.

The flow

Build the voice pipeline on the canvas and publish it as a webhook, then POST raw text to the endpoint. The Gemini Text node cleans the input for natural speech by expanding abbreviations, inserting pauses, and normalizing formatting. The ElevenLabs Audio node synthesizes the voiceover, and the Respond to Webhook node returns the audio file. Your app calls one URL instead of managing a TTS SDK.
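From the calling app's side, the flow above might look like the following minimal Python sketch. The endpoint URL, the "text" field name, and the assumption that the response body is the raw audio file are all placeholders for illustration; check the published webhook for the actual URL and payload shape.

```python
import json
import urllib.request

# Hypothetical endpoint: replace with the URL shown when you publish the flow.
WEBHOOK_URL = "https://example.com/webhooks/voiceover"


def build_voiceover_request(text: str, url: str = WEBHOOK_URL) -> urllib.request.Request:
    """Build a JSON POST carrying the raw text to narrate."""
    if not text.strip():
        raise ValueError("text must not be empty")
    # Assumption: the HTTP Trigger reads the input from a "text" field.
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_voiceover(text: str, out_path: str, url: str = WEBHOOK_URL,
                    timeout: float = 60.0) -> int:
    """POST the text and write the returned bytes to out_path.

    Assumes the response body is the audio file itself.
    Returns the number of bytes written.
    """
    request = build_voiceover_request(text, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return len(audio)
```

A call such as `fetch_voiceover("Welcome to our store.", "welcome.mp3")` would then save the narration locally; the `.mp3` extension is a guess, since the audio format depends on how the flow is configured.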

The outcome

A voiceover API endpoint built from two AI nodes between a trigger and a response. Any app that can POST JSON gets studio-quality narration back.

Nodes used

HTTP Trigger, Text (Gemini), Audio (ElevenLabs), Respond to Webhook

More use cases


Product Video Ads
Upload a product photo, get back a video ad with generated copy and voiceover. One webhook call replaces your entire creative production queue.

Narrated Explainer Videos
Generate a script, a hero scene visual, an ≤8-second clip, and full-length narration from one topic prompt on the canvas.

Social Media Content Pipeline
POST a creative brief via webhook, get back a generated hero image for the campaign. Multi-platform sizing is on the roadmap.
