Use case

Document Processing

POST document images to the published webhook. Get back structured data: extracted fields, summaries, classifications, or any schema you define in the prompt.

Who this is for

Operations teams, data entry, back-office automation

The problem

Documents arrive as images or photos of printed pages. Extracting the data you need (invoice totals, contract dates, product specs) means reading each one manually or maintaining brittle parsing scripts.

The flow

POST the document image to the published webhook. The Gemini vision node reads the image and extracts structured fields based on your prompt: line items, dates, totals, classifications, summaries, or any custom schema you define. The Text node formats the extraction into clean JSON. Respond to Webhook returns the structured data, ready for your database, spreadsheet, or downstream system.

The outcome

Structured JSON from document images in one API call. No parsing scripts, no manual data entry.

Nodes used

HTTP TriggerFile InputImage (Gemini vision)Text (Gemini)Respond to Webhook

More use cases

All use cases

Product Video AdsUpload a product photo, get back a video ad with generated copy and voiceover. One webhook call replaces your entire creative production queue.Narrated Explainer VideosGenerate a script, a hero scene visual, an ≤8-second clip, and full-length narration from one topic prompt on the canvas.Social Media Content PipelinePOST a creative brief via webhook, get back a generated hero image for the campaign. Multi-platform sizing is on the roadmap.

Generate your first video ad in 3 minutes.

Free to start. No credit card. Upload a product photo, connect your AI models, click Run.