Skip to content
Use case

Document Processing

POST document images to the published webhook. Get back structured data: extracted fields, summaries, classifications, or any schema you define in the prompt.

Who this is for

Operations teams, data entry, back-office automation

The problem

Documents arrive as images or photos of printed pages. Extracting the data you need (invoice totals, contract dates, product specs) means reading each one manually or maintaining brittle parsing scripts.

The flow

POST the document image to the published webhook. The Gemini vision node reads the image and extracts structured fields based on your prompt: line items, dates, totals, classifications, summaries, or any custom schema you define. The Text node formats the extraction into clean JSON. Respond to Webhook returns the structured data, ready for your database, spreadsheet, or downstream system.

The outcome

Structured JSON from document images in one API call. No parsing scripts, no manual data entry.

Generate your first video ad in 3 minutes.

Free to start. No credit card. Upload a product photo, connect your AI models, click Run.