Image extraction

Extract structured data from images

Upload photos of documents, receipts, business cards, or any image containing text. The API runs OCR with vision model proofreading to extract clean, structured data.

How it works

Send your JPG / PNG file

Upload via the API or pass a URL. The API auto-detects the format.

Define your schema

Describe the fields you want as a JSON schema. The API maps your document to your structure.

Get structured JSON

Receive typed data with confidence scores and citations back to the source document.

Example request

curl -X POST https://dev.thedrive.ai/api/v1/extract \
  -H "X-API-Key: your_key" \
  -F "file=@document.jpg / png" \
  -F 'schema={"text": "string", "vendor": "string", "amount": "number"}'

JPG / PNG processing features

Multi-format support

JPG, PNG, GIF, WebP, SVG, TIFF, BMP, HEIC, HEIF, and AVIF.

OCR + vision proofreading

Text is extracted via OCR, then a vision model compares against the original to fix misread characters.

Handwriting recognition

Handles stamps, handwritten notes, and phone photos of documents.

Start extracting from JPG / PNG files

Free tier includes 100 credits/month. No credit card required.

Get API Key Try in Playground