Image extraction
Extract structured data from images
Upload photos of documents, receipts, business cards, or any image containing text. The API runs OCR with vision model proofreading to extract clean, structured data.
How it works
Send your JPG / PNG file
Upload via the API or pass a URL. The API auto-detects the format.
Define your schema
Describe the fields you want as a JSON schema. The API maps your document to your structure.
Get structured JSON
Receive typed data with confidence scores and citations back to the source document.
Example request
curl -X POST https://dev.thedrive.ai/api/v1/extract \
-H "X-API-Key: your_key" \
-F "file=@document.jpg / png" \
-F 'schema={"text": "string", "vendor": "string", "amount": "number"}'
JPG / PNG processing features
Multi-format support
JPG, PNG, GIF, WebP, SVG, TIFF, BMP, HEIC, HEIF, and AVIF.
OCR + vision proofreading
Text is extracted via OCR, then a vision model compares against the original to fix misread characters.
Handwriting recognition
Handles stamps, handwritten notes, and phone photos of documents.
Start extracting from JPG / PNG files
Free tier includes 100 credits/month. No credit card required.