Web intelligence
Extract structured data from any website
Pass a URL instead of a file to the same endpoint. The API renders the page in a headless browser, runs its JavaScript, then extracts structured data from the live DOM. Enable follow_links to crawl subpages and fill your schema from multiple pages in one call.
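As a rough illustration of the request described above, this Python sketch builds a JSON body for a URL-based extraction. Only `follow_links` is named on this page; the `url` and `schema` field names and the `build_extract_request` helper are assumptions made for illustration, and the real request format may differ.

```python
import json
from typing import Any


def build_extract_request(
    url: str, schema: dict[str, Any], follow_links: bool = False
) -> str:
    """Return a JSON request body for a URL-based extraction.

    The body shape is hypothetical: "url" and "schema" are assumed
    field names, and only "follow_links" comes from the page text.
    """
    if not url.startswith(("http://", "https://")):
        raise ValueError("url must start with http:// or https://")
    payload = {
        "url": url,                    # assumed field name
        "schema": schema,              # assumed field name
        "follow_links": follow_links,  # option named in the text above
    }
    return json.dumps(payload, indent=2)


body = build_extract_request(
    "https://example.com",
    {"company_name": "string", "contact_email": "string"},
    follow_links=True,
)
print(body)
```

The sketch deliberately stops before sending anything, since the endpoint URL and authentication scheme are not given here.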
What you can extract
Company name, logo, and brand colors
Product names, pricing, and descriptions
Contact info, social links, and addresses
Page metadata, headings, and structured content
Tech stack detection from scripts and headers
Multi-page data via automatic subpage crawling
Example schema
{
  "company_name": "string",
  "description": "string",
  "products": [
    {"name": "string", "price": "number"}
  ],
  "contact_email": "string",
  "social_links": ["string"]
}
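One way to use a schema like this on the client side is to check that a returned result has the expected shape before passing it on. The Python sketch below is a minimal checker under stated assumptions: the `conforms` helper and the sample `result` are hypothetical, the type names `"string"` and `"number"` are mapped to Python types by guesswork, and every schema field is treated as required, which the actual API may not guarantee.

```python
from typing import Any

# Assumed mapping from the schema's type names to Python types.
TYPE_MAP = {"string": str, "number": (int, float)}

SCHEMA = {
    "company_name": "string",
    "description": "string",
    "products": [{"name": "string", "price": "number"}],
    "contact_email": "string",
    "social_links": ["string"],
}


def conforms(value: Any, schema: Any) -> bool:
    """Return True if value matches the shorthand schema."""
    if isinstance(schema, str):
        expected = TYPE_MAP.get(schema)
        # Unknown type names never match; bool is rejected because it
        # is a subclass of int and would otherwise pass as a number.
        if expected is None or isinstance(value, bool):
            return False
        return isinstance(value, expected)
    if isinstance(schema, list):
        # A one-element list describes the type of every item.
        return (
            len(schema) == 1
            and isinstance(value, list)
            and all(conforms(item, schema[0]) for item in value)
        )
    if isinstance(schema, dict):
        # Every schema key must be present (an assumption of this sketch).
        return isinstance(value, dict) and all(
            key in value and conforms(value[key], sub)
            for key, sub in schema.items()
        )
    return False


# Hypothetical result, invented for illustration only.
result = {
    "company_name": "Acme",
    "description": "Widgets for everyone",
    "products": [{"name": "Widget", "price": 9.99}],
    "contact_email": "hello@example.com",
    "social_links": ["https://example.com/acme"],
}
print(conforms(result, SCHEMA))
```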
Supported page types
Company websites
Extract brand info, team pages, and company descriptions.
E-commerce pages
Product catalogs, pricing, and inventory data.
Job boards
Job listings, requirements, and company details.
Automate web intelligence document processing
Free tier includes 100 credits/month. No credit card required.