Web intelligence

Extract structured data from any website

Pass a URL instead of a file — same endpoint. The API renders the page in a headless browser, runs JavaScript, then extracts structured data from the live DOM. Enable follow_links to crawl subpages and fill your schema from multiple pages in one call.

What you can extract

Company name, logo, and brand colors

Product names, pricing, and descriptions

Contact info, social links, and addresses

Page metadata, headings, and structured content

Tech stack detection from scripts and headers

Multi-page data via automatic subpage crawling

Example schema

{
  "company_name": "string",
  "description": "string",
  "products": [
    {"name": "string", "price": "number"}
  ],
  "contact_email": "string",
  "social_links": ["string"]
}

Supported document types

Company websites

Extract brand info, team pages, and company descriptions.

E-commerce pages

Product catalogs, pricing, and inventory data.

Job boards

Job listings, requirements, and company details.

Automate web intelligence document processing

Free tier includes 100 credits/month. No credit card required.