Web Intelligence
Scrape Endpoint
Fetch a single public URL and return clean content formats for AI agent workflows.
Status: Preview.
POST /v1/scrape is the first Web Intelligence endpoint. Use it when an agent needs a clean representation of one public page.
Request
Parameters
| Field | Type | Required | Notes |
|---|---|---|---|
| url | string | Yes | Public URL to fetch. |
| formats | string[] | No | markdown, html, text, links, metadata, screenshot. |
| agentContext | boolean | No | Adds summary and context hints for LLM workflows. |
| waitFor | number | No | Extra browser wait time in milliseconds for JavaScript-heavy pages. |
| onlyMainContent | boolean | No | Prefer article/docs body over nav, ads, and repeated chrome. |
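
The parameters above can be assembled and checked on the client before any request is sent. The following is a minimal sketch, not the official client: it assumes the request body is JSON keyed by the field names in the table, and the helper names (`build_scrape_payload`, `build_scrape_request`) and the `base_url` argument are hypothetical, since the API host is not stated in this section. The local URL check mirrors the 400 guidance in Common Failures.

```python
import json
import urllib.request
from urllib.parse import urlparse

# Format names copied from the parameter table.
ALLOWED_FORMATS = {"markdown", "html", "text", "links", "metadata", "screenshot"}


def build_scrape_payload(url, formats=None, agent_context=None,
                         wait_for=None, only_main_content=None):
    """Validate inputs locally and build a body for POST /v1/scrape.

    Assumption: the endpoint accepts a JSON object with these field names.
    """
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")

    payload = {"url": url}
    if formats is not None:
        unknown = [f for f in formats if f not in ALLOWED_FORMATS]
        if unknown:
            raise ValueError(f"unknown formats: {unknown}")
        payload["formats"] = list(formats)
    if agent_context is not None:
        payload["agentContext"] = bool(agent_context)
    if wait_for is not None:
        if wait_for < 0:
            raise ValueError("wait_for must be a non-negative number of milliseconds")
        payload["waitFor"] = wait_for
    if only_main_content is not None:
        payload["onlyMainContent"] = bool(only_main_content)
    return payload


def build_scrape_request(base_url, api_key, payload):
    """Return an unsent urllib Request carrying bearer auth.

    base_url is a placeholder for the real API host (an assumption here).
    """
    return urllib.request.Request(
        base_url.rstrip("/") + "/v1/scrape",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Omitted optional fields are left out of the body rather than sent as null, so the server's own defaults apply. Passing the returned request to `urllib.request.urlopen` would send it.
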
Response
Common Failures
| Code | Cause | Handling |
|---|---|---|
| 400 | Missing or invalid URL. | Validate URL before sending. |
| 401 | Missing API key. | Send bearer auth. |
| 403 | Page blocks automated access or key lacks access. | Try a different page or check workspace access. |
| 408 | Page timed out. | Request fewer formats or increase the job timeout where allowed. |
| 429 | Rate limit hit. | Retry with exponential backoff. |
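
The 429 handling in the table can be sketched as a small retry wrapper. This is an illustration under assumptions rather than a prescribed client: `send` is a hypothetical zero-argument callable that performs one request and returns a `(status_code, body)` tuple, and the attempt count, delays, and jitter are arbitrary defaults, not documented limits.

```python
import random
import time


def retry_on_429(send, max_attempts=5, base_delay=1.0, max_delay=30.0,
                 sleep=time.sleep):
    """Call send() until it returns a status other than 429 or attempts run out.

    The delay doubles after each 429 (base_delay, 2x, 4x, ...), capped at
    max_delay, with up to 10% random jitter added so that many clients do
    not retry in lockstep.
    """
    status, body = None, None
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        if attempt == max_attempts - 1:
            break  # out of attempts; return the last 429 to the caller
        delay = min(max_delay, base_delay * (2 ** attempt))
        sleep(delay + random.uniform(0, delay * 0.1))
    return status, body
```

Only 429 is retried here. The other codes in the table call for a change to the request (400, 401, 403) or to the requested work (408), so repeating the same call unchanged is unlikely to help.
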