Scrape Endpoint

Status: Preview.

POST /v1/scrape is the first Web Intelligence endpoint. Use it when an agent needs a clean representation of one public page.

Request

curl https://api.agentmag.dev/v1/scrape \
  -H "Authorization: Bearer $AGENTMAG_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://docs.example.com",
    "formats": ["markdown", "links", "metadata"],
    "agentContext": true
  }'
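
For agents that call the endpoint from Python rather than a shell, the curl request above can be sketched with the standard library alone. This is a minimal sketch, not an official client: the helper names `build_scrape_request` and `scrape` and the 30-second default timeout are assumptions made for illustration. Only the URL, headers, and body fields come from the example above.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
SCRAPE_URL = "https://api.agentmag.dev/v1/scrape"


def build_scrape_request(api_key: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    return urllib.request.Request(
        SCRAPE_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def scrape(api_key: str, payload: dict, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON body.

    urllib raises urllib.error.HTTPError for 4xx/5xx responses, so callers
    should be prepared to catch it. The timeout value is an assumption.
    """
    request = build_scrape_request(api_key, payload)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (requires a real key and network access):
# result = scrape(os.environ["AGENTMAG_API_KEY"], {
#     "url": "https://docs.example.com",
#     "formats": ["markdown", "links", "metadata"],
#     "agentContext": True,
# })
```

Keeping request construction separate from sending makes the headers and body easy to inspect in tests without touching the network.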

Parameters

Field	Type	Required	Notes
`url`	string	Yes	Public URL to fetch.
`formats`	string[]	No	Output formats to return. Any of `markdown`, `html`, `text`, `links`, `metadata`, `screenshot`.
`agentContext`	boolean	No	Adds summary and context hints for LLM workflows.
`waitFor`	number	No	Extra browser wait time in milliseconds for JavaScript-heavy pages.
`onlyMainContent`	boolean	No	Prefer article/docs body over nav, ads, and repeated chrome.
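
The parameter table above can be turned into a small client-side check that runs before a request is sent. The sketch below is illustrative only: `build_scrape_payload` is a hypothetical helper, and the specific checks (http/https scheme, non-negative `waitFor`) are assumptions about what counts as valid, not documented server behavior.

```python
from typing import Optional, Sequence
from urllib.parse import urlparse

# Format names copied from the parameter table.
ALLOWED_FORMATS = {"markdown", "html", "text", "links", "metadata", "screenshot"}


def build_scrape_payload(
    url: str,
    formats: Optional[Sequence[str]] = None,
    agent_context: Optional[bool] = None,
    wait_for: Optional[int] = None,
    only_main_content: Optional[bool] = None,
) -> dict:
    """Validate arguments locally and return a JSON-ready request body."""
    parsed = urlparse(url)
    # Assumption: only absolute http(s) URLs are accepted.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"not an absolute http(s) URL: {url!r}")

    payload: dict = {"url": url}
    if formats is not None:
        unknown = [f for f in formats if f not in ALLOWED_FORMATS]
        if unknown:
            raise ValueError(f"unknown formats: {unknown}")
        payload["formats"] = list(formats)
    if agent_context is not None:
        payload["agentContext"] = bool(agent_context)
    if wait_for is not None:
        # Assumption: a negative wait time is never meaningful.
        if wait_for < 0:
            raise ValueError("waitFor must be a non-negative number of milliseconds")
        payload["waitFor"] = wait_for
    if only_main_content is not None:
        payload["onlyMainContent"] = bool(only_main_content)
    return payload
```

Optional fields are omitted from the body when not supplied, so the service's own defaults (which this page does not list) still apply.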

Response

{
  "id": "scrape_01h",
  "url": "https://docs.example.com",
  "status": "completed",
  "credits": 3,
  "data": {
    "markdown": "# Docs example...",
    "links": ["https://docs.example.com/api"],
    "metadata": {
      "title": "Docs example",
      "canonicalUrl": "https://docs.example.com"
    }
  }
}
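
To show how an agent might consume the response above, the following sketch flattens it into a typed object. `ScrapeResult` and `parse_scrape_response` are hypothetical names, and the code assumes that any key under `data` may be absent when its format was not requested, which this page does not state explicitly.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ScrapeResult:
    id: str
    url: str
    status: str
    credits: int
    markdown: Optional[str] = None
    links: List[str] = field(default_factory=list)
    title: Optional[str] = None
    canonical_url: Optional[str] = None


def parse_scrape_response(body: dict) -> ScrapeResult:
    """Map the JSON response shown above onto a ScrapeResult."""
    # Assumption: "data" and "metadata" can be missing, so fall back to {}.
    data = body.get("data") or {}
    metadata = data.get("metadata") or {}
    return ScrapeResult(
        id=body["id"],
        url=body["url"],
        status=body["status"],
        credits=body["credits"],
        markdown=data.get("markdown"),
        links=list(data.get("links") or []),
        title=metadata.get("title"),
        canonical_url=metadata.get("canonicalUrl"),
    )
```

Callers would typically check `status` before using the content; `completed` is the only status value shown on this page.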

Common Failures

Code	Cause	Handling
`400`	Missing or invalid URL.	Validate the URL before sending.
`401`	Missing API key.	Send the key as a bearer token in the `Authorization` header.
`403`	Page blocks automated access or key lacks access.	Try a different page or check workspace access.
`408`	Page timed out.	Request fewer formats or increase the job timeout where allowed.
`429`	Rate limit hit.	Retry with exponential backoff.
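
The `429` handling in the table above can be sketched as a retry loop with exponential backoff. Everything here is an assumption made for illustration: the `send` callable (which takes a payload and returns a `(status_code, body)` pair), the 1-second base delay, the 30-second cap, and the limit of five attempts. This page does not document rate-limit windows or recommended delays.

```python
import time
from typing import Callable, Tuple


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Delay before the next retry: base * 2^attempt seconds, capped."""
    return min(cap, base * (2 ** attempt))


def scrape_with_retry(
    send: Callable[[dict], Tuple[int, dict]],
    payload: dict,
    max_attempts: int = 5,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, dict]:
    """Call send(payload), retrying only on 429 with exponential backoff.

    Other status codes (400, 401, 403, 408, ...) are returned immediately,
    because retrying the same request unchanged is unlikely to help.
    """
    status, body = send(payload)
    for attempt in range(max_attempts - 1):
        if status != 429:
            break
        sleep(backoff_delay(attempt))
        status, body = send(payload)
    return status, body
```

Passing `sleep` as a parameter keeps the loop testable without real waiting. In production, adding random jitter to each delay is a common refinement so that many clients do not retry in lockstep.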
