- Crawl to find relevant URLs.
- Extract to turn page content into structured items.
Crawl for URLs
Use/api/crawl with a pattern:
*matches any characters except/**matches any characters including/
results.hits.
Extract from URLs to get items
Use/api/extract with:
urlorurlstemplate
A single endpoint to crawl and extract
To run both phases in one request, use/api/scrape.
/api/scrape when you want the convenience of one call. Use /api/crawl + /api/extract directly when you want fine-grained control over each phase.