You can limit the number of visits and depth of crawls
The max_visits parameter limits the number of pages visited in a crawl. The crawl will never visit more than that many URLs.
Below is an example of a call to the crawl endpoint with a limit.
The max_depth parameter limits the depth of the crawl. It applies only if you specify starting URLs using the start_urls parameter.
Depth is the distance from the starting URLs. Each starting URL has a depth of 0. URLs linked directly from the starting URL pages have a depth of 1, and pages linked from there have a depth of 2, and so on. If there are multiple links to a page, the lowest depth value is used.
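The depth rule above can be illustrated with a short breadth-first search over an in-memory link graph. This is only a sketch of the definition, not the crawler's actual implementation; the function name compute_depths, the links dictionary, and the example.com URLs are hypothetical.

```python
from collections import deque

def compute_depths(links, start_urls):
    """Return {url: depth}, where depth is the distance from the start URLs."""
    # Every starting URL has a depth of 0.
    depths = {url: 0 for url in start_urls}
    queue = deque(depths)
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            # Breadth-first order reaches each page first via its shortest
            # path, so the first depth recorded is the lowest one.
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths

# Hypothetical link graph: each key is a page, each value lists the pages it links to.
links = {
    "https://example.com/": ["https://example.com/a", "https://example.com/b"],
    "https://example.com/a": ["https://example.com/c"],
    "https://example.com/b": ["https://example.com/c", "https://example.com/d"],
    "https://example.com/c": ["https://example.com/d"],
}

depths = compute_depths(links, ["https://example.com/"])
for url, depth in depths.items():
    print(depth, url)
```

In this graph, https://example.com/d is linked from a depth-1 page and from a depth-2 page, so the lower value applies and its depth is 2.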
The max_depth parameter is the maximum depth from the start URLs that is allowed in a crawl. If you set a max depth of 0, only the starting URLs will be visited. If you set a max depth of 1, only the starting URLs and the pages linked directly from them will be visited, and so on.
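To make the effect of the two limits concrete, here is a minimal local simulation, assuming the crawl proceeds breadth-first over a known link graph. It does not call the crawl endpoint; simulate_crawl, the site dictionary, and the example.com URLs are hypothetical, and the real crawler may visit pages in a different order.

```python
from collections import deque

def simulate_crawl(links, start_urls, max_depth=None, max_visits=None):
    """Return the URLs visited, in order, under the given limits."""
    start_urls = list(dict.fromkeys(start_urls))  # drop duplicate start URLs
    seen = set(start_urls)
    queue = deque((url, 0) for url in start_urls)
    visited = []
    while queue:
        # Stop once the visit budget is used up.
        if max_visits is not None and len(visited) >= max_visits:
            break
        page, depth = queue.popleft()
        visited.append(page)
        # Pages at the depth limit are visited, but their links are not followed.
        if max_depth is not None and depth >= max_depth:
            continue
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append((target, depth + 1))
    return visited

# Hypothetical link graph: each key is a page, each value lists the pages it links to.
site = {
    "https://example.com/": ["https://example.com/a", "https://example.com/b"],
    "https://example.com/a": ["https://example.com/c"],
    "https://example.com/b": ["https://example.com/c"],
}
start = ["https://example.com/"]

print(simulate_crawl(site, start, max_depth=0))   # only the starting URL
print(simulate_crawl(site, start, max_depth=1))   # starting URL plus pages it links to
print(simulate_crawl(site, start, max_visits=2))  # stops after two pages
```

Both limits can be combined; whichever is reached first ends that part of the crawl.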
Below is an example of a crawl that limits the maximum depth.