Some sites require proxies to avoid blocks
proxy
option on any request. Below is an example of using the residential_cdp
proxy for a crawl.
Different domains will need different proxies. You’ll typically want to use the cheapest proxy that can reliably access a site. Below is a table showing the proxies available, and their costs.
Proxy | Cost per GB | Loads assets? |
---|---|---|
none | $0.01 / GB | No |
datacenter | $0.01 / GB | No |
mobile | $0.50 / GB | No |
residential_cdp | $8.00 / GB | No |
residential_cdp_assets | $8.50 / GB | Yes |
proxy
parameter. FetchFox will launch concurrent visits for each proxy you pass in, and use the first valid response it gets.
Below is an example of running an extraction with multiple proxies.
auto
as the proxy parameter.
Below is an example of how to run a crawl that automatically picks a proxy. The same parameter can be used for all other endpoints like scrape and extract.