
The program will keep retrying each page until it loads successfully, but it may take a while to find a proxy that works. Each connection will look like a different computer visiting the target website, which spreads out your scraping and makes it less noticeable. The only problem with this system is that many of the proxies are not fast, so they may operate very slowly.
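
As a rough illustration of the retry behaviour described above, here is a minimal Python sketch. It is not the program's actual code: the function name `fetch_with_retries`, the `fetch(url, proxy)` callback, and the `max_attempts` cap are all assumptions made for the example (the cap is there only so the sketch cannot loop forever).

```python
import itertools
import random


def fetch_with_retries(url, proxies, fetch, max_attempts=50):
    """Cycle through the proxies until one of them loads the page.

    `fetch(url, proxy)` is a hypothetical callback that returns the page
    or raises OSError (for example ConnectionError or TimeoutError)
    when the proxy is dead or too slow.
    """
    pool = list(proxies)
    random.shuffle(pool)  # do not always start with the same proxy
    for attempt, proxy in enumerate(itertools.cycle(pool), start=1):
        if attempt > max_attempts:
            break  # assumed safety cap, not part of the original description
        try:
            return proxy, fetch(url, proxy)
        except OSError:
            continue  # this proxy failed, move on to the next one
    raise RuntimeError("no working proxy found")
```

With a list of mostly dead or slow proxies, most attempts end in the `except` branch, which is why a page can take a while to load.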
#PROXIE SCRAPER FULL#
If you are doing any level of heavy scraping (5+ full scrapes a day), this is a good option. If you are running the program with 8 threads and a single proxy provider, each thread will still have its own unique IP address. When you make a connection, the IP address is changed on the backend by the proxy provider.

Rotating Proxy Providers are much like a normal proxy provider and will give you a unique IP address on each connection. The difference is that with a single provider you pay for access to thousands of proxies as a service instead of paying per proxy. You never see the list of proxies; you only connect to one or two IP addresses.

Each proxy will look like a different computer visiting the target website, which spreads out your scraping and makes it less noticeable. A normal scrape is 1,000 results, so with 10 proxies each proxy visits only 100 pages. Shared Proxies from ProxyBonanza start at $10 a month and go up from there. Personally I use 8 threads with 200 proxies. If I had only 25 proxies I would use at most 5 threads. This is because I like my proxies to have down time when they are not being used. Giving them a few seconds between uses better hides your scraping, since the proxies are not active on the site 100% of the time. The idea is to look as natural as possible, and the less you use your proxies the longer they will live. Shared Proxies are good for Google Maps, Yahoo Local, Bing Local, and any 'Quick' scrapers.
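
To make the numbers and the "down time" idea concrete, here is a small Python sketch. It is only an illustration under assumptions: `pages_per_proxy`, `RestingProxyPool`, and the 3 second default rest are hypothetical names and values, not settings from the program.

```python
import math
import time
from collections import deque


def pages_per_proxy(total_pages, proxy_count):
    """Pages each proxy visits when the work is spread evenly (rounded up)."""
    return math.ceil(total_pages / proxy_count)


class RestingProxyPool:
    """Hand out proxies in turn, letting each one rest between uses.

    Single-threaded sketch: a real multi-threaded scraper would need a lock.
    """

    def __init__(self, proxies, rest_seconds=3.0,
                 clock=time.monotonic, sleep=time.sleep):
        # Each entry is (proxy, time it was last handed out or None).
        self._queue = deque((proxy, None) for proxy in proxies)
        self._rest = rest_seconds
        self._clock = clock
        self._sleep = sleep

    def acquire(self):
        """Return the proxy that has rested longest, waiting if needed."""
        proxy, last_used = self._queue.popleft()
        if last_used is not None:
            wait = self._rest - (self._clock() - last_used)
            if wait > 0:
                self._sleep(wait)  # give the proxy its down time
        self._queue.append((proxy, self._clock()))
        return proxy
```

For example, `pages_per_proxy(1000, 10)` gives 100 and `pages_per_proxy(1000, 200)` gives 5. The more proxies there are in the pool relative to threads, the longer each one rests before `acquire()` hands it out again.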

If you are doing any level of heavy scraping (5+ full scrapes a day), you will need proxies. Proxies give you a unique IP address per proxy, so if you are running the program with 8 threads and 8 proxies, each thread will have its own IP address.
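
The one-proxy-per-thread setup can be sketched in Python as follows. This is an assumption about how such a pairing might look, not the program's implementation; `scrape_with_dedicated_proxies` and the `fetch(url, proxy)` callback are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor


def scrape_with_dedicated_proxies(urls, proxies, fetch):
    """Run one worker thread per proxy, so each thread keeps its own IP.

    `fetch(url, proxy)` is a hypothetical callback that downloads `url`
    through `proxy` and returns the page.
    """
    thread_count = len(proxies)

    def worker(index):
        proxy = proxies[index]
        # Worker i handles every thread_count-th URL, starting at URL i.
        return [(url, proxy, fetch(url, proxy))
                for url in urls[index::thread_count]]

    with ThreadPoolExecutor(max_workers=thread_count) as pool:
        chunks = list(pool.map(worker, range(thread_count)))
    # Flatten the per-thread results into one list of (url, proxy, page).
    return [row for chunk in chunks for row in chunk]
```

With 8 proxies this starts 8 threads, and every request made by a given thread goes out through the same IP address.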
