I need a script that uses Google to find three different kinds of web proxies and then checks whether each proxy works.
The output should be a text file listing the working proxies.
1) Identify web proxies
Use different search patterns, for example:
Glype Script
Pattern: "Remove Objects" "Allow Cookies"
https://www.google.com/#hl=de&tbo=d&output=search&sclient=psy-ab&q=%22Remove+Objects%22+%22Allow+Cookies%22&oq=%22Remove+Objects%22+%22Allow+Cookies%22&gs_l=hp.3...1085.14049.0.14383.9.8.1.0.0.0.97.715.8.8.0...0.0...1c.1.C6uNPkw8wgk&pbx=1&bav=on.2,or.r_gc.r_pw.r_qf.&bvm=bv.1355534169,d.Yms&fp=9f5cb4d0262964be&bpcl=40096503&biw=1920&bih=994
PHProxy
Pattern: "Use ROT13 encoding on the address"
CGI Proxy
Pattern: "Remove all cookies" "Hide referrer information"
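
The patterns above could be turned into search URLs roughly as follows. This is only a sketch: the names FOOTPRINTS and build_search_url are made up for illustration, and the plain /search?q=...&start=... URL form is an assumption (it is shorter than the example link above but carries the same quoted query).

```python
from urllib.parse import urlencode

# Footprint phrases copied from the patterns listed above.
# The dictionary name and its keys are illustrative only.
FOOTPRINTS = {
    "Glype": ["Remove Objects", "Allow Cookies"],
    "PHProxy": ["Use ROT13 encoding on the address"],
    "CGI Proxy": ["Remove all cookies", "Hide referrer information"],
}


def build_search_url(phrases, start=0):
    """Return a Google search URL that requires every phrase verbatim."""
    # Wrap each phrase in double quotes so it is matched exactly.
    query = " ".join(f'"{phrase}"' for phrase in phrases)
    # "q" is the query string; "start" is assumed to be the result offset
    # used for paging (0, 10, 20, ...).
    return "https://www.google.com/search?" + urlencode({"q": query, "start": start})


if __name__ == "__main__":
    for kind, phrases in FOOTPRINTS.items():
        print(kind, build_search_url(phrases))
```

Further proxy types can be covered by adding more entries to the dictionary.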
2) From the list of sites, check whether each one actually has a proxy form and whether the form works, e.g. submit a test URL and check that the returned page shows the right URL
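
One possible way to do the check in step 2, using only the Python standard library. All function names here are hypothetical, and the sketch assumes that the proxy form is the first form with a named text input and that the test page contains a known marker string. Real proxy scripts may also need cookies or hidden fields, which this does not handle.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode, urljoin
from urllib.request import urlopen


class FormFinder(HTMLParser):
    """Collect every <form> together with the names of its text inputs."""

    def __init__(self):
        super().__init__()
        self.forms = []
        self._current = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {
                "action": attrs.get("action") or "",
                "method": (attrs.get("method") or "get").lower(),
                "inputs": [],
            }
            self.forms.append(self._current)
        elif tag == "input" and self._current is not None:
            # An <input> without a type attribute defaults to a text field.
            is_text = (attrs.get("type") or "text").lower() == "text"
            if is_text and attrs.get("name"):
                self._current["inputs"].append(attrs["name"])

    def handle_endtag(self, tag):
        if tag == "form":
            self._current = None


def find_proxy_form(html):
    """Return the first form that has a named text input, or None."""
    finder = FormFinder()
    finder.feed(html)
    for form in finder.forms:
        if form["inputs"]:
            return form
    return None


def submit_test_url(site_url, form, test_url, timeout=15):
    """Send test_url through the form and return the response body as text."""
    target = urljoin(site_url, form["action"])
    data = urlencode({form["inputs"][0]: test_url})
    if form["method"] == "post":
        response = urlopen(target, data=data.encode("ascii"), timeout=timeout)
    else:
        # Assumes the form action has no query string of its own.
        response = urlopen(target + "?" + data, timeout=timeout)
    with response:
        return response.read().decode("utf-8", errors="replace")


def response_shows_target(html, marker):
    """True if the returned page contains text known to be on the test page."""
    return marker in html
```

A site would count as working if find_proxy_form finds a form and the page returned by submit_test_url passes response_shows_target for a marker taken from the test page.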
3) Create the results file with the working proxies
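
The results file could be written like this. The file name working_proxies.txt and the one-URL-per-line layout are assumptions; duplicates are dropped and the list is sorted so repeated runs give comparable files.

```python
from pathlib import Path


def write_results(working_proxies, path="working_proxies.txt"):
    """Write one proxy URL per line and return how many were written."""
    # Remove duplicates and sort for a stable output order.
    unique = sorted(set(working_proxies))
    Path(path).write_text("".join(url + "\n" for url in unique), encoding="utf-8")
    return len(unique)
```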
I will provide you with a working proxy so that you can scrape Google without being blocked.
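
Assuming the supplied proxy is a standard HTTP forward proxy, the Google requests could be routed through it roughly like this. The function name, the proxy address in the comment and the User-Agent value are placeholders.

```python
from urllib.request import ProxyHandler, build_opener


def make_opener(proxy_url):
    """Build a urllib opener that sends HTTP and HTTPS requests via proxy_url."""
    handler = ProxyHandler({"http": proxy_url, "https": proxy_url})
    opener = build_opener(handler)
    # Placeholder browser-like User-Agent; adjust as needed.
    opener.addheaders = [("User-Agent", "Mozilla/5.0")]
    return opener


# Example usage (the address is a placeholder, not the real proxy):
# opener = make_opener("http://127.0.0.1:8080")
# html = opener.open("https://www.google.com/search?q=test", timeout=15).read()
```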