Tor-privoxy is a Ruby Mechanize wrapper to access the web with mechanize via Tor/Privoxy It allows to use multiple Privoxy instances, switch endpoints, switch proxy when you get 4xx HTTP code Useful for web robots, scanners, grabbers when accessing sites which may ban/block you unexpectedly
tor-privoxy is a Ruby Mechanize wrapper for accessing the web via Tor/Privoxy.
It allows multiple Privoxy instances, switching endpoints, and switching the
proxy when you get an HTTP 4xx error code.
It is useful for web robots, scanners, and scrapers when accessing sites which may
ban/block you unexpectedly
The first step is to install the gem:
gem install tor-privoxy
To use in your application:
require 'tor-privoxy'
To get a Mechanize instance wrapped to use Tor and able to use another endpoint when it encounters an HTTP 4xx code:
agent = TorPrivoxy::Agent.new '127.0.0.1', '', {8118 => 9051} do |agent|
sleep 1
puts "New IP is #{agent.ip}"
end
Passing block is optional:
agent = TorPrivoxy::Agent.new '127.0.0.1', '', 8118 => 9051
First parameter is a proxy host, second is Tor control port password:
agent = TorPrivoxy::Agent.new '127.0.0.1', '', 8118 => 9051
The hash is in format proxyport => torcontrolport. Yes, you may provide as many as you want, but I don’t have an idea why I initially did it like so.
And use the agent as a usual Mechanize agent instance:
agent.get "http://example.com"
Configuration options are passed when creating an agent and consist of:
Created by Phil Pirozhkov