Problems with Parallel User Agent
Lukasz Keler (kelcon@search.pl)
Tue, 18 May 1999 19:42:51 +0200
Hello !
I have problem with PUA 2.42. Why script below returns 'Broken pipe' (on
Linux RedHat with newest version of libraries from CPAN). First part
generates list of webservers visited by polish infoseek...
require LWP::Parallel::RobotUA;
use HTTP::Request;
require WWW::RobotRules::AnyDBM_File;
require LWP::UserAgent;
$ua = new LWP::UserAgent;
$root = "http://infoseek.icm.edu.pl/root.html";
$request = new HTTP::Request('GET', $root);
$response = $ua->request($request); # or
$content = $response->content;
$content =~
s/(<\s*(a|frame)\b[^>]+\b(?:href|src)\s*=\s*)(["']?)([^>\s]+)\3/push(@urls,$
4)/gie;
$protocol = 'http://';
my $rules = new WWW::RobotRules::AnyDBM_File 'Spy','cache';
my $pua = new LWP::Parallel::RobotUA ("Spy",'kelcon@search.pl',$rules);
$pua->duplicates(1); # ignore duplicates
$pua->timeout (5); # in seconds
$pua->delay (2); # Powinno byc
$pua->redirect (1); # follow redirects
$pua->max_req (1); # max parallel requests per server
$pua->max_hosts(30); # max parallel servers accessed
foreach (@urls)
{
if ((/$protocol/) && ($ile < 300))
{
$pua->register(HTTP::Request->new('GET',$_));
}
}
my $entries = $pua->wait();
Pozdrawiam,
Lukasz Keler +48 604 640 124