[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [SLUG] Call for new list best-of-slug ?



Sonam Chauhan wrote:
> 
> > Okay, where is the dummies guide to swish?
> >
> > Like what exactly does it do?
> > Does it provide a word searchable index to mail messages?
> > What does it create? - a html page for my apache to display?
> > So, I figure it may be quicker to ask and save reinventing the wheel.
> 
> Hi Terry: Swish is an indexing engine. It visits web pages and follows links
> like web engines do, but only builds a searchable index. It's search interface
> is only command line, so Swish needs to be coupled with CGI frontends to work 
> as a full search engine. There are a few out there like WWWais, and they can 
> be tailored for our specific circumstance.

Yep, figured all this out since posting.
Now I need a spider that works under linux.

It indexed the some stuff by filesystem in a trial, but barfs on
spidering on html. Apparently the spider is a perl spider for sparc.

.................................................
[terryc@nospam.owl terryc]$ head /opt/swish/src/swishspider
#!/local/sparc/bin/perl
 
use LWP::UserAgent;
use LWP::RobotUA;
use HTTP::Request;
use HTTP::Status;
use HTML::LinkExtor;
 
if (scalar(@nospam.ARGV) != 2) {
    print STDERR "Usage: SwishSpider localpath url\n";
[terryc@nospam.owl terryc]$                                                   

Can fix the perl line okay, but head scratching about the LWP:
:RobotUA; that it doesn't like. "Cannot locate LWP/UserAgent.pm in
@nospam.INC......


> 
> However we *don't need* to modify/write anything - 

I do {:-0 as I'm looking at plugging it onto the SLUG list archive
that I currently maintain.

thanks for any help.

--
   Terry Collins {:-)}}} Ph(02) 4627 2186 Fax(02) 4628 7861  
   email: terryc@nospam.woa.com.au  www: http://www.woa.com.au  
       or terryc@nospam.zip.net.au 
   WOA Computer Services <lan/wan, linux/unix, novell>
   snail:  PO Box 1047, Campbelltown, NSW 2560.

 "People without trees are like fish without clean water"
--
SLUG - Sydney Linux Users Group Mailing List - http://www.slug.org.au
To unsubscribe send email to slug-request@nospam.slug.org.au with
unsubscribe in the text