Re: [ba-unrev-talk] ExtremWeb -- p2p grid computing
"Harvest addresses these problems through a set of customizable tools for
gathering information from diverse repositories, building topic-specific
content indexes, flexibly searching the indexes, widely replicating them,
and caching objects as they are retrieved across the Internet. The system
interoperates with Mosaic and with HTTP, FTP, WAIS, and Gopher information
Harvest permits flexible construction of information services that use both
the network and information servers efficiently. It can be configured to
automatically collect and summarize related objects from around the Internet
into a large collection or to collect, summarize, and hand annotate a tiny,
specialized collection. Harvest automates many of the tasks needed to
integrate information from disparate repositories and formats. Harvest's
architecture makes efficient use of Internet servers and network links, and
our measurements indicate that Harvest can reduce server load, network
traffic, and index space requirements by one to two orders of magnitude,
compared with previous systems."
----- Original Message -----
From: "Peter Jones" <firstname.lastname@example.org>
Sent: Sunday, December 02, 2001 6:15 PM
Subject: Re: [ba-unrev-talk] ExtremWeb -- p2p grid computing
> In the book Information Retrieval by Ricardo Baeza-Yates and Berthier
> Ribeiro-Neto, there is mention of a system called Harvest that uses a
> distributed indexing approach to the Web.
> I haven't searched for any more info yet though.
> ----- Original Message -----
> From: "Jack Park" <email@example.com>
> To: <firstname.lastname@example.org>
> Sent: Saturday, December 01, 2001 8:05 PM
> Subject: [ba-unrev-talk] ExtremWeb -- p2p grid computing
> > http://www.lri.fr/~fedak/XtremWeb/introduction.php3
> > Open source grid computing.
> > I mention this here because I am possessed with notions of finding a way
> > to index, ontologize, and yadayadayada...the entire Web, at least those
> > portions of interest to me and to others who want to play in the same
> > Yup. I know the drill..."Can't be done." "Not worth the effort." "Get a
> > life!" and so forth.
> > So what!
> > Imagine a way to share CPU cycles from idle machines all over the world,
> > doing keyword searches, then using Greenstone to mirror, index, and
> > ontologize the sites found.
> > Go figure...
> > Jack