Atreyu 42 on Tue, 5 Aug 2003 10:14:16 +0200 (CEST)


[Date Prev] [Date Next] [Thread Prev] [Thread Next] [Date Index] [Thread Index]

Re: <nettime> Mas(s) Googleing


-----Original Message-----
> From: auskadi <auskadi@tvcabo.co.mz>
> 
> This might be a really dumb question ....
> Is there an "open" search engine or a non corporate > search engine in 
> existence other than the dmoz project

>From Grub.org <http://www.grub.org/html/help.php>:
| Q: What is Grub? 
| A: Grub is a distributed web crawler. People who 
| choose to download and run the client will assist in 
| building the Web's largest, most accurate database 
| of URLs. This database will be used to improve 
| existing search engines' results by increasing the 
| frequency at which sites are crawled and indexed. 
| 
| Conceivably, Grub's distributed network could enable 
| state information to be gathered on every document 
| on the Internet, each and every day. By having 
| websites crawl their own content, and having 
| volunteers donate their bandwidth and clock cycle 
| resources, it decreases bandwidth consumption across 
| the Internet dramatically, allows for pre-processing 
| on the resulting data, and ultimately improves 
| search results sent to end users. 
[...]
| Grub coders wrote most of the software used in the 
| project, and have made some of that code available 
| to the Open Source community. If anything, Grub has 
| contributed to the community, both by making it's 
| code open, and by opening up the database to other 
| similar projects out on the Internet. 
[...]
| Q: Isn't this concept similar to Kazaa, or 
| SETI@home? 
| A: Yes.  The concept is similar.  Grub's client 
| enables any computer on the Internet to utilize its 
| resources (bandwidth, processor time, drive space) 
| to crawl and index a portion of the Internet in its 
| spare time.  With enough clients, Grub will be able 
| to visit and index every web page on the Internet - 
| every single day. 

Grub is used by the search engine LookSmart <http://www.looksmart.com/>.
-- 
Atreyu 42 <mailto:atreyu42@myrealbox.com>

#  distributed via <nettime>: no commercial use without permission
#  <nettime> is a moderated mailing list for net criticism,
#  collaborative text filtering and cultural politics of the nets
#  more info: majordomo@bbs.thing.net and "info nettime-l" in the msg body
#  archive: http://www.nettime.org contact: nettime@bbs.thing.net