Location:
Search - nutch d
Search list
Description: 一种新型的基于Nutch的搜索引擎技术,时下热门研究方向-A new search engine based on Nutch technology research nowadays popular
Platform: |
Size: 266240 |
Author: 李化 |
Hits:
Description: Google的核心竞争技术是它的计算平台。Apache上就出现了一个类似的解决方案,目前它们都属亍Apache的Hadoop项目,对应的分删是: Chubby-->ZooKeeper GFS-->HDFS BigTable-->HBase MapReduce-->Hadoop 目前,基亍类似思想的Open Source项目迓径多,Hadoop是其中最为流行的框架,本文就将简要介绍hadoop的一个开发流程。-Hadoop got its start in Nutch. A few of us were attempting to build an open source
web search engine and having trouble managing computations running on even a
handful of computers. Once Google published its GFS and MapReduce papers, the
route became clear. They’d devised systems to solve precisely the problems we were
having with Nutch. So we started, two of us, half-time, to try to recreate these systems
as a part of Nutch
Platform: |
Size: 607232 |
Author: aaaaaaa |
Hits:
Description: Hadoop got its start in Nutch. A few of us were attempting to build an open source
web search engine and having trouble managing computations running on even a
handful of computers. Once Google published its GFS and MapReduce papers, the
route became clear. They’d devised systems to solve precisely the problems we were
having with Nutch. So we started, two of us, half-time, to try to re-create these systems
as a part of Nutch.
We managed to get Nutch limping along on 20 machines, but it soon became clear that
to handle the Web’s massive scale, we’d need to run it on thousands of machines and,
moreover, that the job was bigger than two half-time developers could handle.
Around that time, Yahoo! got interested, and quickly put together a team that I joined.
We split off the distributed computing part of Nutch, naming it Hadoop. With the help
of Yahoo!, Hadoop soon grew into a technology that could truly scale to the Web.
Platform: |
Size: 7521280 |
Author: likunbao |
Hits: