- Category: Search Engine
- Tags:
- Update: 2012-11-26
- Size: 2.85 MB
- Downloaded: 0 times
- Author: 曹****
- About: Nobody
Introduction
This project uses Lucene and Heritrix to build a web search application.
Lucene is a Java-based full-text information retrieval library. It is an open source project that grew up within the Apache Jakarta family and has since become a top-level Apache project.
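To make "full-text retrieval" concrete, here is a minimal sketch of an inverted index, the general data structure this kind of library is built around. It uses only the Java standard library and is not Lucene's API: the class `TinyInvertedIndex` and its methods `tokenize`, `add`, and `search` are hypothetical names chosen for illustration, and the tokenizer and AND-only query semantics are simplifying assumptions.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeSet;

// Illustrative only: a toy inverted index with hypothetical names, not Lucene's API.
class TinyInvertedIndex {
    // term -> sorted set of ids of the documents that contain the term
    private final Map<String, TreeSet<Integer>> postings = new HashMap<>();
    private final List<String> documents = new ArrayList<>();

    // Split text into lowercase terms on anything that is not a letter or digit.
    static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    // Store the document and record each of its terms; returns the new document id.
    int add(String text) {
        int id = documents.size();
        documents.add(text);
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(id);
        }
        return id;
    }

    // Return the ids of documents that contain every query term (AND semantics).
    List<Integer> search(String query) {
        TreeSet<Integer> result = null;
        for (String term : tokenize(query)) {
            TreeSet<Integer> ids = postings.getOrDefault(term, new TreeSet<>());
            if (result == null) {
                result = new TreeSet<>(ids);
            } else {
                result.retainAll(ids);
            }
        }
        return result == null ? new ArrayList<>() : new ArrayList<>(result);
    }

    public static void main(String[] args) {
        TinyInvertedIndex index = new TinyInvertedIndex();
        index.add("Lucene is a full-text retrieval library");
        index.add("Heritrix is a web crawler");
        index.add("A crawler feeds pages to a full-text index");
        System.out.println(index.search("full-text"));     // documents 0 and 2
        System.out.println(index.search("crawler index")); // document 2 only
    }
}
```

A real library adds analysis, ranking, and on-disk storage on top of this idea; the sketch only shows why a term lookup can avoid scanning every document.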
Lucene is very powerful, but however powerful a search engine library is, it needs something behind it to supply content: a web crawler. Web crawlers are also called spiders, web robots, or bots; the name matters little. What matters is that crawlers are what give a search engine plenty of resources to search.
Heritrix is an open source web crawler written in pure Java that users can use to fetch the resources they want from the web. It comes from www.archive.org. Heritrix's greatest strength is its extensibility: developers can extend its components to implement their own crawl logic.
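The idea of plugging your own crawl logic into a crawler can be sketched as follows. This is not Heritrix's API: the class `PluggableCrawlerSketch`, its `crawl` method, and the example URLs are hypothetical, and an in-memory link map stands in for the network so that the example runs offline. The `scope` predicate is the replaceable piece; swapping it changes which pages are visited without touching the crawl loop.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Deque;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.function.Predicate;

// Illustrative only: hypothetical names, not Heritrix's classes.
class PluggableCrawlerSketch {
    // Breadth-first crawl over an in-memory link graph that stands in for the web.
    // `scope` is the pluggable rule deciding which URLs may be visited.
    static List<String> crawl(String seed,
                              Map<String, List<String>> links,
                              Predicate<String> scope) {
        List<String> visited = new ArrayList<>();
        Set<String> seen = new HashSet<>();
        Deque<String> frontier = new ArrayDeque<>();
        if (scope.test(seed)) {
            frontier.add(seed);
            seen.add(seed);
        }
        while (!frontier.isEmpty()) {
            String url = frontier.poll();
            visited.add(url);
            for (String out : links.getOrDefault(url, Collections.emptyList())) {
                // Queue a link only if it is in scope and has not been seen before.
                if (scope.test(out) && seen.add(out)) {
                    frontier.add(out);
                }
            }
        }
        return visited;
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = new HashMap<>();
        links.put("http://a.example/", Arrays.asList("http://a.example/docs", "http://b.example/"));
        links.put("http://a.example/docs", Arrays.asList("http://a.example/", "http://a.example/faq"));
        links.put("http://b.example/", Arrays.asList("http://b.example/x"));

        // Custom logic: stay on one host.
        Predicate<String> sameHost = url -> url.startsWith("http://a.example/");
        System.out.println(crawl("http://a.example/", links, sameHost));
        // Default logic: follow every link.
        System.out.println(crawl("http://a.example/", links, url -> true));
    }
}
```

With the same-host rule the crawl stays on `a.example`; with the permissive rule it also reaches the `b.example` pages.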
Package file list
(Preview for download)
4pm\.classpath
...\.cvsignore
...\.project
...\.springBeans
...\.tomcatplugin
...\commons\403.jsp
...\.......\404.jsp
...\.......\error.jsp
...\.......\footer.jsp
...\.......\inprogress.jsp
...\.......\messages.jsp
...\.......\meta.jsp
...\.......\taglibs.jsp
...\commons
...\index.jsp
...\META-INF\MANIFEST.MF
...\META-INF
...\query.jsp
...\scripts\builder.js
...\.......\calendar.js
...\.......\controls.js
...\.......\dragdrop.js
...\.......\effects.js
...\.......\global.js
...\.......\login.js
...\.......\prototype.js
...\.......\scriptaculous.js
...\.......\selectbox.js
...\.......\slider.js
...\.......\validator.jsp
...\scripts
...\.tyles\msn\css\blue\FormDefault.css
...\......\...\...\....\ViewDefault.css
...\......\...\...\blue
...\......\...\...\cyan\ViewDefault.css
...\......\...\...\cyan
...\......\...\...\dialog.css
...\......\...\...\edit.css
...\......\...\...\FormDefault.css
...\......\...\...\formFlow.css
...\......\...\...\formForm.css
...\......\...\...\formMain.css
...\......\...\...\formMain1.css
...\......\...\...\formTop.css
...\......\...\...\FormViewDefault.css
...\......\...\...\green\ViewDefault.css
...\......\...\...\green
...\......\...\...\HtmlEdit.css
...\......\...\...\module.left.css
...\......\...\...\newwalterzorn.css
...\......\...\...\systree.css
...\......\...\...\ViewDefault.css
...\......\...\...\xtree.css
...\......\...\css
...\......\...\images\addoption.gif
...\......\...\......\addoption_a.gif
...\......\...\......\archivedo.gif
...\......\...\......\arrow_icon.gif
...\......\...\......\arrow_left.gif
...\......\...\......\arrow_right.gif
...\......\...\......\arrow_yellow.gif
...\......\...\......\assistover.gif
...\......\...\......\attach.gif
...\......\...\......\attachment_btn.gif
...\......\...\......\background\beij_01.jpg
...\......\...\......\..........\bk_01.jpg
...\......\...\......\..........\bk_02.jpg
...\......\...\......\..........\bk_03.jpg
...\......\...\......\..........\bk_04.jpg
...\......\...\......\..........\bk_05.jpg
...\......\...\......\..........\bk_06.jpg
...\......\...\......\..........\bk_07.jpg
...\......\...\......\..........\bk_08.jpg
...\......\...\......\..........\bk_09.jpg
...\......\...\......\..........\btn_bg.gif
...\......\...\......\..........\b_l.gif
...\......\...\......\..........\b_y.gif
...\......\...\......\..........\dongh.gif
...\......\...\......\..........\d_l.gif
...\......\...\......\..........\d_y.gif
...\......\...\......\..........\d_z.gif
...\......\...\......\..........\iframetitle.gif
...\......\...\......\..........\java.jpg
...\......\...\......\..........\shadow_bottom.gif
...\......\...\......\..........\shadow_left.gif
...\......\...\......\..........\shadow_left_bottom.gif
...\......\...\......\..........\shadow_right.gif
...\......\...\......\..........\shadow_right_bottom.gif
...\......\...\......\..........\shadow_top.gif
...\......\...\......\..........\shadow_top_left.gif
...\......\...\......\..........\shadow_top_right.gif
...\......\...\......\..........\tab-active.gif
...\......\...\......\..........\tab-beginer.gif
...\......\...\......\..........\tab-breaker.gif
...\......\...\......\..........\tab-ender.gif
...\......\...\......\..........\tab-expand.gif
...\......\...\......\..........\tab-normal.gif
...\......\...\......\..........\Thumbs.db
...\......\...\......\..........\titlebar.gif
...\......\...\......\..........\t_l.gif