Hot Search : Source embeded web remote control p2p game More...
Location : Home Downloads SourceCode Mathimatics-Numerical algorithms DataMining

java网络爬虫

  • Category : DataMining
  • Tags :
  • Update : 2017-09-21
  • Size : 13.92mb
  • Downloaded :0次
  • Author :孤独的***
  • About : Nobody
  • PS : If download it fails, try it again. Download again for free!
Download1 Download2
Don't use download software fo downloading.
If download fail,Try it again for free.
Introduction - If you have any usage issues, please Google them yourself
Is a JAVA reptile framework (kernel) that does not need to be configured for easy development. It provides a streamlined API that requires a small amount of code to implement a powerful crawler
Packet file list
(Preview for download)
基于 Java 的开源网络爬虫框架.htm
WebCollector
WebCollector\.gitignore
WebCollector\LICENSE.txt
WebCollector\NewsCrawler.java
WebCollector\README.md
WebCollector\README.zh-cn.md
WebCollector\WebCollector-JRuby
WebCollector\WebCollector-JRuby\lib
WebCollector\WebCollector-JRuby\lib\webcollector.rb
WebCollector\WebCollector-JRuby\webcollector-0.1.0.gem
WebCollector\WebCollector-JRuby\webcollector.gemspec
WebCollector\WebCollector
WebCollector\WebCollector\CODE_COVERAGE.md
WebCollector\WebCollector\WebCollector.iml
WebCollector\WebCollector\pom.xml
WebCollector\WebCollector\src
WebCollector\WebCollector\src\main
WebCollector\WebCollector\src\main\java
WebCollector\WebCollector\src\main\java\cn
WebCollector\WebCollector\src\main\java\cn\edu
WebCollector\WebCollector\src\main\java\cn\edu\hfut
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\contentextractor
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\contentextractor\ContentExtractor.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\contentextractor\News.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawldb
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawldb\DBManager.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawldb\Generator.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawldb\Injector.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawldb\SegmentWriter.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawler
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawler\AutoParseCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\crawler\Crawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoBingCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoDepthCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoHashSetNextFilter.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoMetaCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoNextFilter.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoPostCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoSelenium.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\DemoTypeCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\example\TutorialCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\fetcher
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\fetcher\Executor.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\fetcher\Fetcher.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\fetcher\NextFilter.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\fetcher\Visitor.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\model
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\model\CrawlDatum.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\model\CrawlDatums.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\model\Links.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\model\Page.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\net
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\net\HttpRequest.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\net\HttpResponse.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\net\Proxys.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\net\Requester.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley\BerkeleyCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley\BerkeleyDBManager.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley\BerkeleyDBReader.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley\BerkeleyDBUtils.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley\BerkeleyGenerator.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\berkeley\BreadthCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\nextfilter
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\nextfilter\HashSetNextFilter.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\ram
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\ram\RamCrawler.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\ram\RamDB.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\ram\RamDBManager.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\plugin\ram\RamGenerator.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\CharsetDetector.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\Config.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\Counter.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\CrawlDatumFormater.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\FileSystemOutput.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\FileUtils.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\JsoupUtils.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\MysqlHelper.java
WebCollector\WebCollector\src\main\java\cn\edu\hfut\dmic\webcollector\util\RegexRule.java
WebCollector\WebCollector\src\main\resources
WebCollector\WebCollector\src\main\resources\log4j.properties
WebCollector\WebCollector\src\test
WebCollector\WebCollector\src\test\java
WebCollector\WebCollector\src\test\java\cn
WebCollector\WebCollector\src\test\java\cn\edu
WebCollector\WebCollector\src\test\java\cn\edu\hfut
WebCollector\WebCollector\src\test\java\cn\edu\hfut\dmic
WebCollector\WebCollector\src\test\java\cn\edu\hfut\dmic\webcollector
WebCollector\WebCollector\src\test\java\cn\edu\hfut\dmic\webcollector\util
WebCollector\WebCollector\src\test\java\cn\edu\hfut\dmic\webcollector\util\CharsetDetectorTest.java
WebCollector\WebCollector\src\test\java\cn\edu\hfut\dmic\webcollector\util\CrawlDatumTest.java
WebCollector\webcollector-2.52-bin.zip
Related instructions
  • We are an exchange download platform that only provides communication channels. The downloaded content comes from the internet. Except for download issues, please Google on your own.
  • The downloaded content is provided for members to upload. If it unintentionally infringes on your copyright, please contact us.
  • Please use Winrar for decompression tools
  • If download fail, Try it againg or Feedback to us.
  • If downloaded content did not match the introduction, Feedback to us,Confirm and will be refund.
  • Before downloading, you can inquire through the uploaded person information

Nothing.

Post Comment
*Quick comment Recommend Not bad Password Unclear description Not source
Lost files Unable to decompress Bad
*Content :
*Captcha :
CodeBus is one of the largest source code repositories on the Internet!
Contact us :
1999-2046 CodeBus All Rights Reserved.