Search - IKAnalyzer3.2.8

Search list

[JSP/Java] IKAnalyzer3.2.8-bin

Description: IKAnalyzer is an open-source, lightweight Chinese word segmentation toolkit written in Java. Since version 1.0 was released in December 2006, IKAnalyzer has gone through three major versions. It began as a Chinese word segmentation component built around the open-source Lucene project, combining dictionary-based segmentation with grammar analysis algorithms. The newer IKAnalyzer 3.0 has become a general-purpose segmentation component for Java that is independent of the Lucene project, while still providing a default implementation optimized for Lucene.
Platform: | Size: 2074624 | Author: wuwenyan | Hits:

[JSP/Java] IKAnalyzer3.2.8-source

Description: Source package of IKAnalyzer, which implements Chinese word segmentation. (1) Uses its own "forward iterative finest-grained segmentation algorithm", with a processing speed of 600,000 characters per second. (2) Uses a multi-sub-processor analysis mode that supports segmenting English letters (IP addresses, email addresses, URLs), numbers (dates, common Chinese numerals and measure words, Roman numerals, scientific notation), and Chinese vocabulary (personal names and place names). (3) Optimized dictionary storage with a smaller memory footprint; supports user-defined dictionary extensions. (4) IKQueryParser, a query parser optimized for Lucene full-text search; it uses an ambiguity analysis algorithm to optimize the permutations and combinations of query keywords, which can greatly improve the hit rate of Lucene searches.
Platform: | Size: 1197056 | Author: 陈雅倩 | Hits:
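
The "forward iterative finest-grained segmentation" idea named in the description above can be illustrated with a small toy sketch. This is not IKAnalyzer's actual implementation or API: the class name `FinestGrainedSegmenter`, the in-memory `Set` dictionary, and the single-character fallback rule are all assumptions made for illustration. The sketch scans the text from left to right and, at every start position, emits every dictionary word that begins there, so overlapping words of all lengths are produced.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical toy segmenter; NOT part of IKAnalyzer.
class FinestGrainedSegmenter {
    private final Set<String> dictionary;
    private final int maxWordLength;

    FinestGrainedSegmenter(Set<String> dictionary) {
        this.dictionary = dictionary;
        int max = 1;
        for (String word : dictionary) {
            max = Math.max(max, word.length());
        }
        this.maxWordLength = max;
    }

    // Returns every dictionary word found at every start position, in scan order.
    // A character is emitted on its own only if it is not a dictionary word
    // start and no previously matched word covers it (assumed fallback rule).
    List<String> segment(String text) {
        List<String> tokens = new ArrayList<>();
        int coveredUpTo = 0; // exclusive end of the furthest-reaching match so far
        for (int start = 0; start < text.length(); start++) {
            boolean matched = false;
            int limit = Math.min(text.length(), start + maxWordLength);
            for (int end = start + 1; end <= limit; end++) {
                String candidate = text.substring(start, end);
                if (dictionary.contains(candidate)) {
                    tokens.add(candidate);
                    coveredUpTo = Math.max(coveredUpTo, end);
                    matched = true;
                }
            }
            if (!matched && start >= coveredUpTo) {
                tokens.add(text.substring(start, start + 1));
            }
        }
        return tokens;
    }
}
```

With a dictionary containing "no", "now", "nowhere", "he", "here" and "where", segmenting "nowhere" yields all six overlapping words rather than one longest match, which is the behaviour a search index wants for recall. A real segmenter would typically use a trie instead of repeated substring lookups.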

[JSP/Java] IKAnalyzer_V3.2.8

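Description: IK Analyzer is an open-source, lightweight Chinese word segmentation toolkit written in Java. It began as a Chinese word segmentation component built around the open-source Lucene project, combining dictionary-based segmentation with grammar analysis algorithms. The newer IK Analyzer 3.X has become a general-purpose segmentation component for Java that is independent of the Lucene project, while still providing a default implementation optimized for Lucene.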
Platform: | Size: 715776 | Author: flymoon | Hits:

CodeBus www.codebus.net