CodeBus
www.codebus.net
Search
Sign in
Sign up
Hot Search :
Source
embeded
web
remote control
p2p
game
More...
Location :
Home
Search - longest
Main Category
SourceCode
Documents
Books
WEB Code
Develop Tools
Other resource
Sub Category
assembly language
SCSI-ASPI
ELanguage
Disk Tools
Speech/Voice recognition/combine
Editor
Anti-virus
MultiLanguage
MPI
source in ebook
Delphi VCL
OS Develop
MiddleWare
MacOS develop
LabView
e-language
python
Search - longest - List
[
MultiLanguage
]
lunce-IKAnalyzer
DL : 1
中文分词工具,利用lucence的接口写的,进行最长匹配,正向和反向匹配后根据词数选择。别人写的,我用了,觉得不错,简单,上手快-Chinese word segmentation tool lucence interface written for the longest match, the forward and reverse to match the number under the word choice. Someone else wrote, I used the think that a good, simple, on手快
Date
: 2025-12-26
Size
: 848kb
User
:
xielang
[
MultiLanguage
]
Dictory
DL : 0
用B-树作为查找存储结构,对中文单词进行哈希,用最长逆向匹配算法进行中文句段分词,实现中文词典以及分词。中文词典规模为十万八千多词汇。分词的正确率在90 以上。-Use the B-tree as storage structure , and hash the Chinese word while storing or searching. Use the longest reverse matching algorithm to split Chinese sentence to words. Chinese dictionaries scale of more than 108,000 words. Word correct rate of over 90 .
Date
: 2025-12-26
Size
: 780kb
User
:
康维鹏
CodeBus
is one of the largest source code repositories on the Internet!
Contact us :
1999-2046
CodeBus
All Rights Reserved.