Search - text clustering toolkit

Search list

[JSP/Java] lingpipe-3.6.0

Description: LingPipe is an open-source Java toolkit for natural language processing. Its APIs currently cover topic classification, named entity recognition, part-of-speech tagging, sentence detection, query spell checking, interesting phrase detection, clustering, character language modeling, MEDLINE download, parsing and indexing, database text mining, Chinese word segmentation, sentiment analysis, and language identification.
Platform: | Size: 4669440 | Author: 张国栋 | Hits:

[JSP] tutorial

Description: On the whole a very good tool; its main strength is that it comes with examples, six in total. The Dragon Toolkit is a Java-based development package for academic use in information retrieval (IR) and text mining (TM), including text classification, text clustering, text summarization, and topic modeling. It is tailored for researchers who work on large-scale IR and TM and prefer Java programming. Unlike Lucene and Lemur, it provides built-in support for semantic-based IR and TM. The toolkit integrates a set of NLP tools, which enables it to index text collections with various representation schemes, including words, phrases, ontology-based concepts, and relationships.
Platform: | Size: 164864 | Author: 徐水 | Hits:

[JSP] maxmatcher

Description: An experimental platform that uses sparse matrices for text mining, with an API for evaluation on TREC data. The Dragon Toolkit is a Java-based development package for academic use in information retrieval (IR) and text mining (TM), including text classification, text clustering, text summarization, and topic modeling. It is tailored for researchers who work on large-scale IR and TM and prefer Java programming. Unlike Lucene and Lemur, it provides built-in support for semantic-based IR and TM. The toolkit integrates a set of NLP tools, which enables it to index text collections with various representation schemes, including words, phrases, ontology-based concepts, and relationships.
Platform: | Size: 4096 | Author: 徐水 | Hits:

[Other Databases] malletTest

Description: Test code for MALLET, an excellent Java-based natural language processing toolkit. It supports text classification, clustering, and other tasks, allows custom algorithms to be added, and exposes a large number of APIs, which gives it good research and practical value.
Platform: | Size: 6811648 | Author: 王沐杰 | Hits:

[Mathimatics-Numerical algorithms] bow

Description: A machine-learning toolkit of C source code for statistical language modeling, text retrieval, classification, and clustering. Bow (or libbow) is a library of C code useful for writing statistical text analysis, language modeling, and information retrieval programs.
Platform: | Size: 18009088 | Author: 马俊奇 | Hits:

CodeBus www.codebus.net