Search - text mining tool

[Graph Recognize] TextAbstractor3

Description: 能够对文本的内容进行显示功能，并且能够根据客户的需求可以对关键字查找，并显示带有关键字的句子，并且采用句子分割技术进行处理，实现断句功能。即可以把该文章带有此关键字的句子分句显示出来，该软件的使用必须在ODBC中进行相关的设置，详细请看我的说明文档，请大家导入文章时选择txt的文件格式。我目前正在开发更好的文本挖掘工具，如果大家有什么好的建议和想法请发EMAIL给我：andondon-right to the contents of the text for display, and according to the needs of the clients can keyword search, and display with the keyword phrases and sentences using segmentation processing technology to achieve functional Sentences. That can take the article with the keyword this clause of the sentence demonstrated that the software must be used for the ODBC related settings, I Look at the detailed documentation, please choose articles into txt file format. I is currently developing a better text mining tool, if you have any good suggestions and ideas please send email to me : andondon
Platform: | Size: 153587 | Author: 孙明 | Hits:

[Graph Recognize] TextAbstractor3

Description: 能够对文本的内容进行显示功能，并且能够根据客户的需求可以对关键字查找，并显示带有关键字的句子，并且采用句子分割技术进行处理，实现断句功能。即可以把该文章带有此关键字的句子分句显示出来，该软件的使用必须在ODBC中进行相关的设置，详细请看我的说明文档，请大家导入文章时选择txt的文件格式。我目前正在开发更好的文本挖掘工具，如果大家有什么好的建议和想法请发EMAIL给我：andondon-right to the contents of the text for display, and according to the needs of the clients can keyword search, and display with the keyword phrases and sentences using segmentation processing technology to achieve functional Sentences. That can take the article with the keyword this clause of the sentence demonstrated that the software must be used for the ODBC related settings, I Look at the detailed documentation, please choose articles into txt file format. I is currently developing a better text mining tool, if you have any good suggestions and ideas please send email to me : andondon
Platform: | Size: 5708800 | Author: 孙明 | Hits:

[Other] libsvm-2.9

Description: 文本分类工具libsvm-2.9.zip 信息检索和数据挖掘的中用到的工具包，里面有C++、JAVA、Python等多个语言版本-Libsvm-2.9.zip text classification tool for information retrieval and data mining tools used in the package, inside C++, JAVA, Python and other languages
Platform: | Size: 578560 | Author: 张杰 | Hits:

[Other] text_mining

Description: SPSS文本挖掘工具的中文介绍，包括如何实现的整个过程。-SPSS text mining tool for the Chinese presentations, including how to implement the entire process.
Platform: | Size: 13312 | Author: linda | Hits:

[ISAPI-IE] OutdosoftCMS

Description: 服务器环境要求： Windows2003 + .Net Framework 2.0 + Sql Server 2000 功能特点：俏皮皮图片系统采用Outdosoft CMS 3.0内核，具有超强的采集和生成功能。 1、自定义网页编码(gb2312、utf-8、iso-8859-1)。 2、采用Outdosoft开发的专业采集系统，易用、稳定、高效，可以采集防盗链图片、支持断点续采。 3、自定义缩略图、水印，针对图片站提供了底部文字水印。 4、高效的静态标签生成技术，全站静态化。 5、生成百度地图、Google地图、更新列表、热点列表，加速搜索引擎收录。 6、自定义文件路径、Title、Keywords、Description，对搜索引擎提供最友好的支持。 7、简单易用的标签生成工具，让您随心所欲的改版网站。 8、无限级分类，分类可独立设置模板，让您的网站更加出众。 9、专业级的评论模块，可以过滤不良词句。 10、每日更新统计，让您随时的检查工作情况。-Server environment requirements: Windows2003+. Net Framework 2.0+ Sql Server 2000 Features: Qiao Pipi picture system uses Outdosoft CMS 3.0 kernel, with superior collection and generation. 1, the custom page coding (gb2312, utf-8, iso-8859-1). 2, using Outdosoft acquisition system developed by the professional, easy to use, stable, efficient, and can capture anti Daolian picture, support breakpoint mining. 3, the custom thumbnail, watermark, providing a base station for picture text watermark. 4, effective static label generation technology, the station s static. 5, Baidu generated maps, Google maps, updated list, hot list, speed up your search engine. 6, custom file path, Title, Keywords, Description, most search engine friendly support. 7, easy to use tag generation tool, so you want it to the revised website. 8, infinite-level classification, classification can be independently set the template to make your site even more outstanding. 9, professional commen
Platform: | Size: 967680 | Author: eric | Hits:

[MultiLanguage] TF-IDF

Description: The tf–idf weight (term frequency–inverse document frequency) is a weight often used in information retrieval and text mining. This weight is a statistical measure used to evaluate how important a word is to a document in a collection or corpus. The importance increases proportionally to the number of times a word appears in the document but is offset by the frequency of the word in the corpus. Variations of the tf–idf weighting scheme are often used by search engines as a central tool in scoring and ranking a document s relevance given a user query.
Platform: | Size: 5120 | Author: oplachko84 | Hits:

[Data structs] libshorttext-1.0.tar

Description: Introduction LibShortText is an open source tool for short-text classification and analysis. It can handle the classification of, for example, titles, questions, sentences, and short messages. Main features of LibShortText include It is more efficient than general text-mining packages. On a typical computer, processing and training 10 million short texts takes only around half an hour. The fast training and testing is built upon the linear classifier LIBLINEAR Default options often work well without tedious tuning. An interactive tool for error analysis is included. Based on the property that each short text contains few words, LibShortText provides details in predicting each text.-Introduction LibShortText is an open source tool for short-text classification and analysis. It can handle the classification of, for example, titles, questions, sentences, and short messages. Main features of LibShortText include It is more efficient than general text-mining packages. On a typical computer, processing and training 10 million short texts takes only around half an hour. The fast training and testing is built upon the linear classifier LIBLINEAR Default options often work well without tedious tuning. An interactive tool for error analysis is included. Based on the property that each short text contains few words, LibShortText provides details in predicting each text.
Platform: | Size: 368640 | Author: AMIMIMEK | Hits:

[CSharp] TFIDF-master

Description: tf–idf, short for term frequency–inverse document frequency, is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus.[1]:8 It is often used as a weighting factor in information retrieval and text mining. The tf-idf value increases proportionally to the number of times a word appears in the document, but is offset by the frequency of the word in the corpus, which helps to control for the fact that some words are generally more common than others. Variations of the tf–idf weighting scheme are often used by search engines as a central tool in scoring and ranking a document s relevance given a user query. tf–idf can be successfully used for stop-words filtering in various subject fields including text summarization and classification. One of the simplest ranking functions is computed by summing the tf–idf for each query term many more sophisticated ranking functions are variants of this simple model.-tf–idf, short for term frequency–inverse document frequency, is a numerical statistic that is intended to reflect how important a word is to a document in a collection or corpus.[1]:8 It is often used as a weighting factor in information retrieval and text mining. The tf-idf value increases proportionally to the number of times a word appears in the document, but is offset by the frequency of the word in the corpus, which helps to control for the fact that some words are generally more common than others. Variations of the tf–idf weighting scheme are often used by search engines as a central tool in scoring and ranking a document s relevance given a user query. tf–idf can be successfully used for stop-words filtering in various subject fields including text summarization and classification. One of the simplest ranking functions is computed by summing the tf–idf for each query term many more sophisticated ranking functions are variants of this simple model.
Platform: | Size: 17408 | Author: adel | Hits:

[WEB Code] source-archive (3)

Description: A toolkit for generation of dummy XML documents of user specified size and randomness of the structure xml-generator is a Python based toolkit for generation of well formed XML sample documents. Its primary purpose is to generate documents for performance evaluation of XML parsing routines and data mining experimentation. xml-generator, in its current implementation, is a command line based tool. It allows users to specify the size, depth and the randomness of the generated XML tree. The text nodes are populated with randomly chosen words from the vocabulary of several hundred English words. xml-generator generates a 100 MB XML document in ~17 sec and 1GB document
Platform: | Size: 17408 | Author: kkainer | Hits:

Category

Source Code

Web/Internet

Develop Tools

Document

Other

Search in results

OS

Platform

Language

File Type

Search list

[Graph Recognize] TextAbstractor3

[Graph Recognize] TextAbstractor3

[Other] libsvm-2.9

[Other] text_mining

[ISAPI-IE] OutdosoftCMS

[MultiLanguage] TF-IDF

[Data structs] libshorttext-1.0.tar

[CSharp] TFIDF-master

[WEB Code] source-archive (3)

CodeBus www.codebus.net