CodeBus
www.codebus.net
Search - DATA CLUSTERING
Search - DATA CLUSTERING - List
[
Software Engineering
]
cluster-hyper-dim
DL : 0
This paper studies the problem of categorical data clustering, especially for transactional data characterized by high dimensionality and large volume. Starting from a heuristic method of increasing the height-to-width ratio of the cluster histogram, we develop a novel algorithm, CLOPE, which is very fast and scalable, while being quite effective. We demonstrate the performance of our algorithm on two real world
Date
: 2025-12-20
Size
: 106kb
User
:
hanzhang
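The height-to-width heuristic mentioned in the cluster-hyper-dim abstract can be illustrated with a small sketch. It assumes each transaction is a set of items and that a cluster's histogram counts how often each item occurs in the cluster. The function names, the toy transactions, and the `profit` criterion with its repulsion parameter `r` are illustrative assumptions based on the commonly cited form of CLOPE, not code from the listed archive.

```python
from collections import Counter


def histogram_stats(cluster):
    """Return (size, width, height) of a cluster's item histogram.

    size   = total number of item occurrences (area of the histogram)
    width  = number of distinct items
    height = size / width
    """
    counts = Counter(item for transaction in cluster for item in set(transaction))
    size = sum(counts.values())
    width = len(counts)
    height = size / width if width else 0.0
    return size, width, height


def height_to_width_ratio(cluster):
    """Ratio the heuristic tries to increase: tall, narrow histograms score higher."""
    _, width, height = histogram_stats(cluster)
    return height / width if width else 0.0


def profit(clusters, r=2.0):
    """Assumed global criterion: size-weighted average of size / width**r.

    r is a hypothetical repulsion parameter; larger values favor narrower clusters.
    """
    total = sum(len(cluster) for cluster in clusters)
    if total == 0:
        return 0.0
    gain = 0.0
    for cluster in clusters:
        if cluster:
            size, width, _ = histogram_stats(cluster)
            gain += size / width ** r * len(cluster)
    return gain / total


# Toy data: five transactions grouped in two different ways.
clustering_a = [[{"a", "b"}, {"a", "b", "c"}, {"a", "c", "d"}], [{"d", "e"}, {"d", "e", "f"}]]
clustering_b = [[{"a", "b"}, {"a", "b", "c"}], [{"a", "c", "d"}, {"d", "e"}, {"d", "e", "f"}]]

print(profit(clustering_a), profit(clustering_b))
```

In this toy data the first grouping keeps transactions with more shared items together, so its histograms are taller relative to their width and it scores higher under the assumed criterion.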
[
Software Engineering
]
TechofDataDigInEbusiness
DL : 0
Data mining techniques in e-commerce; suitable as a term paper for a specialized course.
Date
: 2025-12-20
Size
: 9kb
User
:
danny
[
Software Engineering
]
AIS
DL : 0
This paper briefly introduces data mining theories and techniques such as clustering, association analysis, and time series analysis. Based on an analysis of the current state of data mining research in the transportation field, and of AIS information mining in particular, it proposes a mining method for AIS information tailored to the characteristics of AIS information in networked AIS databases.
Date
: 2025-12-20
Size
: 1.03mb
User
:
赵新星
[
Software Engineering
]
algs
DL : 0
Details of the clustering algorithms: supplement to the paper “Validating Clustering in Gene Expression Data” (to appear in Bioinformatics).
Date
: 2025-12-20
Size
: 15kb
User
:
MaliheAmini
[
Software Engineering
]
Clustering.Algorithms.Research
DL : 0
The 2008 Journal of Software (软件学报) paper "Clustering Algorithms Research" (《聚类算法研究》) by Sun Jigui (孙吉贵), Liu Jie (刘杰), and Zhao Lianyu (赵连宇). PDF format, 14 pages. The paper summarizes the current state of research and recent progress in clustering algorithms. First, it analyzes representative clustering algorithms proposed in recent years in terms of their underlying ideas, key techniques, and advantages and disadvantages. Second, it selects several typical clustering algorithms and well-known data sets and runs simulation experiments focused on accuracy and running efficiency, comparing the results of the same algorithm on different data sets and of different algorithms on the same data set. Finally, combining both kinds of information, it identifies the research hotspots, difficulties, shortcomings, and open problems in cluster analysis. This work provides a useful reference for research in cluster analysis and data mining.
Date
: 2025-12-20
Size
: 459kb
User
:
dengyue
[
Software Engineering
]
LJClusterDemo
DL : 1
Text clustering is an automatic clustering technique based on similarity algorithms. It automatically groups large numbers of unlabeled documents, places documents with similar content into the same class, and generates characteristic topic words for each class. It suits applications such as automatic generation of hot-topic public opinion digests, tracking of major news events, and visual analysis of intelligence. Lingjoin (灵玖, www.lingjoin.com) builds on a core feature discovery technique that overcomes the high space consumption and long processing time of traditional clustering methods; according to the vendor it is fast, accurate, and memory-efficient, and is particularly suited to clustering very large corpora and short-text corpora. The main features of the Lingjoin document clustering component are: 1. Speed: it can process massive volumes of web text, averaging at least 500,000 documents per hour. 2. Accurate clustering: the top N cluster centers often reflect current hot topics, which suits public opinion hotspot computation; the vendor claims its metrics are far ahead of those of Autonomy, an international company known for clustering, perhaps because Lingjoin understands Chinese better. 3. Accurate ranking: classes are ranked by influence weight, and documents within each class are ranked by importance. 4. Customizable: the number of classes and the class centers can be customized. 5. Open interface: as part of LJParser, the component exposes a flexible development interface, can be integrated into users' business systems, and supports various operating systems and calling languages. Lingjoin document clustering can be applied to text mining, knowledge management, search result clustering, public opinion monitoring, and other applications.
Date
: 2025-12-20
Size
: 1.05mb
User
:
lingjoin
[
Software Engineering
]
Clustering
DL : 0
Clustering data with measurement errors.
Date
: 2025-12-20
Size
: 185kb
User
:
sudek
[
Software Engineering
]
File3
DL : 0
Data mining, clustering, genetic algorithms, the k-means algorithm, and a k-means clustering method based on a genetic algorithm.
Date
: 2025-12-20
Size
: 2.47mb
User
:
王三
[
Software Engineering
]
PSOformal
DL : 0
PSO formal: data clustering using particle swarm optimization.
Date
: 2025-12-20
Size
: 671kb
User
:
a2maridz
[
Software Engineering
]
high-demensional
DL : 0
A collection of papers on high-dimensional data clustering; quite helpful.
Date
: 2025-12-20
Size
: 3.24mb
User
:
何晓宇
[
Software Engineering
]
sonno
DL : 0
TGSOM, a dynamic self-organizing map neural network for data clustering; a good paper.
Date
: 2025-12-20
Size
: 230kb
User
:
awdawd
[
Software Engineering
]
HDclassif
DL : 3
A high-dimensional clustering package for R, quick and convenient. Discriminant analysis and data clustering methods for high-dimensional data, based on the assumption that high-dimensional data live in different subspaces of low dimensionality, proposing a new parametrization of the Gaussian mixture model that combines the ideas of dimension reduction and constraints on the model.
Date
: 2025-12-20
Size
: 130kb
User
:
meibo
[
Software Engineering
]
jitizhihuibaincheng
DL : 0
Chinese edition of Programming Collective Intelligence. With machine learning and computational statistics as its background, the book explains how to mine and analyze data and resources on the web. It covers collaborative filtering, data clustering analysis, core search engine techniques, Bayesian filtering, and more.
Date
: 2025-12-20
Size
: 27.09mb
User
:
相知无悔
[
Software Engineering
]
Clustering-Algorithms-Research
DL : 0
This paper summarizes the current state of research and recent progress in clustering algorithms. First, it analyzes representative clustering algorithms proposed in recent years in terms of their underlying ideas, key techniques, and advantages and disadvantages. Second, it selects several typical clustering algorithms and well-known data sets and runs simulation experiments focused on accuracy and running efficiency, comparing the results of the same algorithm on different data sets and of different algorithms on the same data set. Finally, combining both kinds of information, it identifies the research hotspots, difficulties, shortcomings, and open problems in cluster analysis. This work provides a useful reference for research in cluster analysis and data mining.
Date
: 2025-12-20
Size
: 591kb
User
:
Earik
[
Software Engineering
]
rnndf
DL : 0
Modulation recognition of MPSK signals using higher-order cumulants; can cluster two-dimensional data; performs stepwise linear regression.
Date
: 2025-12-20
Size
: 4kb
User
:
赵士胜
[
Software Engineering
]
xs436
DL : 0
Can cluster two-dimensional data; the code includes thorough comments and explanations; given a well log curve as input, it can compute the attenuation of the corresponding seismic waves.
Date
: 2025-12-20
Size
: 5kb
User
:
郑永智
[
Software Engineering
]
Data-Mining
DL : 0
Building on an in-depth analysis of various algorithms, in particular density-based, hierarchical, and partition-based clustering algorithms, this thesis proposes a new fast clustering algorithm based on density and hierarchy. The algorithm retains the ability of density-based clustering to discover clusters of arbitrary shape and has approximately linear time complexity, which makes it suitable for mining large-scale data. Theoretical analysis and experimental results also show that the algorithm handles clusters of arbitrary shape, is insensitive to noisy data, and runs significantly faster than the traditional DBSCAN algorithm.
Date
: 2025-12-20
Size
: 130kb
User
:
wfyan
[
Software Engineering
]
yaogeiqie
DL : 0
Illumination processing methods for face recognition; can cluster two-dimensional data; uses the ESPRIT algorithm to estimate the frequencies of signals with interference.
Date
: 2025-12-20
Size
: 3kb
User
:
王艳锋
[
Software Engineering
]
hannen_v40
DL : 0
MATLAB implementation of dynamic clustering or iterative self-organizing data analysis (ISODATA); can cluster two-dimensional data; shows the use of spectral estimation from modern signal processing in MATLAB.
Date
: 2025-12-20
Size
: 4kb
User
:
文志海
[
Software Engineering
]
jie_pg40
DL : 0
Clustering analysis based on Euclidean distance; can cluster two-dimensional data; reactive power optimization based on a genetic algorithm.
Date
: 2025-12-20
Size
: 4kb
User
:
inkpmjf