Search - vector quantization for speech

[Other resource] vq

Description: 说话人识别是语音识别的一种特殊方式，其目的不是识别语音内容，而是识别说话人是谁，即从语音信号中提取个人特征。采用矢量量化（VQ）可避免困难的语音分段问题和时间归整问题，且作为一种数据压缩手段可大大减少系统所需的数据存储量。本文提出了识别特征选取采用复倒谱特征参数和对应用VQ的说话人识别系统改进的一种方法。当用于训练的数据量较小时，复倒谱特征可以得到比较稳定的识别性能。VQ的改进方法避免了说话人识别系统的训练时间与使用时间相差过长从而导致系统的性能明显下降以及若利用自相关函数带来的大量运算。-Speaker Recognition Speech Recognition is a special way, its purpose is not voice recognition, Who identification but said that the voice signal from extracting personal characteristics. Vector quantization (VQ) can avoid the difficulties subparagraph voice to the issues and the whole time, and as a means of data compression system can significantly reduce the required data storage capacity. This paper presents a selection of identifiers employ Cepstrum parameters and the application of VQ speaker recognition system to improve a side France. When training for the amount of data is smaller, rehabilitation Cepstrum be relatively stable recognition performance. VQ improved ways to avoid the speech recognition system of training and the use of the difference in time, resulting in excessive sys
Platform: | Size: 23925 | Author: 张开 | Hits:

[Software Engineering] vq

Description: 说话人识别是语音识别的一种特殊方式，其目的不是识别语音内容，而是识别说话人是谁，即从语音信号中提取个人特征。采用矢量量化（VQ）可避免困难的语音分段问题和时间归整问题，且作为一种数据压缩手段可大大减少系统所需的数据存储量。本文提出了识别特征选取采用复倒谱特征参数和对应用VQ的说话人识别系统改进的一种方法。当用于训练的数据量较小时，复倒谱特征可以得到比较稳定的识别性能。VQ的改进方法避免了说话人识别系统的训练时间与使用时间相差过长从而导致系统的性能明显下降以及若利用自相关函数带来的大量运算。-Speaker Recognition Speech Recognition is a special way, its purpose is not voice recognition, Who identification but said that the voice signal from extracting personal characteristics. Vector quantization (VQ) can avoid the difficulties subparagraph voice to the issues and the whole time, and as a means of data compression system can significantly reduce the required data storage capacity. This paper presents a selection of identifiers employ Cepstrum parameters and the application of VQ speaker recognition system to improve a side France. When training for the amount of data is smaller, rehabilitation Cepstrum be relatively stable recognition performance. VQ improved ways to avoid the speech recognition system of training and the use of the difference in time, resulting in excessive sys
Platform: | Size: 23552 | Author: 张开 | Hits:

[Speech/Voice recognition/combine] SPTK-3.1

Description: unix下的语音信号处理工具库，用c语言编写，可用于lsp、lpc、矢量量化分析等等。-under unix tools for speech signal processing library, with c languages, can be used for lsp, lpc, analysis and so on vector quantization.
Platform: | Size: 1105920 | Author: 李航 | Hits:

[Speech/Voice recognition/combine] duojishilianglbgvq

Description: 对于语音编码参数的有权重多级矢量量化，主要是对LSP码本训练，这个权重学习语音的都应知道，且这个可以训练大的语音参数，完全自已写的。上传前已调试成功。-Speech coding parameters for the right to re-multistage vector quantization, mainly for LSP codebook training, the right to re-learn voice should know, and that this can be trained big voice parameters, completely their own writing. From the success of pre-commissioning.
Platform: | Size: 6682624 | Author: 马庆利 | Hits:

[Speech/Voice recognition/combine] test_project

Description:
Platform: | Size: 2048 | Author: Praveen | Hits:

[Speech/Voice recognition/combine] yuyinxinhaolianghua

Description: 采用矢量量化对语音信号量化，并与标量量化对比分析，绘制了语音信号的概率密度曲线-The use of vector quantization for speech signal quantization and scalar quantization and comparative analysis, rendering the speech signal probability density curve
Platform: | Size: 160768 | Author: wu wei | Hits:

[Software Engineering] ReseacrhOfAPPlieationOfEmbeddedSPeeehRecognitionon

Description: 本文首先介绍了目前语音识别的发展现状和主要手段，分析了语音识别中所采用的主要特征参数和比较前沿的研究方向，另外着重讲解了语音识别中最常用隐马尔可夫H(MM)模型，及应用广泛的矢量量化方法(VQ)。接着介绍了嵌入式平台，从软、硬件方面着重介绍了与语音识别相关部分的设计包括硬件及相关驱动程序设计，最后介绍了系统实现方法与测试结果。 -This paper describes the current status of the development of speech recognition and the main means of analysis used in speech recognition parameters and compare the main features cutting-edge research, while focused on explaining the most commonly used in speech recognition, Hidden Markov H (MM) model of , and the widely used vector quantization (VQ). Then introduced the embedded platform, from hardware and software aspects of highlighting the relevant parts of speech recognition with the design, including hardware and associated drivers for the design, finally introduced a system implementation method and test results.
Platform: | Size: 6030336 | Author: fff | Hits:

[Speech/Voice recognition/combine] 19_M-W-K-D-D_SPA2010_final

Description: This paper presents results of speaker recognition experiments using short Polish sentences. We developed and analyzed various vector quantization representations in order to first maximize identification effectiveness and second to compare VQ (vector quantization) and GMM (Gaussian mixture model) approaches. For the research and experiments we created and exploited database, containing specially prepared short speech sequences.
Platform: | Size: 258048 | Author: Tomasz | Hits:

[AI-NN-PR] module-1

Description: ector quantization is a classical quantization technique from signal processing which allows the modeling of probability density functions by the distribution of prototype vectors. It was originally used for data compression. It works by dividing a large set of points (vectors) into groups having approximately the same number of points closest to them. Each group is represented by its centroid point, as in k-means and some other clustering algorithms. Digital libraries not only consist of text data, but also speech and image data. To compress speech data techniques such as vector quantization (VQ) are used.
Platform: | Size: 6144 | Author: noopur | Hits:

[Linux-Unix] SPTK-3.5

Description: SPTK是一套语音信号处理工具为UNIX环境中,如LPC分析,PARCOR分析、LSP分析,PARCOR合成过滤器,LSP合成过滤器,矢量量化技术,以及其他扩展的版本。-SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.
Platform: | Size: 6351872 | Author: 石中华 | Hits:

[AI-NN-PR] exp7

Description: LBG分类算法用初始室心随机法和扰动因子分裂法两种方法，比较不同方法不同参数设置时的分类性能。 -LBG classification algorithm vector quantization: vector normalization within a certain range for a particular type, consists of two steps: first generate a codebook, which is the speech feature vector space by the first process- also known as clustering speech parameter sequence as a vector, the reference code for classified- also known as quantization. Clustering algorithm: it is relatively simple and commonly used K-means clustering algorithm. LBG is a clustering algorithm, which is generally assumed that the codebook size is fixed, and for a power of 2. Codebook is small, then expanding until it reaches the requirements. It is often an existing classification split into two subclasses, and initial value with the new code word to each subclass. LBG algorithm on random data and a certain regularity (and meet certain Gaussian distribution) data classification, and look at the performance of the LBG algorithm, the initial chamber heart random disturbance factor-secession law are two
Platform: | Size: 86016 | Author: zzc | Hits:

[Windows Develop] aquaantiziingp

Description: 应用LBG算法对语音信号进行矢量量化。本压缩包共有两个主文件，training.m对训练数据进进行处理并的到四个初始码本，quantizing.m对待量化数据进行矢量量化。其余为自编功能函数。 -The application of the LBG algorithm for vector quantization of speech signals. The compressed package a total of two main documents training.m into the training data to be processed and to the four initial codebook, quantizing.m treatment of quantitative data for vector quantization. The rest is self performance function.
Platform: | Size: 4096 | Author: fz | Hits:

[Multimedia Develop] VQ-for-the-DHMM

Description: 用于聚类的矢量量化程序，可用在相关领域，程序本为语音识别设计，可正常运行。-Used for clustering vector quantization procedure, can be used in the related field, program designed for speech recognition, can be normal operation
Platform: | Size: 1551360 | Author: vsdlh ausdhf | Hits:

[Voice Compress] speech-vq

Description: 针对语音信号的能量，实现三维矢量量化，量化失真小。-Energy for the speech signal to achieve a three-dimensional vector quantization, a quantization distortion.
Platform: | Size: 342016 | Author: hy | Hits:

[matlab] vq-Speakerrecognized

Description: vq矢量量化，对说话人进行识别，内置有语音样本，识别率高达90 。对语音信号进行读取，预处理，特征参数提取，向量量，训练，对比识别。-vq vector quantization for speaker identification, built-in voice samples, the recognition rate of 90 . Read the speech signal, preprocessing, feature extraction, the amount of vector, training, recognition comparison.
Platform: | Size: 705536 | Author: 罗先生 | Hits:

[Audio program] mfcc2delta

Description: In this work, the Mel frequency Cepstrum Coefficient (MFCC) feature has been used for designing a text dependent speaker identification system. The extracted speech features (MFCC’s) of a speaker are quantized to a number of centroids using vector quantization algorithm. These centroids constitute the codebook of that speaker. MFCC’s are calculated in training phase and again in testing phase. Speakers uttered same words once in a training session and once in a testing session later. The Euclidean distance between the MFCC’s of each speaker in training phase to the centroids of individual speaker in testing phase is measured and the speaker is identified according to the minimum Euclidean distance. The code is developed in the MATLAB environment and performs the identification satisfactorily.
Platform: | Size: 1024 | Author: abdou | Hits:

Category

Source Code

Web/Internet

Develop Tools

Document

Other

Search in results

OS

Platform

Language

File Type

Search list