Search - mel scale

[Other resource] FrequencyScaleConversion

Description: Frequency Scale Conversion From f To f Scale frq2mel mel2frq mel The mel scale is based on the human perception of sinewave pitch. frq2erb erb2frq erb The erb scale is based on the equivalent rectangular bandwidths of the human ear. frq2midi midi2frq midi The midi standard specifies a numbering of semitones with middle C being 60. They can use the normal equal tempered scale or else the pythagorean scale of just intonation. They will in addition output note names in a character format. -Frequency Scale Conversion From To f f Scal mel frq2mel mel2frq e mel The scale is based on th e human perception of sinewave pitch. e frq2erb erb rb2frq erb The scale is based on the equivale nt rectangular bandwidths of the human ear. preferentially 2midi midi2frq midi The MIDI standard SPECIFIED s a numbering of semitones with middle C being 60 . They can use the normal equal tempered scale or pythagorean else the scale of just intonation. They will in addition output note names in a char acter format.
Platform: | Size: 8122 | Author: 张一一 | Hits:

[matlab] FrequencyScaleConversion

Description: Frequency Scale Conversion From f To f Scale frq2mel mel2frq mel The mel scale is based on the human perception of sinewave pitch. frq2erb erb2frq erb The erb scale is based on the equivalent rectangular bandwidths of the human ear. frq2midi midi2frq midi The midi standard specifies a numbering of semitones with middle C being 60. They can use the normal equal tempered scale or else the pythagorean scale of just intonation. They will in addition output note names in a character format. -Frequency Scale Conversion From To f f Scal mel frq2mel mel2frq e mel The scale is based on th e human perception of sinewave pitch. e frq2erb erb rb2frq erb The scale is based on the equivale nt rectangular bandwidths of the human ear. preferentially 2midi midi2frq midi The MIDI standard SPECIFIED s a numbering of semitones with middle C being 60 . They can use the normal equal tempered scale or pythagorean else the scale of just intonation. They will in addition output note names in a char acter format.
Platform: | Size: 8192 | Author: 张一一 | Hits:

[Multimedia Develop] MFCC

Description: MFCC (Mel Frequent Cepstral Coefficient) in M-File. epresentation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. MFCCs derived as follows: 1. Take the Fourier transform of (a windowed excerpt of) a signal. 2. Map the powers of the spectrum obtained above onto the mel scale, using triangular overlapping windows. 3. Take the logs of the powers at each of the mel frequencies. 4. Take the discrete cosine transform of the list of mel log powers, as if it were a signal. 5. The MFCCs are the amplitudes of the resulting spectrum.
Platform: | Size: 1024 | Author: Mitha | Hits:

[Technology Management] Research_0n_Speech_Cepstral_Features

Description: 该文在研究基于线性预测倒谱和非线性MEL刻度倒谱特征的基础上，研究了LPCC和MFCC参数提取的算法原理及提取算法，提出了一级、二级差分倒谱特征参数的提取算法。识别实验验证了MFCC参数的鲁棒性优于LPCC参数。-In this paper, research is based on linear prediction and nonlinear MEL Cepstrum Cepstrum scale, based on studies of LPCC and MFCC parameter extraction algorithm for principle and extraction algorithm is proposed a secondary characteristic parameters of differential cepstrum extraction algorithm. Recognition experiment results verify the robustness of MFCC parameters is superior to LPCC parameters.
Platform: | Size: 147456 | Author: Decheng Yu | Hits:

[matlab] filterbank

Description: The idea of a filterbank on a non-linear(mel) frequency scale-filter bank for mel
Platform: | Size: 2048 | Author: lou | Hits:

[matlab] filterbank_for_speech_signal

Description: A speech signal filterbank, using melscale frequency and framebanking.- 1. For speech signal can be represented as a discrete sequence of frames (or feature vectors) that can be used as the input to a speech recogniser. Important ideas and techniques that are used in the assignment are the design of a (Mel frequency scale) audio filterbank, , windowing of a continuous audio signal, spectrum analysis of the signal, filtering as multiplication in the frequency domain, the visual representation of a speech signal as a spectrogram, appreciation of the acoustic variability in real speech utterances. 2. To gain a deeper knowledge of the application of MATLAB programming to audio signal processing. 3. To gain practice in the art of writing a formal report: structure, content, style, use of diagrams, presentation etc. etc.
Platform: | Size: 3072 | Author: Yijian Lou | Hits:

[OpenGL program] Milkshape3D_Importer

Description: 1。动画规模因素是用来控制动画率。 2。阈值是用来控制多少在进口UV坐标网格/网格。 3。自从MilkShape3D ascii文件不包含信息的速度播放,我用25 fps射击。 * *历史 1。修理几乎所有的动画虫子发现在0.5版本。 2。支持Milkshape3D Ascii文件。 3。使动画更符合Milkshape3D钥匙。 4。纹理路径解决新旧版本的MilkShape3D文件。-This plug-in is for importing Milkshape3D file(ms3d) Milkshape3D Ascii file(txt) to Maya 8.0. This plug-in supports models with animations and materials. This plug-in is able to import the latest ms3d file with up to four skin weights. Neither Mac OS X nor Windows/Linux 64bit has been supported yet. * Installation* 1. Copy Ms3dImport.mll to your maya plug-ins directory. (For example: "C:\Program Files\Alias\Maya8.0\bin\plug-ins") 2. Copy RubyTranslatorOpts.mel to your maya scripts directory. (For example: "C:\Documents and Settings\user\My Documents\maya\8.0\scripts\others") 2. Launch maya and load this plug-in. (Window-> Setting/Preferences-> Plug-in Manager, select "Ms3dImport.mll"). * Usage* Main menu File-> Open/Import Attention: 1. Animation scale factor is used to control the animation rate. 2. Threshold value is used to control the how many UV coordinates in the imported mesh/meshes. 3. Since MilkShape3D ascii file doesn t contain information of
Platform: | Size: 43008 | Author: zhuhaimjw | Hits:

[Speech/Voice recognition/combine] Gammashirp-filter

Description: In this paper, we figure out the use of appended jitter and shimmer speech features for closed set text independent speaker identification system. Jitter and shimmer features are extracted from the fundamental frequency contour and added to baseline spectral features, specifically Mel-frequency Cepstral Coefficients (MFCCs) for human speech and MFCC-GC which integrate the Gammachirp filterbank instead of the Mel scale. Hidden Markov Models (HMMs) with Gaussian Mixture Models (GMMs) state distributions are used for classification. Our approach achieves substantial performance improvement in a speaker identification task compared with a state-of-the-art robust front-end in a clean condition.
Platform: | Size: 256000 | Author: mansouri | Hits:

[Other] MATLAB

Description: 梅尔频率倒谱(Mel-Frequency Cepstrum)是一段声音的短时功率谱，基于频率的非线性梅尔刻度(mel scale)的对数能量频谱的线性预先变换- the mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency.
Platform: | Size: 9216 | Author: 路上 | Hits:

[Software Engineering] mfcc

Description: 在语音辨识（Speech Recognition）和语者辨识（Speaker Recognition）方面，最常用到的语音特征就是「梅尔倒频谱系数」（Mel-scale Frequency Cepstral Coefficients，简称MFCC），此参数考虑到人耳对不同频率的感受程度，因此特别适合用在语音辨识。
Platform: | Size: 1024 | Author: 温睿潜 | Hits:

[File Format] tain

Description: 耳蜗实质上相当于一个滤波器组，耳蜗的滤波作用是在对数频率尺度上进行的，在1000HZ下，人耳的感知能力与频率成线性关系；而在1000HZ以上，人耳的感知能力与频率不构成线性关系，而更偏向于对数关系，这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-Cochlear substantially equivalent to a filter set, cochlear filter is used on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 1024 | Author: 朱健晨 | Hits:

[Technology Management] tese

Description: 耳蜗实质上相当于一个滤波器组，耳蜗的滤波作用是在对数频率尺度上进行的，在1000HZ下，人耳的感知能力与频率成线性关系；而在1000HZ以上，人耳的感知能力与频率不构成线性关系，而更偏向于对数关系，这就使得人耳对低频信号比高频信号更敏感。Mel频率的提出是为了方便人耳对不同频率语音的感知特性的研究。频率与Mel频率的转换公式为-Cochlear substantially equivalent to a filter set, cochlear filter is used on logarithm frequency scale, under the 1000 hz, the perception of the human ear and a linear relationship with frequency In more than 1000 hz, the perception of the human ear does not constitute a linear relationship with frequency, and prefer to logarithmic relationship, which makes the human ear is sensitive to low frequency signal is better than high frequency signal. Mel frequency is put forward in order to facilitate the ear of the study of speech perception characteristics of different frequency. For frequency and Mel frequency conversion formula
Platform: | Size: 1024 | Author: 朱健晨 | Hits:

[AI-NN-PR] mel-scale

Description: Mel scale and inver mel scale and modified mel scale -自己编写的Mel scale 源码
Platform: | Size: 1024 | Author: 陈金豹 | Hits:

[Compress-Decompress algrithms] rastamat

Description: In sound processing, the mel-frequency cepstrum (MFC) is a representation of the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency.
Platform: | Size: 23552 | Author: desinfox@gmail.com | Hits:

[matlab] mfcc

Description: Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. They are derived a type of cepstral representation of the audio clip (a nonlinear spectrum-of-a-spectrum ). The difference between the cepstrum and the mel-frequency cepstrum is that in the MFC, the frequency bands are equally spaced on the mel scale, which approximates the human auditory system s response more closely than the linearly-spaced frequency bands used in the normal cepstrum.-Mel-frequency cepstral coefficients (MFCCs) are coefficients that collectively make up an MFC. They are derived a type of cepstral representation of the audio clip (a nonlinear spectrum-of-a-spectrum ). The difference between the cepstrum and the mel-frequency cepstrum is that in the MFC, the frequency bands are equally spaced on the mel scale, which approximates the human auditory system s response more closely than the linearly-spaced frequency bands used in the normal cepstrum.
Platform: | Size: 1024 | Author: Perfect | Hits:

[Audio program] mfcc

Description: 语音识别MFCC特征提取matlab代码。「梅尔倒频谱系数」（Mel-scale Frequency Cepstral Coefficients，简称MFCC），是最常用到的语音特征，此参数考虑到人耳对不同频率的感受程度，因此特别适合用在语音辨识。-Speech recognition MFCC feature extraction matlab code. \ Mel cepstrum coefficient (Mel- scale Frequency Cepstral Coefficients, MFCC), is the most commonly used to the phonetic characteristics of this parameter given ear to the feelings of different frequencies, so especially suitable for use in speech recognition
Platform: | Size: 1024 | Author: Katherine | Hits:

[Software Engineering] 20P_ISOLATED

Description: This paper describes an approach of isolated speech recognition by using the Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW). Several features are extracted speech signal of spoken words. An experimental of total five speakers, speaking 10 digits each is collected under acoustically controlled room is taken. MFCC are extracted speech signal of spoken words. To cope with different speaking speeds in speech recognition Dynamic Time Warping (DTW) is used. DTW is an algorithm, which is used for measuring similarity between two sequences, which may vary in time or speed.-This paper describes an approach of isolated speech recognition by using the Mel-Scale Frequency Cepstral Coefficients (MFCC) and Dynamic Time Warping (DTW). Several features are extracted speech signal of spoken words. An experimental of total five speakers, speaking 10 digits each is collected under acoustically controlled room is taken. MFCC are extracted speech signal of spoken words. To cope with different speaking speeds in speech recognition Dynamic Time Warping (DTW) is used. DTW is an algorithm, which is used for measuring similarity between two sequences, which may vary in time or speed.
Platform: | Size: 667648 | Author: ali khaleel | Hits:

[Console] MFCC1

Description: 提取语音的MFCC特征参数，在语音识别（Speech Recognition）和话者识别（Speaker Recognition）方面，最常用到的语音特征就是梅尔倒谱系数（Mel-scale Frequency Cepstral Coefficients，简称MFCC）。-MFCC feature parameters extracted speech, speech recognition (Speech Recognition) and speaker verification (Speaker Recognition) terms, the most commonly used speech features that Mel Cepstral (Mel-scale Frequency Cepstral Coefficients, referred MFCC).
Platform: | Size: 1755136 | Author: pxchen | Hits:

[Speech/Voice recognition/combine] Speech-recognition

Description: MFCC参数是基于人的听觉特性利用人听觉的屏蔽效应，在Mel标度频率域提取出来的倒谱特征参数。-MFCC parameters is based on human auditory characteristics using human auditory masking effect, in Mel scale frequency domain parameters of cepstrum.
Platform: | Size: 4096 | Author: 徐金 | Hits:

[Speech/Voice recognition/combine] mfcc

Description: mfcc used in python mel-scale(mfcc used in python mel-scale)
Platform: | Size: 8192 | Author: liumeng | Hits:

Category

Source Code

Web/Internet

Develop Tools

Document

Other

Search in results

OS

Platform

Language

File Type

Search list