comparison of different implementations of mfcc - CiteSeerX

Recommend Documents

Security Evaluation of Different AES Implementations ... - CiteSeerX

ShiftRows. MixColumns. SubBytes. AddRoundKey. Key. Expansion. Round. Key. Register. Register. Reset. AES DATAPATH. AES. CONTROL. Control. Start.

Comparison between different implementations of PVM on IBM SP1 ...

Comparison between di erent implementations of PVM on IBM. SP1 and CRAY T3D. C. Calvin. LMC - IMAG, 46 Av. F elix Viallet,. 38031 Grenoble Cedex 1, ...

COMPARISON OF MFCC AND PITCH ... - Semantic Scholar

glottis and are used for speaker identification (SID). The proposed ..... motivate the design and test of a SID system based on a sex pre- ... [6] R.D. Patterson.

A Comparison of MPI and OpenMP Implementations of a ... - CiteSeerX

a Finite Element Analysis Code. Michael Bane. Centre for Novel Computing. Department of Computer Science. University of Manchester. Oxford Road ...

A Comparison of MPI and OpenMP Implementations of a ... - CiteSeerX

using an element level approach could achieve very good results using MPI [3] ... is to investigate how well the same code can be implemented using OpenMP [4]. ... Further details are given in various texts, but see in particular Freeman and.

Comparison of Two Implementations of Scalable Montgomery ...

AbstractâThis paper presents a comparison of two pos- sible approaches for the efficient implementation of a scalable Montgomery Modular Multiplication ...

Comparison of black box implementations of two

processing of NMR spectra, Gaussian Mixture Model and Singular. Value Decomposition .... HSVD, which belongs to the group of time-domain algorithms for ...

Comparative Evaluation of Various MFCC ... - CiteSeerX

Abstract. Making no claim of being exhaustive, a review of the most popular MFCC (Mel Frequency Cepstral Coefficients) implementations is made.

Numerical Aspects of Different Implementations Kalman Filter

M. Verhaegen is with the NASA Ames Research Center, Moffett Field, CA ..... (which we call Mk) by ern* llMkll, and those in constructing the postarray (which we ...

A Comparison of Linda Implementations in Java - CiteSeerX

Abstract. This paper describes the implementation of an extended version of Linda in Java. The extensions have been made with a view to increasing the ...

COMPARING DIFFERENT IMPLEMENTATIONS FOR THE ...

function, which requires a training algorithm. Levenberg-Marquardt is a second order algorithm which outperforms Backpropagation and is currently available in ...

Comparison between three implementations of ... - Ocean Science

Comparison between three implementations of automatic identification algorithms for the quantification and characterization of mesoscale eddies in the South ...

COMPARISON OF CPML IMPLEMENTATIONS FOR ... - (PIER) Journals

Jun 27, 2011 - THE GPU-ACCELERATED FDTD SOLVER. J. I. Toivanen 1, *, T. P. ... 1Department of Mathematical Information Technology, University of. JyvÃ¤skylÃ¤, P. O. Box .... computational efficiency of the program. For instance, it affects ...

COMPARISON OF THREE DIFFERENT OPEN ... - CiteSeerX

Computer and Automation Research Institute (www.sztaki.hu). Kende u. 13, H-1111 .... A general application programming interface (the upper layer on the Fig.

COMPARISON OF DIFFERENT MIMO STRATEGIES FOR ... - CiteSeerX

On the other hand V-. BLAST does not encode the signal (or, .... 16, no. 8, pp. 1451â1458, Oct. 1998. [2] H.; Calderbank A.R. Tarokh, V.; Jafarkhani, âSpace-.

Comparison of Performance between Different Selection ... - CiteSeerX

Mar 21, 2009 - wheel selection and tournament selection. A SGA is mainly composed of three genetic operations, which are selection, crossover and mutation.

Comparison of Performance between Different Selection ... - CiteSeerX

Mar 21, 2009 - D.S.Huang, Horace H.S.Ip and Zheru Chi, â A neural root finder of polynomials based on root moments,â. Neural Computation, Vol.16, No.8, ...

Performance Comparison of Different Techniques for ... - CiteSeerX

Jul 29, 2003 - Phone: +39-081-7682894, Fax: +39-081-7682950, Email: [email protected] ..... have been made in the source code of the traffic generator. .... Guide,â http://mgen.pf.itd.nrl.navy.mil/mgen.html.

Pharmacological Comparison of Two Different Insect ... - CiteSeerX

from the locust. Using the scorpion Î±-like toxin BmK M1 as a suitable pharmacological tool, 5 of its highly conserved aromatic residues (Y14, Y21, W38, Y35 and ...

Comparison of different machine strength grading ... - CiteSeerX

Oct 30, 2008 - Comparison of different machine strength grading principles. M. Bacher. MiCROTEC GmbH, Julius Durst StraÃe 98, 39042 Brixen (BZ), Italy.

COMPARISON OF DIFFERENT METHODS FOR 210Pb ... - CiteSeerX

important isotope from the point of view of radiation protection. The value of its ingestion dose ..... The radiochemistry of polonium. National Research Council ...

a comparison of different methods for - CiteSeerX

ABSTRACT. This paper presents a comparison of different methods for noise reduction in Behind-The-Ear (BTE) hearing aids with two microphones. In this work ...

a comparison of different methods for - CiteSeerX

With each hearing aid a two-stage adaptive beamformer (A2B) can be added. To make a comparison between different methods (HAO, HA2D, HA3D, HA1+A2B,.

Comparison of Different Ti

Identification of Gram-Positive Cocci by Use of Matrix-Assisted Laser. Desorption ... Received 4 October 2012 Returned for modification 1 November 2012.

comparison of different implementations of mfcc - CiteSeerX

Download PDF

55 downloads 54737 Views 133KB Size Report

Comment

the 863 Speech Database show that, compared with the traditional MFCC with its .... overlapped or not makes a big difference. ..... spontaneous speech data.

J. Computer Science & Technology, 16(6): 582-589, Sept. 2001

COMPARISON OF DIFFERENT IMPLEMENTATIONS OF MFCC Fang Zheng (郑方), Guoliang Zhang (张国亮) and Zhanjiang Song (宋战江) Center of Speech Technology, State Key Lab of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, P.R. China [email protected], http://sp.cs.tsinghua.edu.cn

ABSTRACT The performance of the Mel-Frequency Cepstrum Coefficients (MFCC) may be affected by (1) the number of filters, (2) the shape of filters, (3) the way that filters are spaced, and (4) the way that the power spectrum is warped. In this paper, several comparison experiments are done to find a best implementation. The traditional MFCC calculation excludes the 0th coefficient for the reason that it is regarded as somewhat unreliable. According to the analysis and experiments, the authors find that it can be regarded as the generalized frequency band energy (FBE) and is hence useful, which results in the FBE-MFCC. The authors also propose a better analysis, namely the auto-regressive analysis, on the frame energy, which outperforms its 1st and/or 2nd order differential derivatives. Experiments across the 863 Speech Database show that, compared with the traditional MFCC with its corresponding auto-regressive analysis coefficients, the FBE-MFCC and the frame energy with their corresponding auto-regressive analysis coefficients form the best combination, reducing the Chinese syllable error rate (CSER) by about 10.0%, while the FBE-MFCC with the corresponding auto-regressive analysis coefficients reducing CSER by 2.5%. Comparison experiments are also done across a quite casual Chinese speech database, named Chinese Annotated Spontaneous Speech (CASS) corpus, the FBE-MFCC can reduce the error rate by about 2.9% on an average. Keywords: MFCC, frequency band energy, auto-regressive analysis, generalized initial/final

1. INTRODUCTION The extraction and selection of the best parametric representation of acoustic signals is an important task in the design of any speech recognition system; it significantly affects the recognition performance. A compact representation would be provided by a set of mel-frequency cepstrum coefficients (MFCC), which are the results of a cosine transform of the real logarithm of the short-term energy spectrum expressed on a mel-frequency scale [9]. The MFCCs are proved more efficient [2]. The calculation of the MFCC includes the following steps. (1) The discrete Fourier transform (DFT) transforms the windowed speech segment into the frequency domain and the short-term power spectrum P (f) is obtained. (2) The spectrum P (f) is warped along its frequency axis f (in hertz) into the mel-frequency axis as P (M) where M is the mel-frequency, using Equation (1) [8][10]. This is to approximately reflex the human’s ear perception. (1) M ( f ) = 2595 log10 (1 + f / 700) (3) The resulted warped power spectrum is then convolved with the triangular band-pass filter P(M) into θ(M). The convolution with the relatively broad critical-band masking curves ψ(M) significantly reduces the spectral resolution of θ(M) in comparison with the original P(f), which allows for the down sampling of θ(M). The discrete convolution of ψ(M) with θ(M) yields samples of the critical-band power spectrum as θ(Mk) (k=1..K) in Equation (2), where Ωk’s are linearly spaced in the mel-scale. Then K outputs X(k)=ln(θ(Mk)) (k=1..K) are obtained. The K filters in the implementation of discrete convolution are simulated as shown in Figure 1 (a). In the implementation, θ(Mk) is the average instead of the sum. θ( M k ) = ∑ P( M − M k )Ψ ( M ), k = 1..K (2) M (4) The MFCC is computed using Equation (3) and often D