SPEECH AND AUDIO SIGNAL PROCESSING.pdf - Google Drive
Recommend Documents
5. What are diphthongs? 6. Why we require sampling of analog signals? 7. Why we generally keep sampling frequency more t
from any use or operation of any methods, products, instructions, or ideas contained in the material herein. British Lib
as VisualStudio, GCC, Eclipse, Pycharm and has undertaken audio signal processing course. Page 1 of 1. JD_ Audio signal
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 5
, JULY 2002 ... In this paper, the automatic classification of audio signals into.
Whoops! There was a problem loading more pages. Retrying... Audio digital.pdf. Audio digital.pdf. Open. Extract. Open wi
Aug 15, 2007 - the development of both speech and non-speech intersensory perception across .... a Powermac G4 computer (Apple Inc., Cuppertino, CA, USA). ... and position (center, periphery) as a within-subjects factor indicated.
Hearing, Speech and. Audio Technology. Fraunhofer Institute for. Digital Media
Technology IDMT. Fraunhofer - Partner for increased market competitiveness.
Aug 27, 2008 - Applications: digital multimedia libraries, media monitoring, News on. Demand ... Developing multimedia and multilingual indexing and management tools .... Best combination via feature vector concatenation (78 features).
results are obtained when the well-known mel-cepstrum technique [2] is applied. Because of ..... S.B. Davis y P. Mermelstein, IEEE Trans. on ASSP, vol. 28, pp.
of view. The International Standard Organization (ISO) started in October 1996 a ..... France, October 1998. [2] MPEG-7: ISO/IEC 15938-5 Final Commitee Draft-.
costs and the feasibility of management of hundreds of devices. .... RFID-based system able to locate and track personnel and machinery in mines ..... recording) cannot be easily distinguished, since the shapes of their PMFs ..... the energy content
SPEECH AND AUDIO SIGNAL PROCESSING.pdf - Google Drive
... audio compression algorithm. (6+8). 8. Write short notes on : i) Mel frequency cepstral coefficients. ii) Hidden Mar
*JEP1056*
JEP – 1056
II Semester M.E. (Electronics & Communication) Degree Examination, July 2014 (2K9 Scheme) EL 215.4 : Elective – I : SPEECH AND AUDIO SIGNAL PROCESSING Time : 3 Hours
Max. Marks : 100
Instruction : Answer any five full questions. 1. a) Write a block diagram to illustrate speech production mechanism. Discuss the significance of source and filter in the context of producing vowels, fricatives and plosives sounds. b) Write a note on : Spectrograms.
8 6
c) Discuss the following speech sounds in terms of source, filter, places and manner of articulation : i) Nasals and ii) Diphthongs. Give an example for each.
6
2. a) Define short time energy and short time average zero crossing rate.
4
b) Discuss speech Vs silence discrimination algorithm based on short time energy and short time average zero crossing rate. Indicate the limitation of the technique.
8
c) Discuss the significance of short time autocorrelation function and 3 level central clipper in the context of speech signal processing.
8
3. a) Explain short time Fourier transform and give its filter bank interpretation in terms of low pass and band pass filters.
6
b) Discuss the overlap and add method of short time synthesis and derive the expression for necessary constraints.
8
c) Show that short time autocorrelation function and short time psd form a Fourier transform pair.
6 P.T.O.
JEP – 1056
-2-
*JEP1056*
4. a) State and prove any two properties of complex cepstrum by considering rational z-transform of the form. |A|∏
X (z) =
(∏
Ni
Mi k =1
k =1
6
(1 − a kz −1) ∏M0 (1 − bkz) k =1
. (1 − ckz −1)) (∏N0 (1 − dkz)) k =1
b) Find the complex and real cepstrum of the sequence p(n) = δ(n) +
1 1 δ(n − 25 ) + δ(n − 50 ) . 2 4
c) Explain with a block diagram steps involved in computing the real cepstrum of a speech signal. Draw a block diagram to explain how formants of the speech signal are estimated from the computed cepstrum and justify the steps.
6
8
5. a) Define a pth order linear predictor. Show that the following relation holds for a pth order linear predictor s (n) =
p
∑k=1aks(n − k) + e(n), where {ak} are LP
coefficients and e (n) is the prediction error.
4
b) Explain the autocovariance method of LP analysis and show that i)
p
∑k =1α kφn( i − k ) = φn (i, 0),
1 ≤ i ≤ p, and assuming p = 4, write the above
equation in the matrix form. ii) the diagonal elements of the matrix are given by, φn(i + 1, k + 1) = φn(i, k) + sn(− i − 1)sn(− k − 1) − sn(N − 1 − i)sn (N − 1 − k ) .
8
c) Using Durbin’s recursive algorithm compute the transfer function of 2nd order LP model for a speech signal whose autocorrelation sequence is R(k)= (24/5) × 2–|k| – (27/10) × 3–|k|. 6. a) Explain the concept of masking in the context of speech perception.
8 6
b) Explain how VQ techniques can be applied to classify speech vectors.
8
c) Explain 3 different groups of speech coders.
6
*JEP1056*
-3-
JEP – 1056
7. a) With a block diagram explain ADPCM system with feed-forward adaptive quantization. b) Write a note on : i) LD_CELP G.728 speech compression standard. ii) MPEG audio compression algorithm.
6
(6+8)
8. Write short notes on : i) Mel frequency cepstral coefficients. ii) Hidden Markov model, and iii) Text to speech synthesis. __________________