Aliasing reduction in L-F model implementation for an interactive tool ...

0 downloads 20 Views 5MB Size Report
Aug 3, 2015 - implementation for an interactive tool applicable to speech science education ... Nara Institute of Scienc
Aliasing reduction in L-F model implementation for an interactive tool applicable to speech science education Hideki KAWAHARA†, Ken-Ichi SAKAKIBARA††, 
 Hideki BANNO†††, Masanori MORISE††††, 
 Tomoki TODA†††††, and Toshio IRINO† † Wakayama

University
 †† Health Sciences University of Hokkaido ††† Meijo University †††† University of Yamanashi
 ††††† Nara Institute of Science and Technology IEICE Technical meeting, Sendai Japan, 3 August 2015

Overview •Interactive environment for speech science education

•Relation between vocal tract shape,

transfer function, pole locations, LSF, F0, vibrato, voice source and speech sounds.

•A closed-form representation of the antialiased version of the well known voice source model (L-F model) is introduced

•Matlab-based implementation

Overview •Interactive environment for speech science education

•Relation between vocal tract shape,

transfer function, pole locations, LSF, F0, vibrato, voice source and speech sounds.

•A closed-form representation of the antialiased version of the well known voice source model (L-F model) is introduced

•Matlab-based implementation

vocal tract area function (log-area)

mouth

glottis

3D shape

actual modification: scaling × modifier shape

scaling

modifier shape

poles of the transfer function

vocal tract transfer function

selected pole

pole information

LSF: line spectrum frequencies

Overview •Interactive environment for speech science education

•Relation between vocal tract shape,

transfer function, pole locations, LSF, F0, vibrato, voice source and speech sounds.

•A closed-form representation of the antialiased version of the well known voice source model (L-F model) is introduced

•Matlab-based implementation

English translation of following slides

interp. corr.

autocorrelation

Jacobi coeff.

PARCOR coeff. poly. coeff. p.

LPC coeff.

log-aera

CSM poly. root.

LPC polynom. pole freq. bw.

transfer function

CSM freq. int. LSP freq.

display

control

log-aera

transfer function

display

control

log-area
 manipulation

log-aera

transfer function

display

control

pole location
 manipulation

log-aera

transfer function

display

control

Voice source model piecewise (exponential) 
 function: discontinuity

Dept. for Speech, Music and Hearing

Quarterly Progress and Status Report

A four-parameter model of glottal flow Fant, G. and Liljencrants, J. and Lin, Q.

journal: volume: number: year: pages:

STL-QPSR 26 4 1985 001-013

http://www.speech.kth.se/qpsr

Voice source model piecewise (exponential) 
 function: singularity

Voice source model piecewise (exponential) function: singularity

Voice source model piecewise (exponential) function: discontinuity

discretization aliasing

Severe degradation!

Before and after antialiasing

direct digitization

Antialiasing+digitization
 +equalization

Before and after antialiasing

direct digitization

Antialiasing+digitization
 +equalization

Before and after antialiasing

direct digitization

Antialiasing+digitization
 +equalization

Voice source model piecewise (exponential) function: discontinuity

discretization aliasing

Severe degradation!

Overview •Interactive environment for speech science education

•Relation between vocal tract shape,

transfer function, pole locations, LSF, F0, vibrato, voice source and speech sounds.

•A closed-form representation of the antialiased version of the well known voice source model (L-F model) is introduced

•Matlab-based implementation

Anti-aliased L-F model L-F model

L-F model

L-F volume velocity

L-F model

discrete time signal over-sampled discretization over-sampled discretization

digital anti-aliasing

discretization down sampling

differentidigital down ation anti-aliasing sampling

anti-aliasing

continuous time signal

discretization

equalizer

A

B

A B D

C D

C

A B C A B D

C D

Voice source panel

Conclusion •Interactive environment for speech science education

•Relation between vocal tract shape,

transfer function, pole locations, LSF, F0, vibrato, voice source and speech sounds.

•A closed-form representation of the antialiased version of the well known voice source model (L-F model) is introduced

•Matlab-based implementation

Please download and have fun!

These do not require Matlab

Equalizer

Suggest Documents