ELEC97079 (EE4-14) Speech ProcessingLecturer(s): Prof Patrick Naylor Aims
To introduce students to the signal processing and statistical techniques that are used in processing speech signals and to give students an understanding of how these techniques are used in the coding, synthesis and recognition of speech.
Learning Outcomes
To enable students to apply signal processing techniques appropriately for the coding, synthesis and recognition of speech signals. To enable students to use dynamic programming, statistical modelling and inference techniques in pattern recognition applications.
Syllabus
The human vocal and auditory systems. Characteristics of speech signals: phonemes, prosody, IPA notation. Lossless tube model of speech production. Time and frequency domain representations of speech; window characteristics and time/frequency resolution tradeoffs. Properties of digital filters: mean log response, resonance gain and bandwidth relations, bandwidth expansion transformation, all-pass filter characteristics. Autocorrelation and covariance linear prediction of speech; optimality criteria in time and frequency domains; alternate LPC parametrisation. Speech coding: PCM, ADPCM, CELP. Speech synthesis: language processing, prosody, diphone and formant synthesis; time domain pitch and speech modification. Speech recognition: hidden Markov models and associated recognition and training algorithms. Dynamic Programming. Language modelling. Large vocabulary recognition. Acoustic preprocessing for speech recognition.
Exam Duration: 3:00hrs Coursework contribution: 0% Term: Spring Closed or Open Book (end of year exam): Closed Coursework Requirement: Nil Oral Exam Required (as final assessment): no Prerequisite module(s): None required Course Homepage: This course uses Blackboard Book List: Please see Module Reading list
|