Module Description: In this module the learner will gain knowledge and experience of working with speech signal processing technology, and learn about the key signal processing techniques underpinning modern speech technology applications and how they affect system performance.
  1. Identify, describe and apply signal processing techniques to speech processing applications and speech feature extraction.
  2. Compare and select a suitable feature set for a specified speech application.
  3. Describe and evaluate the components of automatic speech recognition systems.
  4. Describe and evaluate the components of text-to-speech synthesis systems.
  5. Describe and critique emerging applications of speech technology.

Fundamental Theory
Introduction to speech - Sound and human speech. Phonetics and Phonology. Words, syllables and phonemes. Syntax and Semantics. Speech Technology Overview - Technology and applications utilising speech processing. Overview of Digital Signal Processing - Digital signals, systems and sampling. Time and frequency domain representations. Digital filters, the fast Fourier transform, windowing and filterbanks. Auditory system & speech perception - Anatomy of the auditory system. Signal processing models of the auditory system. Psychoacoustics and auditory perception of speech. Speech production - Anatomy of the speech production system. Models of the speech production system. Speech Signal representations - Short-time Fourier analysis and feature extraction. Acoustic model of speech production. Linear predictive coding. Perceptually motivated representations. Formant frequencies and pitch. Measurement of speech quality and intelligibility.
Speech Recognition
Hidden Markov Models - The Markov chain and hidden Markov models. Continuous and semi-continuous HMMs. Practical issues in using HMMs and their limitations. Gaussian Mixture Models. Acoustic Modelling - Variability in the speech signal. How to measure speech recognition errors. Signal processing and feature extraction. Phonetic modelling in speech recognition. Acoustic modelling: scoring acoustic features. Robustness and adaptive techniques: minimizing mismatches. Confidence measures: measuring the reliability.
Text-to-speech synthesis
Text and Phonetic Analysis - Modules and data flow. The lexicon of the synthesiser. Document structure detection. Text normalization. Linguistic analysis. Homograph disambiguation. Morphological analysis. Letter-to-sound conversion. Evaluation. Case study: Festival speech synthesis system. Prosody - Perception of prosody. Prosody generation schematic, speaking style, symbolic prosody and duration assignment. Pitch generation and evaluation. Speech Synthesis - Attributes of speech synthesis. Formant synthesis. Concatenative synthesis and unit selection synthesis. Prosodic modification of speech. Source-filter models for prosody modification. Evaluation of TTS Systems. Statistical parametric speech synthesis. Developing a speech corpus for synthesis. Expressive speech synthesis.
Review of Research Oriented and Emerging Applications
A review of emerging fields of study and applications in speech processing, for example biometrics and biomedical applications.
