Audio and Speech Processing with MATLAB

Audio and Speech Processing with MATLAB

EnglishPaperback / softbackPrint on demand
Hill, Paul
Taylor & Francis Ltd
EAN: 9780367656317
Print on demand
Delivery on Wednesday, 15. of January 2025
€60.84
Common price €67.60
Discount 10%
pc
Do you want this product today?
Oxford Bookshop Banská Bystrica
not available
Oxford Bookshop Bratislava
not available
Oxford Bookshop Košice
not available

Detailed information

Speech and audio processing has undergone a revolution in preceding decades that has accelerated in the last few years generating game-changing technologies such as truly successful speech recognition systems; a goal that had remained out of reach until very recently. This book gives the reader a comprehensive overview of such contemporary speech and audio processing techniques with an emphasis on practical implementations and illustrations using MATLAB code. Core concepts are firstly covered giving an introduction to the physics of audio and vibration together with their representations using complex numbers, Z transforms and frequency analysis transforms such as the FFT.

Later chapters give a description of the human auditory system and the fundamentals of psychoacoustics. Insights, results, and analyses given in these chapters are subsequently used as the basis of understanding of the middle section of the book covering: wideband audio compression (MP3 audio etc.), speech recognition and speech coding.

The final chapter covers musical synthesis and applications describing methods such as (and giving MATLAB examples of) AM, FM and ring modulation techniques. This chapter gives a final example of the use of time-frequency modification to implement a so-called phase vocoder for time stretching (in MATLAB).

Features



  • A comprehensive overview of contemporary speech and audio processing techniques from perceptual and physical acoustic models to a thorough background in relevant digital signal processing techniques together with an exploration of speech and audio applications.




  • A carefully paced progression of complexity of the described methods; building, in many cases, from first principles.




  • Speech and wideband audio coding together with a description of associated standardised codecs (e.g. MP3, AAC and GSM).




  • Speech recognition: Feature extraction (e.g. MFCC features), Hidden Markov Models (HMMs) and deep learning techniques such as Long Short-Time Memory (LSTM) methods.




  • Book and computer-based problems at the end of each chapter.




  • Contains numerous real-world examples backed up by many MATLAB functions and code.


EAN 9780367656317
ISBN 0367656310
Binding Paperback / softback
Publisher Taylor & Francis Ltd
Publication date September 30, 2020
Pages 330
Language English
Dimensions 234 x 156
Country United Kingdom
Authors Hill, Paul