本论文研究基于特定人的孤立词汇语音识别系统。主要工作是完成对语音信号的预处理(包括预加重、分帧、加窗、端点检测等),特征参数的提取、模板匹配,最终寻找最匹配项,输出匹配结果。
论文首先介绍了语音识别的基本原理、语音识别系统结构,然后详细讨论了MFCC参数的提取以及DTW(动态时间规整)算法等,最后针对特定人的孤立词汇,进行了识别实验,得到实验结果。DTW算法对硬件环境要求低,计算速度快,十分适合语音库较小情况下特定人孤立词汇的语音识别。
该识别系统采用Matlab 2010b作为开发工具,使用voicebox作为开发工具包,实现对声音文件的各种操作。并且,本系统采用了GUI(图形界面),使用更加直观。6635
关键词 特定人 孤立词汇 语音识别 MFCC DTW
毕业设计说明书(论文)外文摘要
Title A Speaker-independent and Isolated-word Speech Recognition
Abstract
In this paper, we studied the speaker-independent and isolated-word speech recognition system. This system completed the task of pre-processing of speech signals (including pre-emphasis, frame blocking, windowing and end point detecting), extracting of the parameter, template matching and finally, finding out the match and outputting the results. After this process, we conduct the statistical results in order to illustrate the performance of this system.
This paper introduces the principles of speech recognition, the construction of speech recognition system in the first place. Then, this paper discussed MFCC extraction and DTW(dynamic time warping) algorithm, etc. in detail. Finally, aiming at speaker-independent isolated words, this paper conducted a speech recognition experiment and received the results. DTW has a reputation of low hardware requirements and high computing speed, which makes it fit for speaker-independent and isolated-word speech recognition with a relatively small library.
This recognition system uses Matlab 2010b and a toolbox named voicebox as an SDK to realize voice data management. Additionally, this system has a GUI(graphical user interface) to ensure a more intuitive use.
Keywords Speaker-independent Isolated-words Speech Recognition MFCC DTW
目 录