no-sensitive to phase distortion, phase signal of noise is implemented when phase is restored [5][6]。 In the frequency domain, equation (5) is expressed as

5。2。 Proposed VAD based on improved    spectral

transforms   (DFT)   of  y(i) , s(i)  and d (i)  , respectively。

Because s(i) and d (i) are independent and Nk is gauss distribution, equation (6) is expressed as (7) in frequency domain。

On the basis of section above, we can firstly apply spectral subtraction for noisy sound of CVR to reducing noise and enhance speech, and then enhanced signal is filtered by a preceding filter, finally cockpit voice is extracted by means of double thresholds VAD。 Figure    2

shows the flow chart of proposed VAD。 The preceding



filter is a high-pass filter, such as10。9375z1 ,   which

can filter low-frequency interference, especially

For a frame of speech signal, we

interference of frequency 50Hz or 60Hz, and advance spectrum of  high  frequency which  is  useful  for cockpit

For a wonderful VAD, two requirements must be taken into considered comprehensively: to detect  more  speech

where n (k ) is  statistical  mean  of  unvoiced speech,

2

Sk   is amplitude of enhanced speech。

However, basic SS can generate much musical noises in residual noises。 Some modified SS are proposed to reduce effect。 Weighting factorand power coefficient 

sections and more unvoiced speech sections。 However, when VAD tries to detect more speech frames, it misjudges silence as speech or otherwise。 The latter is ever worse than the former for accident investigation。 Therefore, two evaluation standards are compared to weighing  quantificationally  the  performance  of   VAD:

are introduced into SS, so equation (8) is modified as

probability  of  correctly  detecting  speech  frame Pcs

 

probability of correctly detecting noise frame Pcn  ,   which are expressed as

Modified SS is degraded  to  basic  SS  when =2 and =1[5]。  Other  modified  SS  is  showed  in    relative

references [6][7]。 Better enhancement performance can be gained by adjusting two parameters suitably, but voice

where  N handand  N handare   relatively   the  overall

distortion becomes severer as the degree of noise reduction is larger。

5。Proposed VAD based on improved spectral subtraction

5。1。Improved spectral subtraction

In this paper, we propose iterative spectral subtraction to formerly reducing noise and enhancing speech。 This method uses basic SS or modified SS for appropriate times。 The former enhanced speech becomes latter input signal, so music noise is seen as input noise to reduce again。

number of hand-labeling speech frames and noise  frames

by hand-labeling, N1 and N0 are relatively number of being detected correctly by VAD。

6。2。 Experiment results

In this paper, a section of speech in car (SNR =8) from standard voice bank Aurora2 and a section of true cockpit sound are used, simulation experiments based on traditional double thresholds VAD only and the proposed VAD are carried out。 Figure 3 and table 2 compare the performance of various methods in different environment。 Due to former spectral subtraction, the SNR increases, the curves of STE and ZCR become smoother and proper probability increases。

上一篇:可重构机床设计英文文献和中文翻译
下一篇:模糊TOPSIS方法对初级破碎机英文文献和中文翻译

酵母菌发酵生产天然香料...

从政策角度谈黑龙江對俄...

基于Joomla平台的计算机学院网站设计与开发

浅论职工思想政治工作茬...

上海居民的社会参与研究

AES算法GPU协处理下分组加...

浅谈高校行政管理人员的...

压疮高危人群的标准化中...

STC89C52单片机NRF24L01的无线病房呼叫系统设计

提高教育质量,构建大學生...