Hindawi Publishing Corporation
EURASIP Journal on Audio, Speech, and Music Processing
Volume 2010, Article ID 252374, 13 pages
doi: 2010/252374

Research Article

Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function

Xueliang Zhang,1,2 Wenju Liu,1 and Bo Xu1

1 National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
2 Computer Science Department, Inner Mongolia University, Huhhot 010021, China

Correspondence should be addressed to Wenju Liu, lwj@

Received 17 September 2010; Accepted 2 December 2010

Academic Editor: DeLiang Wang

Copyright © 2010 Xueliang Zhang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Correlogram is an important representation for periodic signals. It is widely used in pitch estimation and source separation. For these applications, major problems of correlogram are its low resolution and redundant information. This paper proposes a voiced speech segregation system based on a newly introduced concept called dynamic harmonic function (DHF).
In the proposed system, conventional correlograms are further processed by replacing the autocorrelation function (ACF) with DHF. The advantages of DHF are that (1) the peak's width is adjustable by controlling the variance of the Gaussian function, and (2) the invalid peaks of the ACF, that is, those not at the pitch period, tend to be suppressed. Based on DHF, pitch detection and effective source segregation algorithms are proposed. Our system is systematically evaluated and compared with the correlogram-based system. Both the signal-to-noise ratio results and the perceptual evaluation of speech quality scores show that the proposed system yields substantially better performance.

1. Introduction

In a realistic environment, speech is often corrupted by acoustic interference. Meanwhile, many ...
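
The two properties the abstract attributes to the ACF and to Gaussian-shaped peaks can be illustrated with a small sketch. This is not the paper's DHF, whose definition does not appear in this excerpt. It only assumes a synthetic three-harmonic signal with a hypothetical 100 Hz pitch sampled at 16 kHz, and shows that (a) a normalized ACF has a strong peak at twice the pitch period as well as at the pitch period itself, and (b) the width of a Gaussian peak is set by its standard deviation. The names `normalized_acf`, `gaussian_peak`, `fs`, and `f0` are illustrative choices, not taken from the paper.

```python
import numpy as np


def normalized_acf(x, max_lag):
    """Normalized autocorrelation of x for lags 0..max_lag (inclusive)."""
    x = x - np.mean(x)
    energy = np.dot(x, x)
    return np.array([
        np.dot(x[:len(x) - lag], x[lag:]) / energy
        for lag in range(max_lag + 1)
    ])


def gaussian_peak(lags, center, sigma):
    """Gaussian bump centred on `center`; `sigma` controls the peak width."""
    return np.exp(-0.5 * ((lags - center) / sigma) ** 2)


# Assumed test signal: three harmonics of a 100 Hz fundamental (hypothetical).
fs = 16000                      # sampling rate in Hz (assumption)
f0 = 100.0                      # pitch in Hz (assumption)
period = int(fs / f0)           # pitch period in samples
t = np.arange(int(0.1 * fs)) / fs
x = sum(np.sin(2 * np.pi * k * f0 * t) / k for k in (1, 2, 3))

# The ACF peaks at the pitch period and again at twice the pitch period.
acf = normalized_acf(x, 2 * period + 10)
lags = np.arange(len(acf))

# Gaussian peaks at the pitch period with two different widths.
narrow = gaussian_peak(lags, period, sigma=2.0)
wide = gaussian_peak(lags, period, sigma=8.0)

print("ACF at pitch period:      ", acf[period])
print("ACF at twice pitch period:", acf[2 * period])
print("lags above 0.5 (narrow):  ", int((narrow > 0.5).sum()))
print("lags above 0.5 (wide):    ", int((wide > 0.5).sum()))
```

With these settings the ACF value at twice the pitch period stays close to the value at the pitch period. This is the kind of peak "not at the pitch period" that the abstract says DHF tends to suppress. The smaller standard deviation yields the narrower peak.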