tailieunhanh - Báo cáo hóa học: " Correlation analysis of the speech multiscale product for the open quotient estimation"

Tuyển tập các báo cáo nghiên cứu về hóa học được đăng trên tạp chí hóa hoc quốc tế đề tài : Correlation analysis of the speech multiscale product for the open quotient estimation | Saidi et al. EURASIP Journal on Audio Speech and Music Processing 2011 2011 8 http content 2011 1 8 D EURASIP Journal on Audio Speech and Music Processing a SpringerOpen Journal RESEARCH Open Access Correlation analysis of the speech multiscale product for the open quotient estimation Wafa Saidi Aicha Bouzid and Noureddine Ellouze Abstract This article proposes a multiscale product MP -based method for estimating the open quotient OQ from the speech waveform. The MP is operated by calculating the wavelet transform coefficients of the speech signal at three scales and then multiplying them. The resulting MP signal presents negative peaks informing about the glottis closure and positive ones informing about the glottis opening. Taking into account the shape of the speech MP close to the derivative of electroglottographic EGG signal we proceed to a correlation analysis for the fundamental frequency and OQ measurement. The approach validation is done on voiced parts of the Keele University database by calculating the absolute and relative errors between the OQ estimated from the speech and the corresponding EGG signals. When considering the mean OQ over each voiced segments results of our test show that OQ is estimated within an absolute error from to and a relative error from 8 to 21 for all the speakers. The approach is not so performant when the evaluation concerns the OQ frame-by-frame measurements. The absolute error reaches and the relative error 30 . Keywords speech open quotient multiscale product crosscorrelation 1. Introduction According to the source-filter theory of the speech production 1 voiced speech is represented as the response of the vocal tract filter to the glottal voice source. The glottal source consists of quasi-periodic pulses which are created by the vocal folds oscillations. It is characterised by two crucial moments the glottal closure GCI and opening instants GOI . GCIs and GOIs are required to be .

TÀI LIỆU LIÊN QUAN