tailieunhanh - Báo cáo toán học: " A novel voice activity detection based on phoneme recognition using statistical model"

Tuyển tập các báo cáo nghiên cứu khoa học ngành toán học được đăng trên tạp chí toán học quốc tế đề tài: A novel voice activity detection based on phoneme recognition using statistical model | EURASIP Journal on Audio Speech and Music Processing SpringerOpen0 This Provisional PDF corresponds to the article as it appeared upon acceptance. Fully formatted PDF and full text HTML versions will be made available soon. A novel voice activity detection based on phoneme recognition using statistical model EURASIP Journal on Audio Speech and Music Processing 2012 2012 1 doi 1687-4722-2012-1 Xulei Bao qunzhong@ Jie Zhu zhujie@ ISSN 1687-4722 Article type Research Submission date 19 September 2011 Acceptance date 9 January 2012 Publication date 9 January 2012 Article URL http content 2012 1 1 This peer-reviewed article was published immediately upon acceptance. It can be downloaded printed and distributed freely for any purposes see copyright notice below . For information about publishing your research in EURASIP ASMP go to http authors instructions For information about other SpringerOpen publications go to http 2012 Bao and Zhu licensee Springer. This is an open access article distributed under the terms of the Creative Commons Attribution License http licenses by which permits unrestricted use distribution and reproduction in any medium provided the original work is properly cited. A novel voice activity detection based on phoneme recognition using statistical model Xulei Bao and Jie Zhu Department of Electronic Engineering Shanghai Jiao Tong University Shanghai 200240 China Corresponding author qunzhong@ Email address JZ zhujie@ Email Corresponding author Abstract In this article a novel voice activity detection VAD approach based on phoneme recognition using Gaussian Mixture Model based Hidden Markov Model HMM GMM is proposed. Some sophisticated speech features such as high order statistics HOS harmonic structure information and Mel-frequency cepstral coefficients MFCCs are employed to represent each speech .

TÀI LIỆU LIÊN QUAN
TỪ KHÓA LIÊN QUAN
TÀI LIỆU MỚI ĐĂNG
13    150    1    26-11-2024
7    106    0    26-11-2024