tailieunhanh - Báo cáo hóa học: " Research Article Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms"

Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Research Article Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms | Hindawi Publishing Corporation EURASIP Journal on Audio Speech and Music Processing Volume 2008 Article ID 568737 8 pages doi 2008 568737 Research Article Voice-to-Phoneme Conversion Algorithms for Voice-Tag Applications in Embedded Platforms Yan Ming Cheng Changxue Ma and Lynette Melnar Human Interaction Research Motorola Labs 1925 Algonquin Road Schaumburg IL 60196 USA Correspondence should be addressed to Lynette Melnar melnar@ Received 28 November 2006 Revised 15 July 2007 Accepted 26 September 2007 Recommended by Joe Picone We describe two voice-to-phoneme conversion algorithms for speaker-independent voice-tag creation specifically targeted at applications on embedded platforms. These algorithms batch mode and sequential are compared in speech recognition experiments where they are first applied in a same-language context in which both acoustic model training and voice-tag creation and application are performed on the same language. Then their performance is tested in a cross-language setting where the acoustic models are trained on a particular source language while the voice-tags are created and applied on a different target language. In the same-language environment both algorithms either perform comparably to or significantly better than the baseline where utterances are manually transcribed by a phonetician. In the cross-language context the voice-tag performances vary depending on the source-target language pair with the variation reflecting predicted phonological similarity between the source and target languages. Among the most similar languages performance nears that of the native-trained models and surpasses the native reference baseline. Copyright 2008 Yan Ming Cheng et al. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use distribution and reproduction in any medium provided the original work is properly cited. 1. INTRODUCTION A voice-tag or name-tag .

TÀI LIỆU LIÊN QUAN