Scientific report: "ModelTalker Voice Recorder – An Interface System for Recording a Corpus of Speech for Synthesis"

Debra Yarrington, John Gray, Chris Pennington
AgoraNet Inc., Newark, DE 19711, USA
yarringt, gray, penningt @

H. Timothy Bunnell, Allegra Cornaglia, Jason Lilley, Kyoko Nagao, James Polikoff
Speech Research Laboratory, DuPont Hospital for Children, Wilmington, DE 19803, USA
bunnell, cornagli, lilley, nagao, polikoff @

Abstract

We will demonstrate the ModelTalker Voice Recorder (MT Voice Recorder), an interface system that lets individuals record and bank a speech database for the creation of a synthetic voice. The system guides users through an automatic calibration process that sets pitch, amplitude, and silence. It then prompts users with both visual (text-based) and auditory prompts. Each recording is screened for pitch, amplitude, and pronunciation, and users are given immediate feedback on the acceptability of each recording. Users can then rerecord an unacceptable utterance. Recordings are automatically labeled and saved, and a speech database is created from these recordings. The system is intended to make the process of recording a corpus of utterances relatively easy for those inexperienced in linguistic analysis. Ultimately, the recorded corpus and the resulting speech database are used for concatenative speech synthesis, allowing individuals at home or in clinics to create a synthetic voice that sounds like their own.
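The amplitude part of the screening step described in the abstract can be illustrated with a short sketch. This is not the ModelTalker implementation: the function name `screen_amplitude`, the thresholds `min_rms` and `max_peak`, and the feedback messages are all hypothetical, and a real system would presumably derive its limits from the calibration step rather than from fixed constants.

```python
import math


def screen_amplitude(samples, min_rms=0.05, max_peak=0.99):
    """Return (accepted, feedback) for one recording.

    samples: sequence of floats normalized to [-1.0, 1.0].
    min_rms and max_peak are hypothetical thresholds chosen for
    illustration only; they are not taken from the paper.
    """
    if not samples:
        return False, "No audio was captured; please record again."

    # Peak level detects clipping; RMS level approximates loudness.
    peak = max(abs(s) for s in samples)
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))

    if peak >= max_peak:
        return False, "Too loud (clipping); please record again."
    if rms < min_rms:
        return False, "Too quiet; please record again."
    return True, "Recording accepted."
```

Calling `screen_amplitude` on a list of normalized samples returns a flag and a message that an interface could show to the user right after each utterance. Pitch and pronunciation screening would need separate checks that are not sketched here.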
The interface may prove useful for other purposes as well. The system facilitates the recording and labeling of large corpora of speech, making it useful for speech and linguistic research, and it provides immediate feedback on pronunciation, making it useful as a clinical learning tool.

1 Demonstration

MT Voice Recorder Background

While most of us are familiar with the highly intelligible but somewhat robotic sound of synthetic speech, for the approximately 2 million people in the United States with a limited ability to communicate