tailieunhanh - Báo cáo khoa học: "PROMNECMAIO OF CMOETAL GORITHMS EFRACOPRSNOPNNFR THE PHONEMI CIZATION OF OTORPYORHGAH"

A system for converting English text into synthetic speech can be divided into two processes that operate in series: I) a text-to-phoneme converter, and 2) a phonemic-input speech synthesizer. The conversion of orthographic text into a phonemic form may itself comprise several processes i n series, for instance, formatting t e x t t o expand a b b r e v i a t i o n s and n o n - a l p h a b e t i c e x p r e s s i o n s , p a r. | PERFORMANCE COMPARISON OF COMPONENT ALGORITHMS FOR THE PHONEMICIZATION OF ORTHOGRAPHY Jared Bernstein Larry Nessly Telesensory Speech Systems University of North Carolina Palo Alto CA 94304 chapel Hill NC 27514 A system for converting English text into synthetic speech can be divided into two processes that operate in series 1 a text-to-phoneme converter and 2 a phonemic-input speech synthesizer. The conversion of orthographic text into a phonemic form may itself comprise several processes in series for instance formatting text to expand abbreviations and non-alphabetic expressions parsing and word class determination segmental phonemicization of words word and clause level stress assignment word internal and word boundary allophonic adjustments and duration and fundamental frequency settings for phonological units. Comparing the accuracy of different algorithms for text-to-phoneme conversion is often difficult because authors measure and report system performance Ln incommensurable ways. Furthermore comparison of the output speech from two complete systems may not always provide a good test of the performance of the corresponding component algorithms in the two systems because radical performance differences in other components can obscure small differences in the components of interest. The only reported direct comparison of two complete text-to-speech systems MITALK and TSI s TTS-X was conducted by Bernstein and Pisoni 1980 . This paper reports one study that compared two algorithms for automatic segmental phonemlcization of words and a second study that compared two algorithms for automatic assignment of lexical stress. Only three systems for text-to-phoneme conversion have been reported in detail McIlroy s 1974 Votrax driver Hunnicutt s 1976 rules for the MITALK system and the NRL rules developed by Elovitz and associates 1976 . Liberman 1979 Hertz 1981 and Hunnicutt 1980 have described more recent systems but have not published rule sets. One fairly standard

TỪ KHÓA LIÊN QUAN