Đang chuẩn bị liên kết để tải về tài liệu:
Automatic identification of Vietnamese dialects

Ngọc Yến 80 12 pdf

Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ Tải xuống

The experiment result for the dialect corpus of Vietnamese shows that the performance of dialectal identification with baseline increases from 58.6% for the case using only MFCC coefficients to 70.8% for the case using MFCC coefficients and the information of fundamental frequency. By combining the formants and their bandwidths with the normalized F0 according to average and standard deviation F0, the best recognition rate is 72.2%. | Journal of Computer Science and Cybernetics, V.32, N.1 (2016), 18–29 DOI: 10.15625/1813-9663/32/1/7905 AUTOMATIC IDENTIFICATION OF VIETNAMESE DIALECTS PHAM NGOC HUNG1,2 , TRINH VAN LOAN1,2 , NGUYEN HONG QUANG2 1 Faculty of Information Technology, Hung Yen University of Technology and Education, of Information and Communication Technology, Hanoi University of Science and Technology 1,2 pnhung@utehy.edu.vn; 1,2 loantv@soict.hust.edu.vn; 2 quangnh@soict.hust.edu.vn 2 School Abstract. The dialect identification has been under study for many languages over the world nevertheless the research on signal processing for Vietnamese dialects is still limited and there are not many published works. There are many different dialects for Vietnamese. The influence of dialectal features on speech recognition systems is important. If the information about dialects is known during speech recognition process, the performance of recognition systems will be better because the corpus of these systems is normally organized according to different dialects. In our experiments, MFCC coefficients, formants, correspondent bandwidths and the fundamental frequency with its variants are input parameters for GMM. The experiment result for the dialect corpus of Vietnamese shows that the performance of dialectal identification with baseline increases from 58.6% for the case using only MFCC coefficients to 70.8% for the case using MFCC coefficients and the information of fundamental frequency. By combining the formants and their bandwidths with the normalized F 0 according to average and standard deviation F 0, the best recognition rate is 72.2%. Keywords. Fundamental frequency, MFCC, Formant, Bandwidth, GMM, Vietnamese dialects, identification. 1. INTRODUCTION Vietnamese is a tonal language with many different dialects. It is the diversity of Vietnamese dialects that remains a great challenge to the systems of Vietnamese recognition. In other words, the pronunciation modality of the word .

TÀI LIỆU LIÊN QUAN

Some new results on automatic identification of Vietnamese folk Songs Cheo and Quanho

An efficient method for automatic recognizing text fields on identification card

MRI-based automatic identifcation and segmentation of extrahepatic cholangiocarcinoma using deep learning network

Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews"

Báo cáo khoa học: "Examining the Role of Linguistic Knowledge Sources in the Automatic Identification and Classification of Reviews"

Báo cáo khoa học: "Implementing a Characterization of Genre for Automatic Genre Identification of Web Pages"

Báo cáo khoa học: "Automatic Identification of Non-compositional Phrases"

Báo cáo khoa học: "Automatic Identification of Word Translations from Unrelated English and German Corpora"

Báo cáo khoa học: "Knowledge-based Automatic Topic Identification"

Báo cáo khoa học: "TOWARDS THE AUTOMATIC IDENTIFICATION OF ADJECTIVAL SCALES: CLUSTERING ADJECTIVES ACCORDING TO MEANING"