tailieunhanh - báo cáo hóa học: " Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion"

Tuyển tập báo cáo các nghiên cứu khoa học quốc tế ngành hóa học dành cho các bạn yêu hóa học tham khảo đề tài: Audio segmentation of broadcast news in the Albayzin-2010 evaluation: overview, results, and discussion | Butko and Nadeu EURASIP Journal on Audio Speech and Music Processing 2011 2011 1 http content 2011 1 1 D EURASIP Journal on Audio Speech and Music Processing a SpringerOpen Journal RESEARCH Open Access Audio segmentation of broadcast news in the Albayzin-2010 evaluation overview results and discussion Taras Butko and Climent Nadeu Abstract Recently audio segmentation has attracted research interest because of its usefulness in several applications like audio indexing and retrieval subtitling monitoring of acoustic scenes etc. Moreover a previous audio segmentation stage may be useful to improve the robustness of speech technologies like automatic speech recognition and speaker diarization. In this article we present the evaluation of broadcast news audio segmentation systems carried out in the context of the Albayzín-2010 evaluation campaign. That evaluation consisted of segmenting audio from the 3 24 Catalan TV channel into five acoustic classes music speech speech over music speech over noise and the other. The evaluation results displayed the difficulty of this segmentation task. In this article after presenting the database and metric as well as the feature extraction methods and segmentation techniques used by the submitted systems the experimental results are analyzed and compared with the aim of gaining an insight into the proposed solutions and looking for directions which are promising. Keywords Audio segmentation Broadcast news International evaluation Introduction The recent fast growth of available audio or audiovisual content strongly demands tools for analyzing indexing searching and retrieving the available documents. Given an audio document the necessary first processing step is audio segmentation which consists of partitioning the input audio stream into acoustically homogeneous regions and label them according to a predefined broad set of classes like speech music noise etc. The research studies on audio segmentation .

TÀI LIỆU LIÊN QUAN