tailieunhanh - Báo cáo khoa học: "Resume Information Extraction with Cascaded Hybrid Model"

This paper presents an effective approach for resume information extraction to support automatic resume management and routing. A cascaded information extraction (IE) framework is designed. In the first pass, a resume is segmented into a consecutive blocks attached with labels indicating the information types. Then in the second pass, the detailed information, such as Name and Address, are identified in certain blocks (. blocks labelled with Personal Information), instead of searching globally in the entire resume. . | Resume Information Extraction with Cascaded Hybrid Model Kun Yu Department of Computer Science and Technology University of Science and Technology of China Hefei Anhui China 230027 yukun@ Gang Guan Department of Electronic Engineering Tsinghua University Bejing China 100084 guangang@ Ming Zhou Microsoft Research Asia 5F Sigma Center Zhichun Road Haidian Bejing China 100080 mingzhou@ Abstract This paper presents an effective approach for resume information extraction to support automatic resume management and routing. A cascaded information extraction IE framework is designed. In the first pass a resume is segmented into a consecutive blocks attached with labels indicating the information types. Then in the second pass the detailed information such as Name and Address are identified in certain blocks . blocks labelled with Personal Information instead of searching globally in the entire resume. The most appropriate model is selected through experiments for each IE task in different passes. The experimental results show that this cascaded hybrid model achieves better F-score than flat models that do not apply the hierarchical structure of resumes. It also shows that applying different IE models in different passes according to the contextual structure is effective. 1 Introduction Big enterprises and head-hunters receive hundreds of resumes from job applicants every day. Automatically extracting structured information from resumes of different styles and formats is needed to support the automatic construction of database searching and resume routing. The definition of resume information fields varies in different applications. Normally resume information is described as a hierarchical structure The research was carried out in Microsoft Research Asia. with two layers. The first layer is composed of consecutive general information blocks such as Personal Information Education etc. Then within each general information

crossorigin="anonymous">
Đã phát hiện trình chặn quảng cáo AdBlock
Trang web này phụ thuộc vào doanh thu từ số lần hiển thị quảng cáo để tồn tại. Vui lòng tắt trình chặn quảng cáo của bạn hoặc tạm dừng tính năng chặn quảng cáo cho trang web này.