tailieunhanh - Báo cáo khoa học: "Why Press Backspace? Understanding User Input Behaviors in Chinese Pinyin Input Method"

Chinese Pinyin input method is very important for Chinese language information processing. Users may make errors when they are typing in Chinese words. In this paper, we are concerned with the reasons that cause the errors. Inspired by the observation that pressing backspace is one of the most common user behaviors to modify the errors, we collect 54, 309, 334 error-correction pairs from a realworld data set that contains 2, 277, 786 users via backspace operations. In addition, we present a comparative analysis of the data to achieve a better understanding of users’ input behaviors. . | Why Press Backspace Understanding User Input Behaviors in Chinese Pinyin Input Method Yabin Zheng1 Lixing Xie1 Zhiyuan Liu1 Maosong Sun1 Yang Zhang2 Liyun Ru1 2 1State Key Laboratory of Intelligent Technology and Systems Tsinghua National Laboratory for Information Science and Technology Department of Computer Science and Technology Tsinghua University Beijing 100084 China 2Sogou Inc. Beijing 100084 China lavender087 sunmaosong @ zhangyang ruliyun @ Abstract Chinese Pinyin input method is very important for Chinese language information processing. Users may make errors when they are typing in Chinese words. In this paper we are concerned with the reasons that cause the errors. Inspired by the observation that pressing backspace is one of the most common user behaviors to modify the errors we collect 54 309 334 error-correction pairs from a real-world data set that contains 2 277 786 users via backspace operations. In addition we present a comparative analysis of the data to achieve a better understanding of users input behaviors. Comparisons with English typos suggest that some language-specific properties result in a part of Chinese input errors. 1 Introduction Unlike western languages Chinese is unique due to its logographic writing system. Chinese users cannot directly type in Chinese words using a QWERTY keyboard. Pinyin is the official system to transcribe Chinese characters into the Latin alphabet. Based on this transcription system Pinyin input methods have been proposed to assist users to type in Chinese words Chen 1997 . The typical way to type in Chinese words is in a sequential manner Wang et al. 2001 . Assume users want to type in the Chinese word Ấ what . First they mentally generate and type in corresponding Pinyin shenme . Then a Chinese Pinyin input method displays a list of Chinese words which share that Pinyin as shown in Fig. 1. Users Figure 1 Typical Chinese Pinyin input method for a correct Pinyin .

TỪ KHÓA LIÊN QUAN