Scientific report: "Topic-focus and salience"

Eva Hajičová
Faculty of Mathematics and Physics, Charles University
Malostranské nám. 25, 118 00 Praha, Czech Republic
hajicova@

Petr Sgall
Faculty of Mathematics and Physics, Charles University
Malostranské nám. 25, 118 00 Praha, Czech Republic
sgall@

1 Objectives and Motivation

Most of the current work on corpus annotation is concentrated on morphemics, lexical semantics and sentence structure. However, it is becoming more and more obvious that attention should, and can, also be paid to phenomena that reflect the links between a sentence and its context, i.e. the discourse anchoring of utterances. If conceived in this way, an annotated corpus can be used as a resource for linguistic research not only within the limits of the sentence, but also with regard to discourse patterns. Thus, the applications of the research to issues of information retrieval and extraction may be made more effective; applications in new domains also become feasible, be it to serve inner linguistic and literary aims, such as text segmentation or the specification of topics of parts of a discourse, or to serve other disciplines. These considerations have been a motivation for the tectogrammatical (i.e. underlying, see below) tagging done within the Prague Dependency Treebank (PDT) to contain also attributes concerning certain contextual features, i.e. the contextual anchoring of word tokens and their relationships to their coreferential antecedents.
Along with this enrichment in the intersentential aspect, we do not neglect to pay attention to intrasentential issues, i.e. to sentence structure, which displays its own features oriented towards the contextual potential of the sentence, namely its topic-focus articulation (TFA). In the present paper we first give an outline of the annotation scenario of the PDT (Section 2), concentrating then on the use of one of the PDT attributes for the specification of the Topic and the Focus (the information structure) of the sentence
