Title | Building Chinese sense annotated corpus with the help of software tools |
Authors | Wu, Yunfang; Jin, Peng; Guo, Tao; Yu, Shiwen |
Affiliation | School of Electronic Engineering and Computer Science, Peking University, Beijing 100871, China |
Issue Date | 2007 |
Citation | Linguistic Annotation Workshop, LAW 2007. Prague, Czech Republic. |
Abstract | This paper presents the procedure for building a Chinese sense-annotated corpus. A set of software tools was designed to help human annotators speed up annotation and maintain consistency. The tools include 1) a tagger for word segmentation and POS tagging, 2) an annotation interface for describing senses in the lexicon and annotating senses in the corpus, 3) a checker for maintaining consistency, 4) a transformer for converting text files to XML format, and 5) a counter for calculating sense frequency distributions. © 2007 Association for Computational Linguistics. |
URI | http://hdl.handle.net/20.500.11897/411292 |
Indexed | EI |
Appears in Collections: | School of Electronic Engineering and Computer Science |