Title | Word clustering for collocation-based word sense disambiguation |
Authors | Jin, Peng Sun, Xu Wu, Yunfang Yu, Shiwen |
Affiliation | Department of Computer Science and Technology, Institute of Computational Linguistics, Peking University, 100871, Beijing, China |
Issue Date | 2007 |
Citation | 8th Annual Conference on Intelligent Text Processing and Computational Linguistics, CICLing 2007.Mexico City, Mexico,4394 LNCS(267-274). |
Abstract | The main disadvantage of collocation-based word sense disambiguation is that the recall is low, with relatively high precision. How to improve the recall without decrease the precision? In this paper, we investigate a word-class approach to extend the collocation list which is constructed from the manually sense-tagged corpus. But the word classes are obtained from a larger scale corpus which is not sense tagged. The experiment results have shown that the F-measure is improved to 71% compared to 54% of the baseline system where the word-class is not considered, although the precision decreases slightly. Further study discovers the relationship between the F-measure and the number of word-class trained from the various sizes of corpus. ? Springer-Verlag Berlin Heidelberg 2007. |
URI | http://hdl.handle.net/20.500.11897/327932 |
Indexed | EI |
Appears in Collections: | 信息科学技术学院 计算语言学教育部重点实验室 |