Title | Word clustering for collocation-based word sense disambiguation |
Authors | Jin, Peng Sun, Xu Wu, Yunfang Yu, Shiwen |
Affiliation | Peking Univ, Inst Computat Linguist, Dept Comp Sci & Technol, Beijing 100871, Peoples R China. |
Issue Date | 2007 |
Citation | Computational Linguistics and Intelligent Text Processing.4394(267-274). |
Abstract | The main disadvantage of collocation-based word sense disambiguation is that the recall is low, with relatively high precision. How to improve the recall without decrease the precision? In this paper, we investigate a word-class approach to extend the collocation list which is constructed from the manually sense-tagged corpus. But the word classes are obtained from a larger scale corpus which is not sense tagged. The experiment results have shown that the F-measure is improved to 71% compared to 54% of the baseline system where the word-class is not considered, although the precision decreases slightly. Further study discovers the relationship between the F-measure and the number of word-class trained from the various sizes of corpus. |
URI | http://hdl.handle.net/20.500.11897/406465 |
ISSN | 0302-9743 |
Indexed | CPCI-S(ISTP) CPCI-SSH(ISSHP) |
Appears in Collections: | 信息科学技术学院 计算语言学教育部重点实验室 |