Title | Constructing chinese abbreviation dictionary: A stacked approach |
Authors | Zhang, Longkai Li, Sujian Wang, Houfeng Sun, Ni Meng, Xinfan |
Affiliation | Key Laboratory of Computational Linguistics, Ministry of Education, Peking University, China |
Issue Date | 2012 |
Citation | 24th International Conference on Computational Linguistics, COLING 2012.Mumbai, India. |
Abstract | Abbreviation is a common linguistic phenomenon with wide popularity and high rate of growth. Correctly linking full forms to their abbreviations will be helpful in many applications. For example, it can improve the recall of information retrieval systems. An intuition to solve this is to build an abbreviation dictionary in advance. This paper investigates an automatic abbreviation generation method, which uses a stacked approach for Chinese abbreviation generation. We tackle this problem in two stages. First we use a sequence labeling method to generate a list of candidate abbreviations. Then, we try to use search engine to incorporate web data to re-rank the candidates, and finally get the best candidate. We use a Chinese abbreviation corpus which contains 8015 abbreviation pairs to evaluate the performance. Experiments revealed that our method gave better performance than the baseline methods. ? 2012 The COLING. |
URI | http://hdl.handle.net/20.500.11897/411860 |
Indexed | EI |
Appears in Collections: | 计算语言学教育部重点实验室 |