Title | Towards easier and faster sequence labeling for natural language processing: A search-based probabilistic online learning framework (SAPO) |
Authors | Sun, Xu; Ma, Shuming; Zhang, Yi; Ren, Xuancheng |
Affiliation | Peking Univ, Sch Elect Engn & Comp Sci, Beijing, Peoples R China; Peking Univ, MOE Key Lab Computat Linguist, Beijing, Peoples R China; Peking Univ, Beijing Inst Big Data Res, Ctr Data Sci, Beijing, Peoples R China |
Keywords | Natural language processing; Sequence labeling; Search-based learning; Convergence |
Issue Date | 2019 |
Publisher | INFORMATION SCIENCES |
Abstract | There are two major approaches to sequence labeling. The first comprises probabilistic gradient-based methods such as conditional random fields (CRF) and neural networks (e.g., RNN), which achieve high accuracy but have drawbacks: slow training and no support for search-based optimization (which is important in many cases). The second comprises search-based learning methods such as the structured perceptron and the margin infused relaxed algorithm (MIRA), which train quickly but also have drawbacks: low accuracy, no probabilistic information, and non-convergence in real-world tasks. We propose a novel and "easy" solution, a search-based probabilistic online learning method, to address most of these issues. The method is "easy" because the optimization algorithm at the training stage is as simple as the decoding algorithm at the test stage. It searches the output candidates, derives probabilities, and conducts efficient online learning. We show that this method, which is easy to implement, offers fast training and a theoretical guarantee of convergence, supports search-based optimization, and obtains top accuracy. Experiments on well-known tasks show that our method achieves better accuracy than CRF and BiLSTM. (C) 2018 Elsevier Inc. All rights reserved. |
URI | http://hdl.handle.net/20.500.11897/549708 |
ISSN | 0020-0255 |
DOI | 10.1016/j.ins.2018.11.025 |
Indexed | SCI(E); EI |
Appears in Collections: | School of Electronics Engineering and Computer Science; MOE Key Laboratory of Computational Linguistics |