Title | RelationNet: Learning Deep-Aligned Representation for Semantic Image Segmentation |
Authors | Zhuang, Yueqing Tao, Li Yang, Fan Ma, Cong Zhang, Ziwei Jia, Huizhu Xie, Xiaodong |
Affiliation | Peking Univ, Dept EECS, Natl Engn Lab Video Technol, 5 Yiheyuan Rd, Beijing 100871, Peoples R China. Peking Univ, Beijing, Peoples R China. Cooperat Medianet Innovat Ctr, Beijing, Peoples R China. Beida Informat Res, Beijing, Peoples R China. Peking Univ, Dept EECS, Natl Engn Lab Video Technol, 5 Yiheyuan Rd, Beijing 100871, Peoples R China. Jia, HZ (reprint author), Peking Univ, Beijing, Peoples R China. |
Issue Date | 2018 |
Publisher | 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) |
Citation | 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR). 2018, 1506-1511. |
Abstract | Semantic image segmentation, which assigns labels in pixel level, plays a central role in image understanding. Recent approaches have attempted to harness the capabilities of deep learning. However, one central problem of these methods is that deep convolutional neural network gives little consideration to the correlation among pixels. To handle this issue, in this paper, we propose a novel deep neural network named RelationNet, which utilizes CNN and RNN to aggregate context information. Besides, a spatial correlation loss is applied to train RelationNet to align features of spatial pixels belonging to same category. Importantly, since it is expensive to obtain pixel-wise annotations, we exploit a new training method to combine the coarsely and finely labeled data. Experiments show the detailed improvements of each proposal. Experimental results demonstrate the effectiveness of our proposed method to the problem of semantic image segmentation, which obtains state-of-the-art performance on the Cityscapes benchmark and Pascal Context dataset. |
URI | http://hdl.handle.net/20.500.11897/571612 |
ISSN | 1051-4651 |
Indexed | CPCI-S(ISTP) |
Appears in Collections: | 信息科学技术学院 |