Title | Scene classification using multi-scale deeply described visual words |
Authors | Zhao, Wenzhi Du, Shihong |
Affiliation | Peking Univ, Inst Remote Sensing & GIS, Beijing 100871, Peoples R China. |
Keywords | CONVOLUTIONAL NEURAL-NETWORKS REMOTE-SENSING IMAGERY URBAN AREAS FEATURES MODEL |
Issue Date | 2016 |
Publisher | INTERNATIONAL JOURNAL OF REMOTE SENSING |
Citation | INTERNATIONAL JOURNAL OF REMOTE SENSING.2016,37(17),4119-4131. |
Abstract | This article presents a deep learning-based Multi-scale Bag-of-Visual Words (MBVW) representation for scene classification of high-resolution aerial imagery. Specifically, the convolutional neural network (CNN) is introduced to learn and characterize the complex local spatial patterns at different scales. Then, the learnt deep features are exploited in a novel way to generate visual words. Moreover, the MBVW representation is constructed using the statistics of the visual word co-occurrences at different scales, which are derived from a training data set. We apply our technique to the challenging aerial scene data set: the University of California (UC) Merced data set consisting of 21 different aerial scene categories with sub-metre resolution. The experimental results show that the statistics of deeply described visual words can characterize the scene well and improve classification accuracy. It demonstrates that the proposed method is highly effective in the scene classification of high-resolution remote-sensing imagery. |
URI | http://hdl.handle.net/20.500.11897/492484 |
ISSN | 0143-1161 |
DOI | 10.1080/01431161.2016.1207266 |
Indexed | SCI(E) EI |
Appears in Collections: | 地球与空间科学学院 |