TitleScene classification using multi-scale deeply described visual words
AuthorsZhao, Wenzhi
Du, Shihong
AffiliationPeking Univ, Inst Remote Sensing & GIS, Beijing 100871, Peoples R China.
KeywordsCONVOLUTIONAL NEURAL-NETWORKS
REMOTE-SENSING IMAGERY
URBAN AREAS
FEATURES
MODEL
Issue Date2016
PublisherINTERNATIONAL JOURNAL OF REMOTE SENSING
CitationINTERNATIONAL JOURNAL OF REMOTE SENSING.2016,37(17),4119-4131.
AbstractThis article presents a deep learning-based Multi-scale Bag-of-Visual Words (MBVW) representation for scene classification of high-resolution aerial imagery. Specifically, the convolutional neural network (CNN) is introduced to learn and characterize the complex local spatial patterns at different scales. Then, the learnt deep features are exploited in a novel way to generate visual words. Moreover, the MBVW representation is constructed using the statistics of the visual word co-occurrences at different scales, which are derived from a training data set. We apply our technique to the challenging aerial scene data set: the University of California (UC) Merced data set consisting of 21 different aerial scene categories with sub-metre resolution. The experimental results show that the statistics of deeply described visual words can characterize the scene well and improve classification accuracy. It demonstrates that the proposed method is highly effective in the scene classification of high-resolution remote-sensing imagery.
URIhttp://hdl.handle.net/20.500.11897/492484
ISSN0143-1161
DOI10.1080/01431161.2016.1207266
IndexedSCI(E)
EI
Appears in Collections:地球与空间科学学院

Files in This Work
There are no files associated with this item.

Web of Science®



Checked on Last Week

Scopus®



Checked on Current Time

百度学术™



Checked on Current Time

Google Scholar™





License: See PKU IR operational policies.