Scene classification using multi-scale deeply described visual words
Published in International Journal of Remote Sensing, 2016
Recommended citation: Wenzhi Zhao, et al. (2016). "Scene classification using multi-scale deeply described visual words." International Journal of Remote Sensing. 10(7). https://www.tandfonline.com/doi/abs/10.1080/01431161.2016.1207266
This article presents a deep learning-based Multi-scale Bag-of-Visual Words (MBVW) representation for scene classification of high-resolution aerial imagery. Specifically, the convolutional neural network (CNN) is introduced to learn and characterize the complex local spatial patterns at different scales. Then, the learnt deep features are exploited in a novel way to generate visual words. Moreover, the MBVW representation is constructed using the statistics of the visual word co-occurrences at different scales, which are derived from a training data set.
Recommended citation: Wenzhi Zhao, et al. (2016). “Scene classification using multi-scale deeply described visual words” International Journal of Remote Sensing. 10(7).