Map Image Classification (2020) - Katsurai Laboratory / Doshisha University

A Deep Multimodal Approach for Map Image Classification

Abstract: A large number of map images are available on the Internet, creating a growing need for efficient management, retrieval, and recommendation of such data. In this study, we train a classification model that automatically categorizes map images by theme, leveraging both textual features extracted via OCR and visual features from the images. We also release the labeled dataset constructed for our experiments to facilitate further research in this area.

Authors: Tomoya Sawada, Marie Katsurai

Publication venue: ICASSP2020

Labeled dataset

CSV

Reference

T. Sawada and M. Katsurai, “A Deep Multimodal Approach for Map Image Classification,” in 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 4457–4461, 2020. doi: 10.1109/IC ASSP40776.2020.9054767. PDF

multimedia