A Computer Vision Framework for Automatic Description of Indian Monuments
Pushkar Shukla, Beena Rautela, A. Mittal
2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS), 2017-12-01
DOI: 10.1109/SITIS.2017.29 (https://doi.org/10.1109/SITIS.2017.29)
Citations: 9
Abstract
Monument recognition and description has emerged as a promising area of research. Given an image of a monument, to what extent can a computer model describe the monument from that image? The main objective of this paper is to propose a framework capable of identifying multiple attributes from a single image of a monument. Four attributes are considered: the class of the monument, its architectural style, the time period in which it was constructed, and the type of the monument. The proposed framework relies on Deep Convolutional Neural Networks (DCNN) to describe the monument in terms of these attributes. Experiments were performed on a dataset comprising 6102 images of 117 Indian monuments. The model achieved an accuracy greater than 80% in all sets of experiments, which indicates the usefulness of the framework.
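To make the idea of describing one image by four attributes concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single feature vector (a stand-in for DCNN features) is passed to four independent softmax classifiers, one per attribute; the abstract does not state whether the paper uses one shared network or separate networks. The label names, the feature size, and the helper names `make_head` and `describe` are hypothetical, and the weights are random rather than trained.

```python
import math
import random

# Hypothetical label sets: the abstract names the four attributes
# but does not list their possible values.
ATTRIBUTE_LABELS = {
    "monument_class": ["monument_a", "monument_b", "monument_c"],
    "architectural_style": ["style_a", "style_b"],
    "time_period": ["period_a", "period_b", "period_c"],
    "monument_type": ["type_a", "type_b"],
}


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_head(n_features, n_labels, rng):
    """Create an untrained linear classifier: one weight row per label."""
    return [[rng.uniform(-1, 1) for _ in range(n_features)]
            for _ in range(n_labels)]


def describe(features, heads):
    """Predict one label (with its probability) for every attribute."""
    description = {}
    for attribute, weights in heads.items():
        scores = [sum(w * x for w, x in zip(row, features))
                  for row in weights]
        probs = softmax(scores)
        best = max(range(len(probs)), key=probs.__getitem__)
        description[attribute] = (ATTRIBUTE_LABELS[attribute][best],
                                  probs[best])
    return description


rng = random.Random(0)
N_FEATURES = 8  # assumed size; stands in for a DCNN feature vector
heads = {name: make_head(N_FEATURES, len(labels), rng)
         for name, labels in ATTRIBUTE_LABELS.items()}
features = [rng.uniform(0, 1) for _ in range(N_FEATURES)]

description = describe(features, heads)
for attribute, (label, confidence) in description.items():
    print(f"{attribute}: {label} ({confidence:.2f})")
```

In a real system the random `features` would be replaced by the output of a trained convolutional network, and each head would be trained on labeled monument images. Accuracy could then be reported separately per attribute, as the abstract does.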