{"title":"Tell me what","authors":"Xiansheng Hua, Jin Li","doi":"10.1109/ICMEW.2014.6890616","DOIUrl":null,"url":null,"abstract":"“Tell Me What” is smart phone based image recognition system, and it is also an automatic pipeline for generating image recognition systems to recognize an arbitrary set of entities. For any given set of entities, “Tell Me What” backend system automatically fetches related image data from the Internet for each entity, and then run a comprehensive data cleaning process to purify the data. A multi-class classifier and inverted index are then built based on the cleaned data. For an unknown new image captured by a camera, the user is allowed to optionally highlight regions and then a classification process and a search process are applied to get recognition results. Distributed computing techniques are applied to ensure that the backend model and index generation processes can be done in a few hours.","PeriodicalId":178700,"journal":{"name":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMEW.2014.6890616","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
“Tell Me What” is smart phone based image recognition system, and it is also an automatic pipeline for generating image recognition systems to recognize an arbitrary set of entities. For any given set of entities, “Tell Me What” backend system automatically fetches related image data from the Internet for each entity, and then run a comprehensive data cleaning process to purify the data. A multi-class classifier and inverted index are then built based on the cleaned data. For an unknown new image captured by a camera, the user is allowed to optionally highlight regions and then a classification process and a search process are applied to get recognition results. Distributed computing techniques are applied to ensure that the backend model and index generation processes can be done in a few hours.
“Tell Me What”是基于智能手机的图像识别系统,也是生成图像识别系统以识别任意一组实体的自动流水线。对于任意给定的一组实体,“Tell Me What”后端系统自动从互联网上获取每个实体的相关图像数据,然后运行全面的数据清洗过程,对数据进行净化。然后基于清理后的数据构建多类分类器和倒排索引。对于相机捕捉到的未知新图像,允许用户选择性地突出显示区域,然后应用分类过程和搜索过程来获得识别结果。应用分布式计算技术确保后端模型和索引生成过程可以在几个小时内完成。