{"title":"Voxento 2.0: A Prototype Voice-controlled Interactive Search Engine for Lifelogs","authors":"Ahmed Alateeq, M. Roantree, C. Gurrin","doi":"10.1145/3463948.3469071","DOIUrl":null,"url":null,"abstract":"In this paper, we describe an extended version of Voxento which is an interactive voice-based retrieval system for lifelogs that has been developed to participate in the fourth Lifelog Search Challenge LSC'21, at ACM ICMR'21. Voxento provides a spoken interface to the lifelog dataset, which facilitates a novice user to interact with a personal lifelog using a range of vocal commands and interactions. For the version presented here, Voxento has been enhanced with new retrieval features and better user interaction support. In this paper, we introduce these new features, which include dynamic result filtering, predefined interactive responses and the development of a new retrieval API. Although Voxento was proposed for wearable technologies such as Google Glass or interactive devices like smart TVs, the version of Voxento presented here uses a desktop computer in order to participate in the LSC'21 competition. In the current Voxento iteration, the user has the option to enable voice interaction or use standard text-based retrieval.","PeriodicalId":150532,"journal":{"name":"Proceedings of the 4th Annual on Lifelog Search Challenge","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th Annual on Lifelog Search Challenge","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3463948.3469071","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
In this paper, we describe an extended version of Voxento which is an interactive voice-based retrieval system for lifelogs that has been developed to participate in the fourth Lifelog Search Challenge LSC'21, at ACM ICMR'21. Voxento provides a spoken interface to the lifelog dataset, which facilitates a novice user to interact with a personal lifelog using a range of vocal commands and interactions. For the version presented here, Voxento has been enhanced with new retrieval features and better user interaction support. In this paper, we introduce these new features, which include dynamic result filtering, predefined interactive responses and the development of a new retrieval API. Although Voxento was proposed for wearable technologies such as Google Glass or interactive devices like smart TVs, the version of Voxento presented here uses a desktop computer in order to participate in the LSC'21 competition. In the current Voxento iteration, the user has the option to enable voice interaction or use standard text-based retrieval.