{"title":"Voice-based user interface for hands-free data entry and automation at workplaces","authors":"Daiwiik Harihar, Vedansh Shrivastava, Pratvina Talele, Aditi Jahagirdar","doi":"10.1016/j.mex.2025.103596","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing demand for hands-free interaction in modern workplaces has led to the development of Voice-Based User Interfaces (VUIs) that enhance accessibility, efficiency and automation. This research presents the Voice-Based User Interface for Hands-Free Data Entry and Automation at Workplaces. The system enables real-time speech-to-text transcription, allowing users to interact with workplace applications without manual input, making it intuitive, user-friendly and capable of enhancing efficiency and convenience in various workplace scenarios. Through extensive testing and evaluation, the study demonstrates the practicality and benefits of the Voice-Based User Interface for hands-free data entry and automation.</div><div><strong>Methodology Overview</strong>:<ul><li><span>•</span><span><div>Utilized WIT.AI API for speech-to-text transcription.</div></span></li><li><span>•</span><span><div>Implemented chunking, caching, and concurrency control to optimize processing.</div></span></li><li><span>•</span><span><div>Evaluated performance using Word Error Rate (WER), Levenshtein Distance and Cosine Similarity on real world datasets.</div></span></li></ul>The system proves to be upto 88.8% accurate in recognizing spoken commands and efficiently converting them into text with best performance achieved when the audio was divided into 7 optimal chunks. Cosine Similarity for these chunks is more accurate than that of sizeable file and approximately 2. Moreover, the integration of real-time updates across different domains (educational, legal, medical) and data synchronization enhances productivity and usability. In conclusion, the Voice-Based User Interface offers a viable solution for hands-free data entry and automation at workplaces.</div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"15 ","pages":"Article 103596"},"PeriodicalIF":1.9000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125004406","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The increasing demand for hands-free interaction in modern workplaces has led to the development of Voice-Based User Interfaces (VUIs) that enhance accessibility, efficiency and automation. This research presents the Voice-Based User Interface for Hands-Free Data Entry and Automation at Workplaces. The system enables real-time speech-to-text transcription, allowing users to interact with workplace applications without manual input, making it intuitive, user-friendly and capable of enhancing efficiency and convenience in various workplace scenarios. Through extensive testing and evaluation, the study demonstrates the practicality and benefits of the Voice-Based User Interface for hands-free data entry and automation.
Methodology Overview:
•
Utilized WIT.AI API for speech-to-text transcription.
•
Implemented chunking, caching, and concurrency control to optimize processing.
•
Evaluated performance using Word Error Rate (WER), Levenshtein Distance and Cosine Similarity on real world datasets.
The system proves to be upto 88.8% accurate in recognizing spoken commands and efficiently converting them into text with best performance achieved when the audio was divided into 7 optimal chunks. Cosine Similarity for these chunks is more accurate than that of sizeable file and approximately 2. Moreover, the integration of real-time updates across different domains (educational, legal, medical) and data synchronization enhances productivity and usability. In conclusion, the Voice-Based User Interface offers a viable solution for hands-free data entry and automation at workplaces.