{"title":"From Audio to Animated Signs","authors":"Xinfeng Ye, Zhongling Tang, S. Manoharan","doi":"10.1109/iceee55327.2022.9772564","DOIUrl":null,"url":null,"abstract":"Nearly 5% of the world population is hearing-impaired and uses a sign language as their primary mode of communication. Unfortunately, sign languages are not commonly understood, and this poses a great communication barrier between the hearing-impaired and the rest. This paper builds, based on prior wok, a novel approach that breaks the barrier one-way with accessibility and accuracy as the key objectives. It achieves accessibility using mobile devices for user-facing interactions and accuracy using a transformer model. The system uses a four-stage pipeline to enable one-way communication between hearing-gifted and hearing-impaired: audio capture, audio-to-text conversion, text-to-gloss transliteration, and gloss animation. The user-facing first and last stages of the pipeline are implemented on mobile devices to ensure wide accessibility, while the computationally intensive middle stages are implemented on cloud servers. Empirical evaluations show that the approach has a high accuracy.","PeriodicalId":375340,"journal":{"name":"2022 9th International Conference on Electrical and Electronics Engineering (ICEEE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Electrical and Electronics Engineering (ICEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iceee55327.2022.9772564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Nearly 5% of the world population is hearing-impaired and uses a sign language as their primary mode of communication. Unfortunately, sign languages are not commonly understood, and this poses a great communication barrier between the hearing-impaired and the rest. This paper builds, based on prior wok, a novel approach that breaks the barrier one-way with accessibility and accuracy as the key objectives. It achieves accessibility using mobile devices for user-facing interactions and accuracy using a transformer model. The system uses a four-stage pipeline to enable one-way communication between hearing-gifted and hearing-impaired: audio capture, audio-to-text conversion, text-to-gloss transliteration, and gloss animation. The user-facing first and last stages of the pipeline are implemented on mobile devices to ensure wide accessibility, while the computationally intensive middle stages are implemented on cloud servers. Empirical evaluations show that the approach has a high accuracy.