Margaret Broeren, Yuzhe Gu, Mark Pitt, Virginia Tompkins
{"title":"使用Rev AI的自动语音转录的脚本和教程","authors":"Margaret Broeren, Yuzhe Gu, Mark Pitt, Virginia Tompkins","doi":"10.1002/icd.70007","DOIUrl":null,"url":null,"abstract":"<p>We introduce Speech Transcriber with Rev AI (STR) - a Python script that allows for easy interfacing with the Rev AI speech transcription service. Recent advancements in technology have led to increased accuracy and affordability of automatic transcription services, making them preferable over the laborious and time-consuming process of manual transcription. STR allows users to take advantage of speech-to-text transcription services to transcribe their own verbal response data. STR is partially tailored to child development researchers utilising the Codes for the Human Analysis of Transcripts (CHAT) though the code is generic enough to output unformatted transcriptions. STR allows transcription of single words and multi-speaker dialogues in 50+ languages. We describe STR, provide a tutorial for CHAT-formatted transcriptions, describe settings available for customising transcription and conduct a brief analysis of the efficiency and accuracy of transcription. Speech that was transcribed in over half an hour by trained transcribers was transcribed in less than two minutes (with ~90% accuracy) by Rev AI. Considering the additional time needed for error correction and CHAT formatting, we estimate that manual transcription takes twice as long as transcribing with assistance from STR.</p>","PeriodicalId":47820,"journal":{"name":"Infant and Child Development","volume":"34 2","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/icd.70007","citationCount":"0","resultStr":"{\"title\":\"A Script and Tutorial for Using Rev AI's Automatic Speech Transcription\",\"authors\":\"Margaret Broeren, Yuzhe Gu, Mark Pitt, Virginia Tompkins\",\"doi\":\"10.1002/icd.70007\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>We introduce Speech Transcriber with Rev AI (STR) - a Python script that allows for easy interfacing with the Rev AI speech transcription service. Recent advancements in technology have led to increased accuracy and affordability of automatic transcription services, making them preferable over the laborious and time-consuming process of manual transcription. STR allows users to take advantage of speech-to-text transcription services to transcribe their own verbal response data. STR is partially tailored to child development researchers utilising the Codes for the Human Analysis of Transcripts (CHAT) though the code is generic enough to output unformatted transcriptions. STR allows transcription of single words and multi-speaker dialogues in 50+ languages. We describe STR, provide a tutorial for CHAT-formatted transcriptions, describe settings available for customising transcription and conduct a brief analysis of the efficiency and accuracy of transcription. Speech that was transcribed in over half an hour by trained transcribers was transcribed in less than two minutes (with ~90% accuracy) by Rev AI. Considering the additional time needed for error correction and CHAT formatting, we estimate that manual transcription takes twice as long as transcribing with assistance from STR.</p>\",\"PeriodicalId\":47820,\"journal\":{\"name\":\"Infant and Child Development\",\"volume\":\"34 2\",\"pages\":\"\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/icd.70007\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Infant and Child Development\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/icd.70007\",\"RegionNum\":4,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"PSYCHOLOGY, DEVELOPMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infant and Child Development","FirstCategoryId":"102","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/icd.70007","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PSYCHOLOGY, DEVELOPMENTAL","Score":null,"Total":0}
引用次数: 0
摘要
我们介绍语音转录器与Rev AI (STR) -一个Python脚本,允许轻松接口与Rev AI语音转录服务。最近技术的进步提高了自动转录服务的准确性和可负担性,使它们优于人工转录的费力和耗时的过程。STR允许用户利用语音到文本的转录服务来转录他们自己的口头响应数据。STR部分是为使用人类转录分析代码(CHAT)的儿童发展研究人员量身定制的,尽管该代码足够通用,可以输出未格式化的转录。STR允许转录50多种语言的单字和多说话者对话。我们描述STR,提供chat格式的转录教程,描述可用于定制转录的设置,并对转录的效率和准确性进行简要分析。由训练有素的转录员花半个多小时转录的演讲,由Rev AI在不到两分钟的时间内转录(准确率约为90%)。考虑到错误纠正和CHAT格式化所需的额外时间,我们估计人工转录所需的时间是STR协助转录的两倍。
A Script and Tutorial for Using Rev AI's Automatic Speech Transcription
We introduce Speech Transcriber with Rev AI (STR) - a Python script that allows for easy interfacing with the Rev AI speech transcription service. Recent advancements in technology have led to increased accuracy and affordability of automatic transcription services, making them preferable over the laborious and time-consuming process of manual transcription. STR allows users to take advantage of speech-to-text transcription services to transcribe their own verbal response data. STR is partially tailored to child development researchers utilising the Codes for the Human Analysis of Transcripts (CHAT) though the code is generic enough to output unformatted transcriptions. STR allows transcription of single words and multi-speaker dialogues in 50+ languages. We describe STR, provide a tutorial for CHAT-formatted transcriptions, describe settings available for customising transcription and conduct a brief analysis of the efficiency and accuracy of transcription. Speech that was transcribed in over half an hour by trained transcribers was transcribed in less than two minutes (with ~90% accuracy) by Rev AI. Considering the additional time needed for error correction and CHAT formatting, we estimate that manual transcription takes twice as long as transcribing with assistance from STR.
期刊介绍:
Infant and Child Development publishes high quality empirical, theoretical and methodological papers addressing psychological development from the antenatal period through to adolescence. The journal brings together research on: - social and emotional development - perceptual and motor development - cognitive development - language development atypical development (including conduct problems, anxiety and depressive conditions, language impairments, autistic spectrum disorders, and attention-deficit/hyperactivity disorders)