{"title":"从手工到机器","authors":"Meng Guo, Lili Han","doi":"10.1075/intp.00100.guo","DOIUrl":null,"url":null,"abstract":"\n This study introduces a groundbreaking automated methodology for measuring ear–voice span (EVS) in simultaneous\n interpreting (SI). Traditionally, assessing EVS – a critical temporal metric in SI – has been hampered by labour-intensive and\n time-consuming manual methods that are prone to inconsistency. To overcome these challenges, our research harnesses\n state-of-the-art natural language processing (NLP) technologies, including automatic speech recognition (ASR), sentence boundary\n detection (SBD) and cross-lingual alignment, to automate EVS measurement. We deployed a comprehensive array of NLP models and\n evaluated the automated pipelines on a 20-hour English-to-Portuguese SI corpus which featured 57 varied audio pairings. The\n findings are encouraging: the most effective model combination achieved a median EVS error of less than 0.1 seconds across the\n corpus. Moreover, the automated pipelines exhibited a high level of accuracy, strong correlation and substantial agreement with\n manual measurements when assessing median EVS for individual audio pairs. Despite these satisfactory results, certain challenges\n persist with some NLP models, indicating clear avenues for future research. This study not only introduces a groundbreaking\n approach to large-scale EVS measurement but also propels the automation of process analysis in Interpreting Studies.","PeriodicalId":512697,"journal":{"name":"Interpreting. International Journal of Research and Practice in Interpreting","volume":"207 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From manual to machine\",\"authors\":\"Meng Guo, Lili Han\",\"doi\":\"10.1075/intp.00100.guo\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This study introduces a groundbreaking automated methodology for measuring ear–voice span (EVS) in simultaneous\\n interpreting (SI). Traditionally, assessing EVS – a critical temporal metric in SI – has been hampered by labour-intensive and\\n time-consuming manual methods that are prone to inconsistency. To overcome these challenges, our research harnesses\\n state-of-the-art natural language processing (NLP) technologies, including automatic speech recognition (ASR), sentence boundary\\n detection (SBD) and cross-lingual alignment, to automate EVS measurement. We deployed a comprehensive array of NLP models and\\n evaluated the automated pipelines on a 20-hour English-to-Portuguese SI corpus which featured 57 varied audio pairings. The\\n findings are encouraging: the most effective model combination achieved a median EVS error of less than 0.1 seconds across the\\n corpus. Moreover, the automated pipelines exhibited a high level of accuracy, strong correlation and substantial agreement with\\n manual measurements when assessing median EVS for individual audio pairs. Despite these satisfactory results, certain challenges\\n persist with some NLP models, indicating clear avenues for future research. This study not only introduces a groundbreaking\\n approach to large-scale EVS measurement but also propels the automation of process analysis in Interpreting Studies.\",\"PeriodicalId\":512697,\"journal\":{\"name\":\"Interpreting. International Journal of Research and Practice in Interpreting\",\"volume\":\"207 \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Interpreting. International Journal of Research and Practice in Interpreting\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1075/intp.00100.guo\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Interpreting. International Journal of Research and Practice in Interpreting","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1075/intp.00100.guo","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This study introduces a groundbreaking automated methodology for measuring ear–voice span (EVS) in simultaneous
interpreting (SI). Traditionally, assessing EVS – a critical temporal metric in SI – has been hampered by labour-intensive and
time-consuming manual methods that are prone to inconsistency. To overcome these challenges, our research harnesses
state-of-the-art natural language processing (NLP) technologies, including automatic speech recognition (ASR), sentence boundary
detection (SBD) and cross-lingual alignment, to automate EVS measurement. We deployed a comprehensive array of NLP models and
evaluated the automated pipelines on a 20-hour English-to-Portuguese SI corpus which featured 57 varied audio pairings. The
findings are encouraging: the most effective model combination achieved a median EVS error of less than 0.1 seconds across the
corpus. Moreover, the automated pipelines exhibited a high level of accuracy, strong correlation and substantial agreement with
manual measurements when assessing median EVS for individual audio pairs. Despite these satisfactory results, certain challenges
persist with some NLP models, indicating clear avenues for future research. This study not only introduces a groundbreaking
approach to large-scale EVS measurement but also propels the automation of process analysis in Interpreting Studies.