{"title":"视频制作优化软件工具","authors":"R. F. Davletshin, I. S. Shakhova","doi":"10.3103/S0005105525700268","DOIUrl":null,"url":null,"abstract":"<p>This paper proposes software mechanisms intended to enhance video production for the authors of artistic video materials. We propose a mechanism for creating animated three-dimensional shooting plans (storyboards) with the use of augmented reality to position and animate actors’ movements. To overcome the limitations of the iOS operating system related to access to sensors, we developed a mechanism for separately capturing audio and video streams from device sensors for recording and subsequent synchronization using timestamps for saving to device memory. Computer vision technologies are used to ensure compliance with the rules of compositional construction and image quality analysis. This paper also presents mechanisms for working with the script, including text-processing algorithms to display subtitles on the screen and speech recognition algorithms to compare actors’ speech recognition of actors with the text of the script.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 4 supplement","pages":"S192 - S201"},"PeriodicalIF":0.5000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Software Tool for Video-Production Optimization\",\"authors\":\"R. F. Davletshin, I. S. Shakhova\",\"doi\":\"10.3103/S0005105525700268\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This paper proposes software mechanisms intended to enhance video production for the authors of artistic video materials. We propose a mechanism for creating animated three-dimensional shooting plans (storyboards) with the use of augmented reality to position and animate actors’ movements. To overcome the limitations of the iOS operating system related to access to sensors, we developed a mechanism for separately capturing audio and video streams from device sensors for recording and subsequent synchronization using timestamps for saving to device memory. Computer vision technologies are used to ensure compliance with the rules of compositional construction and image quality analysis. This paper also presents mechanisms for working with the script, including text-processing algorithms to display subtitles on the screen and speech recognition algorithms to compare actors’ speech recognition of actors with the text of the script.</p>\",\"PeriodicalId\":42995,\"journal\":{\"name\":\"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS\",\"volume\":\"58 4 supplement\",\"pages\":\"S192 - S201\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://link.springer.com/article/10.3103/S0005105525700268\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0005105525700268","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
This paper proposes software mechanisms intended to enhance video production for the authors of artistic video materials. We propose a mechanism for creating animated three-dimensional shooting plans (storyboards) with the use of augmented reality to position and animate actors’ movements. To overcome the limitations of the iOS operating system related to access to sensors, we developed a mechanism for separately capturing audio and video streams from device sensors for recording and subsequent synchronization using timestamps for saving to device memory. Computer vision technologies are used to ensure compliance with the rules of compositional construction and image quality analysis. This paper also presents mechanisms for working with the script, including text-processing algorithms to display subtitles on the screen and speech recognition algorithms to compare actors’ speech recognition of actors with the text of the script.
期刊介绍:
Automatic Documentation and Mathematical Linguistics is an international peer reviewed journal that covers all aspects of automation of information processes and systems, as well as algorithms and methods for automatic language analysis. Emphasis is on the practical applications of new technologies and techniques for information analysis and processing.