{"title":"Automatic content segmentation of audio recordings at multidisciplinary medical team meetings","authors":"Jing Su, B. Kane, S. Luz","doi":"10.1109/INFTECH.2008.4621647","DOIUrl":null,"url":null,"abstract":"A single recording of a multidisciplinary medical team meeting (MDTM) can be expected to contain several separate discussions on different patients. Automatic speaker segmentation alone does not allow for the separation of individual patient case discussions (PCDs). A novel method is presented here, based on Hidden Markov Models (HMM), to segment audio recordings of MDTMs and facilitate the non-linear retrieval of individual PCDs. The method combines professional role interaction with speaker vocalization patterns. The sequence and duration of vocalization and speakerspsila roles are used as training states. Results demonstrate HMM segmentation to have good potential in the development of an MDTM browser. The approach outlined here can be applied in a wide range of meetings.","PeriodicalId":247264,"journal":{"name":"2008 1st International Conference on Information Technology","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 1st International Conference on Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INFTECH.2008.4621647","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
A single recording of a multidisciplinary medical team meeting (MDTM) can be expected to contain several separate discussions on different patients. Automatic speaker segmentation alone does not allow for the separation of individual patient case discussions (PCDs). A novel method is presented here, based on Hidden Markov Models (HMM), to segment audio recordings of MDTMs and facilitate the non-linear retrieval of individual PCDs. The method combines professional role interaction with speaker vocalization patterns. The sequence and duration of vocalization and speakerspsila roles are used as training states. Results demonstrate HMM segmentation to have good potential in the development of an MDTM browser. The approach outlined here can be applied in a wide range of meetings.