{"title":"Automatic Title Generation for Spoken Broadcast News","authors":"Rong Jin, Alexander Hauptmann","doi":"10.3115/1072133.1072144","DOIUrl":null,"url":null,"abstract":"In this paper, we implemented a set of title generation methods using training set of 21190 news stories and evaluated them on an independent test corpus of 1006 broadcast news documents, comparing the results over manual transcription to the results over automatically recognized speech. We use both F1 and the average number of correct title words in the correct order as metric. Overall, the results show that title generation for speech recognized news documents is possible at a level approaching the accuracy of titles generated for perfect text transcriptions.","PeriodicalId":108911,"journal":{"name":"Proceedings of the first international conference on Human language technology research - HLT '01","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2001-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the first international conference on Human language technology research - HLT '01","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3115/1072133.1072144","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37
Abstract
In this paper, we implemented a set of title generation methods using training set of 21190 news stories and evaluated them on an independent test corpus of 1006 broadcast news documents, comparing the results over manual transcription to the results over automatically recognized speech. We use both F1 and the average number of correct title words in the correct order as metric. Overall, the results show that title generation for speech recognized news documents is possible at a level approaching the accuracy of titles generated for perfect text transcriptions.