Quan Zhou, Pan Deng, Hongjian Liu, Defeng Guo, Kenji Nagamatsu
{"title":"基于关键词锚定和隐马尔可夫模型的汉语韵律词混合标注方法","authors":"Quan Zhou, Pan Deng, Hongjian Liu, Defeng Guo, Kenji Nagamatsu","doi":"10.1109/IALP.2009.24","DOIUrl":null,"url":null,"abstract":"In this paper, a new method of Chinese prosodic word tagging is presented. This method consists of a rule-based algorithm named “Keyword Anchor” and a statistical algorithm based on Hidden Markov Model (HMM). For keyword anchor algorithm, an anchor of the prosodic word is defined to help the system to find the whole prosodic word. For statistical algorithm, a length-based Hidden Markov Model (HMM) is used to find the best result of prosodic word tagging. The experiments of this method prove the better result than preceding methods in this field. The “Open Set F Score” of prosodic word based on this method is up to about 0.96.","PeriodicalId":156840,"journal":{"name":"2009 International Conference on Asian Language Processing","volume":"221 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Hybrid Method of Chinese Prosodic Word Tagging Based on Keyword Anchor and Hidden Markov Model\",\"authors\":\"Quan Zhou, Pan Deng, Hongjian Liu, Defeng Guo, Kenji Nagamatsu\",\"doi\":\"10.1109/IALP.2009.24\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, a new method of Chinese prosodic word tagging is presented. This method consists of a rule-based algorithm named “Keyword Anchor” and a statistical algorithm based on Hidden Markov Model (HMM). For keyword anchor algorithm, an anchor of the prosodic word is defined to help the system to find the whole prosodic word. For statistical algorithm, a length-based Hidden Markov Model (HMM) is used to find the best result of prosodic word tagging. The experiments of this method prove the better result than preceding methods in this field. The “Open Set F Score” of prosodic word based on this method is up to about 0.96.\",\"PeriodicalId\":156840,\"journal\":{\"name\":\"2009 International Conference on Asian Language Processing\",\"volume\":\"221 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 International Conference on Asian Language Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IALP.2009.24\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Conference on Asian Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IALP.2009.24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种新的汉语韵律词标注方法。该方法由基于规则的关键词锚算法和基于隐马尔可夫模型(HMM)的统计算法组成。关键词锚点算法定义韵律词的锚点,帮助系统找到整个韵律词。在统计算法中,使用基于长度的隐马尔可夫模型(HMM)来寻找韵律词标注的最佳结果。实验结果表明,该方法在该领域具有较好的应用效果。基于该方法的韵律词的“Open Set F Score”可达0.96左右。
A Hybrid Method of Chinese Prosodic Word Tagging Based on Keyword Anchor and Hidden Markov Model
In this paper, a new method of Chinese prosodic word tagging is presented. This method consists of a rule-based algorithm named “Keyword Anchor” and a statistical algorithm based on Hidden Markov Model (HMM). For keyword anchor algorithm, an anchor of the prosodic word is defined to help the system to find the whole prosodic word. For statistical algorithm, a length-based Hidden Markov Model (HMM) is used to find the best result of prosodic word tagging. The experiments of this method prove the better result than preceding methods in this field. The “Open Set F Score” of prosodic word based on this method is up to about 0.96.