{"title":"台湾历史人物文本检索与挖掘系统之开发","authors":"S. Sie, Hao-Ren Ke, Su-bing Chang","doi":"10.23919/PNC.2017.8203522","DOIUrl":null,"url":null,"abstract":"Personage is an important kind of entities in study of history. Comprehensive understanding of personage biographies is beneficial for researching into historical events. This article introduces the development of a text retrieval and mining system for Taiwanese historical people — Taiwan Biographical Database (TBDB). It describes the characteristics of personages in TBDB, highlights the system architecture and preliminary achievement of TBDB, and proposes a method to recognize named entities in the personage biographies, specifically poetry societies, which achieves the recall rate 96% and the precision rate 65%. Finally, this article elaborates on the lessons learned through the creation of TBDB, and the future plans.","PeriodicalId":325096,"journal":{"name":"2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings (PNC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Development of a text retrieval and mining system for Taiwanese historical people\",\"authors\":\"S. Sie, Hao-Ren Ke, Su-bing Chang\",\"doi\":\"10.23919/PNC.2017.8203522\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Personage is an important kind of entities in study of history. Comprehensive understanding of personage biographies is beneficial for researching into historical events. This article introduces the development of a text retrieval and mining system for Taiwanese historical people — Taiwan Biographical Database (TBDB). It describes the characteristics of personages in TBDB, highlights the system architecture and preliminary achievement of TBDB, and proposes a method to recognize named entities in the personage biographies, specifically poetry societies, which achieves the recall rate 96% and the precision rate 65%. Finally, this article elaborates on the lessons learned through the creation of TBDB, and the future plans.\",\"PeriodicalId\":325096,\"journal\":{\"name\":\"2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings (PNC)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings (PNC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.23919/PNC.2017.8203522\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Pacific Neighborhood Consortium Annual Conference and Joint Meetings (PNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/PNC.2017.8203522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Development of a text retrieval and mining system for Taiwanese historical people
Personage is an important kind of entities in study of history. Comprehensive understanding of personage biographies is beneficial for researching into historical events. This article introduces the development of a text retrieval and mining system for Taiwanese historical people — Taiwan Biographical Database (TBDB). It describes the characteristics of personages in TBDB, highlights the system architecture and preliminary achievement of TBDB, and proposes a method to recognize named entities in the personage biographies, specifically poetry societies, which achieves the recall rate 96% and the precision rate 65%. Finally, this article elaborates on the lessons learned through the creation of TBDB, and the future plans.