Ángel Castellanos, E. D. De Luca, Juan Cigarán, A. G. Serrano
{"title":"将数据网络的部分资源压缩到应用程序中","authors":"Ángel Castellanos, E. D. De Luca, Juan Cigarán, A. G. Serrano","doi":"10.1109/INTELLISYS.2017.8324319","DOIUrl":null,"url":null,"abstract":"The Web of Data (WOD) contains a large amount of formalized and interconnected data, offering a valuable help for experimental tasks requiring an accurate data representation. However, the practical application of such data is often limited by the complexity when it comes to extracting the necessary information, mainly because of the lack of a proper structure and organization in the WOD-resources. The (re)organization of the knowledge contained in these resources might facilitate the identification of the necessary information and, consequently, limit the problems arising in their practical application. In this context, this paper proposes the application of Formal Concept Analysis (FCA) to create a concept-based abstraction that better organizes the knowledge contained in the WOD-resources. In order to test, to what extent this enhanced organization is able to improve the data representation process, the obtained FCA models will be tested in a practical application to represent a set of Twitter contents in a specific task: the Topic Detection task at Replab 2013. The results demonstrate that the better data representation obtained through FCA improves the operation of the topic detection process, outperforming state-of-the-art results.","PeriodicalId":131825,"journal":{"name":"2017 Intelligent Systems Conference (IntelliSys)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Partially squeezing the resources of the web of data towards applications\",\"authors\":\"Ángel Castellanos, E. D. De Luca, Juan Cigarán, A. G. Serrano\",\"doi\":\"10.1109/INTELLISYS.2017.8324319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Web of Data (WOD) contains a large amount of formalized and interconnected data, offering a valuable help for experimental tasks requiring an accurate data representation. However, the practical application of such data is often limited by the complexity when it comes to extracting the necessary information, mainly because of the lack of a proper structure and organization in the WOD-resources. The (re)organization of the knowledge contained in these resources might facilitate the identification of the necessary information and, consequently, limit the problems arising in their practical application. In this context, this paper proposes the application of Formal Concept Analysis (FCA) to create a concept-based abstraction that better organizes the knowledge contained in the WOD-resources. In order to test, to what extent this enhanced organization is able to improve the data representation process, the obtained FCA models will be tested in a practical application to represent a set of Twitter contents in a specific task: the Topic Detection task at Replab 2013. The results demonstrate that the better data representation obtained through FCA improves the operation of the topic detection process, outperforming state-of-the-art results.\",\"PeriodicalId\":131825,\"journal\":{\"name\":\"2017 Intelligent Systems Conference (IntelliSys)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Intelligent Systems Conference (IntelliSys)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INTELLISYS.2017.8324319\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Intelligent Systems Conference (IntelliSys)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INTELLISYS.2017.8324319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
数据网络(Web of Data, WOD)包含了大量形式化和相互关联的数据,为需要精确数据表示的实验任务提供了宝贵的帮助。但是,这些数据的实际应用往往由于提取必要信息的复杂性而受到限制,这主要是因为世界资源缺乏适当的结构和组织。对这些资源中包含的知识进行(重新)组织可能有助于识别必要的信息,从而限制在实际应用中出现的问题。在此背景下,本文提出应用形式概念分析(Formal Concept Analysis, FCA)来创建一个基于概念的抽象,从而更好地组织世界资源中包含的知识。为了测试这种增强的组织能够在多大程度上改进数据表示过程,将在实际应用中测试获得的FCA模型,以在特定任务中表示一组Twitter内容:Replab 2013的主题检测任务。结果表明,通过FCA获得的更好的数据表示改善了主题检测过程的操作,优于目前的结果。
Partially squeezing the resources of the web of data towards applications
The Web of Data (WOD) contains a large amount of formalized and interconnected data, offering a valuable help for experimental tasks requiring an accurate data representation. However, the practical application of such data is often limited by the complexity when it comes to extracting the necessary information, mainly because of the lack of a proper structure and organization in the WOD-resources. The (re)organization of the knowledge contained in these resources might facilitate the identification of the necessary information and, consequently, limit the problems arising in their practical application. In this context, this paper proposes the application of Formal Concept Analysis (FCA) to create a concept-based abstraction that better organizes the knowledge contained in the WOD-resources. In order to test, to what extent this enhanced organization is able to improve the data representation process, the obtained FCA models will be tested in a practical application to represent a set of Twitter contents in a specific task: the Topic Detection task at Replab 2013. The results demonstrate that the better data representation obtained through FCA improves the operation of the topic detection process, outperforming state-of-the-art results.