Authors: Andias Wira-Alam, A. Kempf, Benjamin Zapilko
Venue: Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries, pp. 457-458
Published: 2014-09-08
DOI: 10.1109/JCDL.2014.6970223 (https://doi.org/10.1109/JCDL.2014.6970223)
Linking the Thesaurus for the Social Sciences to the Web of Linked Data
In this paper, we apply different methods for linking subject headings of the Thesaurus for the Social Sciences (TheSoz) to DBpedia, the nucleus of the Web of Linked Data, which is derived from the structured information in Wikipedia. Our method uses the backlinks and outlinks within Wikipedia for link detection. We examine to what extent the linking process can be optimized with a network-based similarity measure in order to achieve higher precision and recall. We test two baseline methods, string alignment and language property matching, and compare them to our own method. Our method outperforms the F-scores of the baselines by 10 percentage points.
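The abstract does not specify the network-based similarity measure, so the following is only a minimal sketch of how link-based candidate selection might look. It assumes Jaccard overlap between a candidate page's link neighborhood (backlinks plus outlinks) and a set of context pages already associated with related headings. The function names, the threshold, and the toy link data are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Optional, Set, Tuple

LinkMap = Dict[str, Set[str]]


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard overlap of two sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def link_neighborhood(page: str, backlinks: LinkMap, outlinks: LinkMap) -> Set[str]:
    """All pages that link to `page` or that `page` links to."""
    return backlinks.get(page, set()) | outlinks.get(page, set())


def best_candidate(
    context: Set[str],
    candidates: List[str],
    backlinks: LinkMap,
    outlinks: LinkMap,
    threshold: float = 0.1,  # assumed cut-off, not from the paper
) -> Optional[Tuple[str, float]]:
    """Return the candidate whose neighborhood best overlaps the context."""
    scored = [
        (page, jaccard(link_neighborhood(page, backlinks, outlinks), context))
        for page in candidates
    ]
    if not scored:
        return None
    page, score = max(scored, key=lambda item: item[1])
    return (page, score) if score >= threshold else None


def f_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (F1)."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Toy link data (invented for illustration only).
backlinks: LinkMap = {
    "Migration_(human)": {"Demography", "Refugee"},
    "Bird_migration": {"Ornithology"},
}
outlinks: LinkMap = {
    "Migration_(human)": {"Immigration", "Sociology"},
    "Bird_migration": {"Bird", "Season"},
}
# Pages assumed to be already linked to headings related to "migration".
context = {"Sociology", "Demography", "Immigration"}

result = best_candidate(
    context, ["Migration_(human)", "Bird_migration"], backlinks, outlinks
)
print(result)
```

In this toy setting, a string-only match cannot distinguish the two candidate pages for the label "migration", whereas the link overlap favors the page embedded in social-science topics. The `f_score` helper shows the standard F1 computation that a comparison against baselines of this kind would typically rely on.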