Authors: Chao Chen, Yue Zhao
Published in: 2015 12th Web Information System and Application Conference (WISA), 2015-09-11
DOI: https://doi.org/10.1109/WISA.2015.61
Research on Column Concept Vector Based Web Table Matching
The Web contains a huge amount of structured data in the form of tables, which makes it possible to automatically integrate information from the tables that interest ordinary users. A key problem in web table integration is discovering correspondences between web table columns. Most traditional schema matching techniques do not work well on web tables, because these tables lack schema information and contain only a small number of instances. This paper presents a web table matching method based on column concept vectors. A Column Heading Matcher and an Instance Matcher are employed to improve matching accuracy. Experiments on real-world web tables demonstrate that our method achieves higher precision and accuracy.
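The abstract does not say how the column concept vector is built or how the two matchers are combined, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes a concept vector is a count of concepts attached to a column's cell values through a knowledge base (here a hypothetical toy dictionary, `TOY_KB`), that concept vectors are compared by cosine similarity, and that the heading and instance matchers are simple Jaccard overlaps blended with illustrative weights. All function names and weights are hypothetical.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy knowledge base: cell value -> concepts it may belong to.
# A real system would query a much larger resource; this is an assumption.
TOY_KB = {
    "beijing": ["city", "capital"],
    "paris": ["city", "capital"],
    "shanghai": ["city"],
    "china": ["country"],
    "france": ["country"],
}


def concept_vector(values):
    """Count how often each concept is attached to the column's cell values."""
    counts = Counter()
    for value in values:
        counts.update(TOY_KB.get(value.strip().lower(), []))
    return counts


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as Counters."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(x * x for x in a.values())) * sqrt(sum(x * x for x in b.values()))
    return dot / norm if norm else 0.0


def jaccard(xs, ys):
    """Jaccard overlap of two collections, 0.0 when both are empty."""
    xs, ys = set(xs), set(ys)
    union = xs | ys
    return len(xs & ys) / len(union) if union else 0.0


def heading_score(h1, h2):
    """Assumed heading matcher: token overlap between the two column headings."""
    return jaccard(h1.lower().split(), h2.lower().split())


def instance_score(c1, c2):
    """Assumed instance matcher: overlap between the normalized cell values."""
    return jaccard((v.strip().lower() for v in c1), (v.strip().lower() for v in c2))


def match_score(h1, c1, h2, c2, weights=(0.5, 0.25, 0.25)):
    """Blend the three signals; the weights are illustrative, not from the paper."""
    w_concept, w_heading, w_instance = weights
    return (
        w_concept * cosine(concept_vector(c1), concept_vector(c2))
        + w_heading * heading_score(h1, h2)
        + w_instance * instance_score(c1, c2)
    )


cities_a = ["Beijing", "Paris"]
cities_b = ["Shanghai", "Paris"]
countries = ["China", "France"]

same_kind = match_score("City", cities_a, "City name", cities_b)
different_kind = match_score("City", cities_a, "Country", countries)
print(same_kind > different_kind)
```

In this sketch the two city columns share only one cell value, yet they still score well because their concept vectors point in nearly the same direction. That is the motivation the abstract gives for going beyond plain instance overlap when tables contain few instances.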