{"title":"为有效的电子表格搜索提取语义元数据","authors":"S. Chatvichienchai","doi":"10.1109/ICSEC.2013.6694747","DOIUrl":null,"url":null,"abstract":"Metadata is an essential part of modern information system since it helps people to find relevant documents from disparate repositories. This paper describes my effort to develop a system that automatically generates semantic metadata from large, diverse, and evolving spreadsheet collections. Semantic metadata is known as metadata that describes contextually relevant or domain-specific information about content based on an industry-specific or enterprise-specific custom metadata model. In order to simplify semantic metadata generation problem, spreadsheet collections are categorized by layout similarity. A set of properties and semantic metadata extraction rules of a categorized spreadsheet collection is defined from a sample spreadsheet selected from the spreadsheet collection. Category of a given spreadsheet is justified by checking its properties with the property sets of registered collections. Semantic metadata generation of the given spreadsheet is based on semantic metadata extraction rules of the category to which the spreadsheet belongs. The hierarchical structure of semantic metadata of this paper enables end-users to define the meanings of search keywords. This capability allows end users to search the relevant spreadsheets efficiently.","PeriodicalId":191620,"journal":{"name":"2013 International Computer Science and Engineering Conference (ICSEC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Extracting semantic metadata for effective spreadsheet search\",\"authors\":\"S. Chatvichienchai\",\"doi\":\"10.1109/ICSEC.2013.6694747\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Metadata is an essential part of modern information system since it helps people to find relevant documents from disparate repositories. This paper describes my effort to develop a system that automatically generates semantic metadata from large, diverse, and evolving spreadsheet collections. Semantic metadata is known as metadata that describes contextually relevant or domain-specific information about content based on an industry-specific or enterprise-specific custom metadata model. In order to simplify semantic metadata generation problem, spreadsheet collections are categorized by layout similarity. A set of properties and semantic metadata extraction rules of a categorized spreadsheet collection is defined from a sample spreadsheet selected from the spreadsheet collection. Category of a given spreadsheet is justified by checking its properties with the property sets of registered collections. Semantic metadata generation of the given spreadsheet is based on semantic metadata extraction rules of the category to which the spreadsheet belongs. The hierarchical structure of semantic metadata of this paper enables end-users to define the meanings of search keywords. This capability allows end users to search the relevant spreadsheets efficiently.\",\"PeriodicalId\":191620,\"journal\":{\"name\":\"2013 International Computer Science and Engineering Conference (ICSEC)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 International Computer Science and Engineering Conference (ICSEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSEC.2013.6694747\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 International Computer Science and Engineering Conference (ICSEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSEC.2013.6694747","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Extracting semantic metadata for effective spreadsheet search
Metadata is an essential part of modern information system since it helps people to find relevant documents from disparate repositories. This paper describes my effort to develop a system that automatically generates semantic metadata from large, diverse, and evolving spreadsheet collections. Semantic metadata is known as metadata that describes contextually relevant or domain-specific information about content based on an industry-specific or enterprise-specific custom metadata model. In order to simplify semantic metadata generation problem, spreadsheet collections are categorized by layout similarity. A set of properties and semantic metadata extraction rules of a categorized spreadsheet collection is defined from a sample spreadsheet selected from the spreadsheet collection. Category of a given spreadsheet is justified by checking its properties with the property sets of registered collections. Semantic metadata generation of the given spreadsheet is based on semantic metadata extraction rules of the category to which the spreadsheet belongs. The hierarchical structure of semantic metadata of this paper enables end-users to define the meanings of search keywords. This capability allows end users to search the relevant spreadsheets efficiently.