{"title":"Mining for attributes and values in tables","authors":"N. Harnsamut, N. Sahavechaphan","doi":"10.1145/1936254.1936264","journal":{"name":"J. Multim. Process. Technol.","volume":"51 1","pages":"0"},"publicationDate":"2010-10-26","publicationTypes":"Journal Article","isOpenAccess":false,"citationCount":"2","platform":"Semanticscholar","PeriodicalId":226712,"PeriodicalName":"J. Multim. Process. Technol.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1936254.1936264"}
Tables are a simple and widely used data representation scheme. A single table typically contains rich information that is valuable for many applications, such as information retrieval and question answering. While humans can easily parse any table format, this parsing is difficult for computers, which prevents such applications from being automated. In this paper, we therefore propose a comprehensive and novel table interpretation technique, namely tInterpreter. Essentially, it transforms a table into its corresponding horizontal 1-dimensional tables. To achieve this, the underlying work is based on (i) the similarity of two given cells with respect to data type and semantic correspondence; (ii) the discovery of the boundary of a primitive table residing in a composite table; (iii) the identification of the attribute-value relationships and value associations of cells; and (iv) the integration of two pieces of similar or dissimilar information. The experimental results showed that the overall effectiveness of tInterpreter was higher than that of the methods of Chen, Tengli and Kim.
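
To make ideas (i) and (iii) more concrete, here is a minimal Python sketch. It is not the tInterpreter algorithm, whose details the abstract does not give. It assumes a rectangular table of strings, compares cells by a coarse data type only (semantic correspondence is left out), and treats the first cell of each row or column as a candidate attribute name. All function names (`cell_type`, `type_similarity`, `consistency`, `to_horizontal`, `to_records`) are hypothetical and chosen for illustration.

```python
import re
from typing import Dict, List, Sequence


def cell_type(cell: str) -> str:
    """Return a coarse data type for a cell: 'empty', 'number', or 'text'."""
    s = cell.strip()
    if not s:
        return "empty"
    if re.fullmatch(r"[-+]?\d+(\.\d+)?%?", s.replace(",", "")):
        return "number"
    return "text"


def type_similarity(a: str, b: str) -> float:
    """Assumed similarity measure: 1.0 if both cells share a coarse type, else 0.0."""
    return 1.0 if cell_type(a) == cell_type(b) else 0.0


def consistency(lines: Sequence[Sequence[str]]) -> float:
    """Mean type similarity of adjacent value cells in each line.

    The first cell of every line is skipped because it is treated as a
    candidate attribute name rather than a value.
    """
    scores = [
        type_similarity(a, b)
        for line in lines
        for a, b in zip(line[1:], line[2:])
    ]
    return sum(scores) / len(scores) if scores else 0.0


def to_horizontal(table: List[List[str]]) -> List[List[str]]:
    """Return the table with attribute names in the first row.

    If values are more type-consistent along rows than along columns,
    the attributes are assumed to sit in the first column, so the table
    is transposed. Ties keep the table as given.
    """
    columns = [list(col) for col in zip(*table)]
    if consistency(table) > consistency(columns):
        return columns
    return table


def to_records(table: List[List[str]]) -> List[Dict[str, str]]:
    """Pair each value with its attribute name, one dict per record."""
    header, *rows = to_horizontal(table)
    return [dict(zip(header, row)) for row in rows]


# A vertical table: attribute names run down the first column.
vertical = [
    ["Name", "Ann", "Bob"],
    ["Age", "31", "27"],
    ["City", "Oslo", "Rome"],
]
records = to_records(vertical)
```

In this sketch the vertical table and its transposed, horizontal form yield the same list of attribute-value records. Detecting primitive tables inside a composite table and merging pieces of information, items (ii) and (iv), are outside its scope.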