{"title":"RefSeer: A citation recommendation system","authors":"W. Huang, Zhaohui Wu, P. Mitra, C. Lee Giles","doi":"10.1109/JCDL.2014.6970192","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970192","url":null,"abstract":"Citations are important in academic dissemination. To help researchers check the completeness of citations while authoring a paper, we introduce a citation recommendation system called RefSeer. Researchers can use it to find related works to cited while authoring papers. It can also be used by reviewers to check the completeness of a paper's references. RefSeer presents both topic based global recommendation and also citation-context based local recommendation. By evaluating the quality of recommendation, we show that such recommendation system can recommend citations with good precision and recall. We also show that our recommendation system is very efficient and scalable.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"13 1","pages":"371-374"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87800649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reducing computational effort for plagiarism detection by using citation characteristics to limit retrieval space","authors":"Norman Meuschke, Bela Gipp","doi":"10.1109/JCDL.2014.6970168","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970168","url":null,"abstract":"This paper proposes a hybrid approach to plagiarism detection in academic documents that integrates detection methods using citations, semantic argument structure, and semantic word similarity with character-based methods to achieve a higher detection performance for disguised plagiarism forms. Currently available software for plagiarism detection exclusively performs text string comparisons. These systems find copies, but fail to identify disguised plagiarism, such as paraphrases, translations, or idea plagiarism. Detection approaches that consider semantic similarity on word and sentence level exist and have consistently achieved higher detection accuracy for disguised plagiarism forms compared to character-based approaches. However, the high computational effort of these semantic approaches makes them infeasible for use in real-world plagiarism detection scenarios. The proposed hybrid approach uses citation-based methods as a preliminary heuristic to reduce the retrieval space with a relatively low loss in detection accuracy. This preliminary step can then be followed by a computationally more expensive semantic and character-based analysis. We show that such a hybrid approach allows semantic plagiarism detection to become feasible even on large collections for the first time.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"7 1","pages":"197-200"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90990943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Luyuan Li, Yongtao Wang, Liangcai Gao, Zhi Tang, C. Suen
{"title":"Comic2CEBX: A system for automatic comic content adaptation","authors":"Luyuan Li, Yongtao Wang, Liangcai Gao, Zhi Tang, C. Suen","doi":"10.1109/JCDL.2014.6970183","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970183","url":null,"abstract":"Comics are popular almost throughout the world. With the help of comic document digitization, it is much easier for people to archive and browse comic works. However, there are still some big challenges along with comic document digitization progress. Among these challenges, comic content adaptation is an important one to be tackled. The existing works only focus on parts of this problem and do not provide a tangible solution to display comic contents on different devices. In this paper, we solve these problems by proposing Comic2CEBX, a system which can automatically convert a set of scanned comic page images into a CEBX file that allows reflowing of the original comic pages with fixed layouts. Taking raw comic images as inputs, our system first extracts three kinds of low-level visual patterns and then uses multilayer Conditional Random Fields to detect all the panels. Meanwhile, our system automatically identifies the reading orders of the panels within each page. Finally, we encapsulate the comic page images and the obtained page structure information (i.e., the panels detection results and the corresponding reading orders) to generate a CEBX file. Experimental results show that our comic page layout analysis method achieves better performance than the existing ones, and use case presentation of the CEBX files produced by our system demonstrates that it brings better comic reading experience especially on mobile devices.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"1 1","pages":"299-308"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84847288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
D. Pereira, Eduardo Emanuel Braga da Silva, A. Esmin
{"title":"Disambiguating publication venue titles using association rules","authors":"D. Pereira, Eduardo Emanuel Braga da Silva, A. Esmin","doi":"10.1109/JCDL.2014.6970153","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970153","url":null,"abstract":"Research agencies in several countries evaluate the impact of scientific publications of researcher groups to define their investments, and one of the main used metrics is the quality of the publication venues where their works were published. Several bibliometric indexes have been formulated by measuring the quality of a publication venue. However, given a set of citations extracted, for example, from curricula vitae of a researcher group, to effectively use bibliometric indexes to evaluate their quality it is necessary to identify correctly the publication venue title of each citation. This task is not easy, since there are not unique identifiers for publication venues. Frequently, citations contain abbreviated forms and acronyms, publication venues share similar titles, sometimes they change their titles, divide or merge, creating new ones. Traditional digital libraries deal with this problem by creating Authority Files. In this work, we present a twofold contribution: (i) the creation of a Computer Science publication venue authority file and (ii) the proposal of a method that uses association rules to disambiguate publication venue titles originated from citations. The disambiguator is a supervised learning method that uses the authority file to train a classifier, whose generated model is a set of association rules to identify publication venues. Experiments show that our method obtains better results than three state of art baselines.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"60 1","pages":"77-86"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84408490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Method for supporting analysis of personal relationships through place names extracted from documents","authors":"Fuminori Kimura, Akira Maeda","doi":"10.1109/JCDL.2014.6970176","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970176","url":null,"abstract":"Visualizing information extracted from text is helpful for intuitively understanding the information. Extracting and visualizing personal relationships from text is one of the promising applications of this approach. Existing methods usually estimate personal relationships from direct co-occurrences of personal names that appear in a text. In our previous work, we proposed a method for extracting personal relationships from indirect co-occurrence relationships obtained through place names. This method can estimate the relationships among persons who do not necessarily have direct relationships. These relationships are visualized in a network graph. However, it becomes difficult to grasp the relationships when the number of persons increases. In this paper, we propose a method that supports analyzing the extracted personal relationships through place names and that is based on our previous work. Our goal is to support analysis by providing the information of the clustering of closely related people and important place names for each cluster. The proposed method was applied to a Japanese historical chronicle written in the 12th century. Experimental results showed a strong correspondence to the known historical facts. The results also indicate that the proposed method might be able to uncover the characteristics of people whose histories are not clearly known yet.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"29 1","pages":"253-256"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84625633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
J. Lacasta, F. J. Lopez-Pellicer, Walter Renteria-Agualimpia, J. Nogueras-Iso
{"title":"Improving the visibility of geospatial data on the Web","authors":"J. Lacasta, F. J. Lopez-Pellicer, Walter Renteria-Agualimpia, J. Nogueras-Iso","doi":"10.1109/JCDL.2014.6970162","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970162","url":null,"abstract":"Geospatial information is a common resource used at personal and corporative levels for decision making. Nowadays, a relevant percentage of the geospatial data on the web is provided by standardized services. However, due to the deficiencies in the service content descriptions, the data required for a task are not easy to find. To improve the description of geospatial information on the Web, this work proposes a process to construct a Linked Data model of geospatial resources that allows semantic searching and browsing. This is done by crawling the web in search of available geospatial services, and enriching their descriptions with concepts from common knowledge organizations models. As use case, we have created a Linked Data model describing the Web Map Services published by Spanish organizations.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"14 1","pages":"155-164"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87605063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou Sun, Liangcai Gao
{"title":"Full-text based context-rich heterogeneous network mining approach for citation recommendation","authors":"Xiaozhong Liu, Yingying Yu, Chun Guo, Yizhou Sun, Liangcai Gao","doi":"10.1109/JCDL.2014.6970191","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970191","url":null,"abstract":"Citation relationship between scientific publications has been successfully used for scholarly bibliometrics, information retrieval and data mining tasks, and citation-based recommendation algorithms are well documented. While previous studies investigated citation relations from various viewpoints, most of them share the same assumption that, if paper1 cites paper2 (or author1 cites author2), they are connected, regardless of citation importance, sentiment, reason, topic, or motivation. However, this assumption is oversimplified. In this study, we employ an innovative “context-rich heterogeneous network” approach, which paves a new way for citation recommendation task. In the network, we characterize (1) the importance of citation relationships between citing and cited papers, and (2) the topical citation motivation. Unlike earlier studies, the citation information, in this paper, is characterized by citation textual contexts extracted from the full-text citing paper. We also propose algorithm to cope with the situation when large portion of full-text missing information exists in the bibliographic repository. Evaluation results show that, context-rich heterogeneous network can significantly enhance the citation recommendation performance.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"116 1","pages":"361-370"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83214846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The Organization information integration in the management of a Digital Library System","authors":"A. D. Iorio, M. Schaerf","doi":"10.1109/JCDL.2014.6970225","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970225","url":null,"abstract":"The Sapienza Digital Library collects digital resources from the different University's Organizations representing the multidisciplinary Sapienza University's community. The poster presents the pre-ingestion process for creating and aggregating digital resources, under the Organizational Collection conceptualization. The pre-ingestion building process had allowed to automatically provide information about the resources' custody from the origination, until their creation as OAIS Submission Information Package. Whatever system able to provide archival, preservation or dissemination services, could potentially use it, maintaining provenance information.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"73 1","pages":"461-462"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91217351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Topical establishment leveraging literature evolution","authors":"Han Xu, É. Martin, Ashesh Mahidadia","doi":"10.1109/JCDL.2014.6970175","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970175","url":null,"abstract":"From an evolutionary perspective, a body of research is an evolving ecosystem, consisting of research topics subjected to a form of natural selection as topics come into existence, and thrive more or less over a variable period of time. Identifying the form of establishment of a given topic in a scientific domain, in terms of its momentum at the time of inquiry, can provide useful insights into where this topic is heading, and can facilitate e?ective literature research. Here we propose to identify three forms of establishment of topics, emerging from a comparison between two di?erent methodologies in ranking papers, taking advantage of the mutual relationship between recognition of papers and recognition of topics. More specifically, by analysing the correlation between the rankings obtained by applying both methodologies, we discover thee clusters of topics, each of which is associated with a particular momentum of establishment.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"58 1","pages":"249-252"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88885623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An approach to named entity extraction from historical documents in traditional mongolian script","authors":"Biligsaikhan Batjargal, Garmaabazar Khaltarkhuu, Fuminori Kimura, Akira Maeda","doi":"10.1109/JCDL.2014.6970239","DOIUrl":"https://doi.org/10.1109/JCDL.2014.6970239","url":null,"abstract":"In this poster, we propose an information extraction method for digitized ancient Mongolian documents by utilizing an ancient-modern dictionary. Named entities such as historical figures and place names will be extracted by employing text mining techniques that aim to reduce the labor-intensive annotation on historical text. The Text Encoding Initiative (TEI) guidelines will be applied to digital text representations that encode the historical figures and place names along with their interpretations, and commentaries.","PeriodicalId":92278,"journal":{"name":"Proceedings of the ... ACM/IEEE Joint Conference on Digital Libraries. ACM/IEEE Joint Conference on Digital Libraries","volume":"178 1","pages":"489-490"},"PeriodicalIF":0.0,"publicationDate":"2014-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78026393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}