Journal of data and information science (Warsaw, Poland)最新文献

Editorial board publication strategy and acceptance rates in Turkish national journals 编辑委员会出版策略和土耳其国家期刊的接受率

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-08-25 DOI: 10.2478/jdis-2023-0019

Lokman Tutuncu

{"title":"Editorial board publication strategy and acceptance rates in Turkish national journals","authors":"Lokman Tutuncu","doi":"10.2478/jdis-2023-0019","DOIUrl":"https://doi.org/10.2478/jdis-2023-0019","url":null,"abstract":"Abstract Purpose This study takes advantage of newly released journal metrics to investigate whether local journals with more qualified boards have lower acceptance rates, based on data from 219 Turkish national journals and 2,367 editorial board members. Design/methodology/approach This study argues that journal editors can signal their scholarly quality by publishing in reputable journals. Conversely, editors publishing inside articles in affiliated national journals would send negative signals. The research predicts that high (low) quality editorial boards will conduct more (less) selective evaluation and their journals will have lower (higher) acceptance rates. Based on the publication strategy of editors, four measures of board quality are defined: Number of board inside publications per editor (INSIDER), number of board Social Sciences Citation Index publications per editor (SSCI), inside-to-SSCI article ratio (ISRA), and board citation per editor (CITATION). Predictions are tested by correlation and regression analysis. Findings Low-quality board proxies (INSIDER, ISRA) are positively, and high-quality board proxies (SSCI, CITATION) are negatively associated with acceptance rates. Further, we find that receiving a larger number of submissions, greater women representation on boards, and Web of Science and Scopus (WOSS) coverage are associated with lower acceptance rates. Acceptance rates for journals range from 12% to 91%, with an average of 54% and a median of 53%. Law journals have significantly higher average acceptance rate (68%) than other journals, while WOSS journals have the lowest (43%). Findings indicate some of the highest acceptance rates in Social Sciences literature, including competitive Business and Economics journals that traditionally have low acceptance rates. Limitations Research relies on local context to define publication strategy of editors. Findings may not be generalizable to mainstream journals and core science countries where emphasis on research quality is stronger and editorial selection is based on scientific merit. Practical implications Results offer useful insights into editorial management of national journals and allow us to make sense of local editorial practices. The importance of scientific merit for selection to national journal editorial boards is particularly highlighted for sound editorial evaluation of submitted manuscripts. Originality/value This is the first attempt to document a significant relation between acceptance rates and editorial board publication behavior.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"0 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42529544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings 建立神经网络模型，识别和纠正夸大肥胖相关科学发现的新闻标题

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-06-01 DOI: 10.2478/jdis-2023-0014

R. An, Quinlan Batcheller, Junjie Wang, Yuyi Yang

{"title":"Build neural network models to identify and correct news headlines exaggerating obesity-related scientific findings","authors":"R. An, Quinlan Batcheller, Junjie Wang, Yuyi Yang","doi":"10.2478/jdis-2023-0014","DOIUrl":"https://doi.org/10.2478/jdis-2023-0014","url":null,"abstract":"Abstract Purpose Media exaggerations of health research may confuse readers’ understanding, erode public trust in science and medicine, and cause disease mismanagement. This study built artificial intelligence (AI) models to automatically identify and correct news headlines exaggerating obesity-related research findings. Design/methodology/approach We searched popular digital media outlets to collect 523 headlines exaggerating obesity-related research findings. The reasons for exaggerations include: inferring causality from observational studies, inferring human outcomes from animal research, inferring distant/end outcomes (e.g., obesity) from immediate/intermediate outcomes (e.g., calorie intake), and generalizing findings to the population from a subgroup or convenience sample. Each headline was paired with the title and abstract of the peer-reviewed journal publication covered by the news article. We drafted an exaggeration-free counterpart for each original headline and fined-tuned a BERT model to differentiate between them. We further fine-tuned three generative language models—BART, PEGASUS, and T5 to autogenerate exaggeration-free headlines based on a journal publication’s title and abstract. Model performance was evaluated using the ROUGE metrics by comparing model-generated headlines with journal publication titles. Findings The fine-tuned BERT model achieved 92.5% accuracy in differentiating between exaggeration-free and original headlines. Baseline ROUGE scores averaged 0.311 for ROUGE-1, 0.113 for ROUGE-2, 0.253 for ROUGE-L, and 0.253 ROUGE-Lsum. PEGASUS, T5, and BART all outperformed the baseline. The best-performing BART model attained 0.447 for ROUGE-1, 0.221 for ROUGE-2, 0.402 for ROUGE-L, and 0.402 for ROUGE-Lsum. Originality/value This study demonstrated the feasibility of leveraging AI to automatically identify and correct news headlines exaggerating obesity-related research findings.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"88 - 97"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45890018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An author credit allocation method with improved distinguishability and robustness 一种提高可分辨性和鲁棒性的作者信用分配方法

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-06-01 DOI: 10.2478/jdis-2023-0016

Yang Li, Tao Jia

{"title":"An author credit allocation method with improved distinguishability and robustness","authors":"Yang Li, Tao Jia","doi":"10.2478/jdis-2023-0016","DOIUrl":"https://doi.org/10.2478/jdis-2023-0016","url":null,"abstract":"Abstract Purpose The purpose of this study is to propose an improved credit allocation method that makes the leading author of the paper more distinguishable and makes the deification more robust under malicious manipulations. Design/methodology/approach We utilize a modified Sigmoid function to handle the fat-tail distributed citation counts. We also remove the target paper in calculating the contribution of co-citations. Following previous studies, we use 30 Nobel Prize-winning papers and their citation networks based on the American Physical Society (APS) and the Microsoft Academic Graph (MAG) dataset to test the accuracy of our proposed method (NCCAS). In addition, we use 654,148 articles published in the field of computer science from 2000 to 2009 in the MAG dataset to validate the distinguishability and robustness of NCCAS. Finding Compared with the state-of-the-art methods, NCCAS gives the most accurate prediction of Nobel laureates. Furthermore, the leading author of the paper identified by NCCAS is more distinguishable compared with other co-authors. The results by NCCAS are also more robust to malicious manipulation. Finally, we perform ablation studies to show the contribution of different components in our methods. Research limitations Due to limited ground truth on the true leading author of a work, the accuracy of NCCAS and other related methods can only be tested in Nobel Physics Prize-winning papers. Practical implications NCCAS is successfully applied to a large number of publications, demonstrating its potential in analyzing the relationship between the contribution and the recognition of authors with different by-line orders. Originality/value Compared with existing methods, NCCAS not only identifies the leading author of a paper more accurately, but also makes the deification more distinguishable and more robust, providing a new tool for related studies.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"15 - 46"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47722992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Differences between journal and conference in computer science: a bibliometric view based on Bayesian network 计算机科学期刊与会议的差异——基于贝叶斯网络的文献计量学观点

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-06-01 DOI: 10.2478/jdis-2023-0017

Mingyue Sun, Mingliang Yue, Tingcan Ma

{"title":"Differences between journal and conference in computer science: a bibliometric view based on Bayesian network","authors":"Mingyue Sun, Mingliang Yue, Tingcan Ma","doi":"10.2478/jdis-2023-0017","DOIUrl":"https://doi.org/10.2478/jdis-2023-0017","url":null,"abstract":"Abstract Purpose This paper aims to investigate the differences between conference papers and journal papers in the field of computer science based on Bayesian network. Design/methodology/approach This paper investigated the differences between conference papers and journal papers in the field of computer science based on Bayesian network, a knowledge-representative framework that can model relationships among all variables in the network. We defined the variables required for Bayesian networks modeling, calculated the values of each variable based Aminer dataset (a literature data set in the field of computer science), learned the Bayesian network and derived some findings based on network inference. Findings The study found that conferences are more attractive to senior scholars, the academic impact of conference papers is slightly higher than journal papers, and it is uncertain whether conference papers are more innovative than journal papers. Research limitations The study was limited to the field of computer science and employed Aminer dataset as the sample. Further studies involving more diverse datasets and different fields could provide a more complete picture of the matter. Practical implications By demonstrating that Bayesian networks can effectively analyze issues in Scientometrics, the study offers valuable insights that may enhance researchers’ understanding of the differences between journal and conference in computer science. Originality/value Academic conferences play a crucial role in facilitating scholarly exchange and knowledge dissemination within the field of computer science. Several studies have been conducted to examine the distinctions between conference papers and journal papers in terms of various factors, such as authors, citations, h-index and others. Those studies were carried out from different (independent) perspectives, lacking a systematic examination of the connections and interactions between multiple perspectives. This paper supplements this deficiency based on Bayesian network modeling.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"47 - 60"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47950652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multimodal sentiment analysis for social media contents during public emergencies 突发公共事件中社交媒体内容的多模态情感分析

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-06-01 DOI: 10.2478/jdis-2023-0012

Tao Fan, Hao Wang, Peng Wu, Chen Ling, Milad Taleby Ahvanooey

{"title":"Multimodal sentiment analysis for social media contents during public emergencies","authors":"Tao Fan, Hao Wang, Peng Wu, Chen Ling, Milad Taleby Ahvanooey","doi":"10.2478/jdis-2023-0012","DOIUrl":"https://doi.org/10.2478/jdis-2023-0012","url":null,"abstract":"Abstract Purpose Nowadays, public opinions during public emergencies involve not only textual contents but also contain images. However, the existing works mainly focus on textual contents and they do not provide a satisfactory accuracy of sentiment analysis, lacking the combination of multimodal contents. In this paper, we propose to combine texts and images generated in the social media to perform sentiment analysis. Design/methodology/approach We propose a Deep Multimodal Fusion Model (DMFM), which combines textual and visual sentiment analysis. We first train word2vec model on a large-scale public emergency corpus to obtain semantic-rich word vectors as the input of textual sentiment analysis. BiLSTM is employed to generate encoded textual embeddings. To fully excavate visual information from images, a modified pretrained VGG16-based sentiment analysis network is used with the best-performed fine-tuning strategy. A multimodal fusion method is implemented to fuse textual and visual embeddings completely, producing predicted labels. Findings We performed extensive experiments on Weibo and Twitter public emergency datasets, to evaluate the performance of our proposed model. Experimental results demonstrate that the DMFM provides higher accuracy compared with baseline models. The introduction of images can boost the performance of sentiment analysis during public emergencies. Research limitations In the future, we will test our model in a wider dataset. We will also consider a better way to learn the multimodal fusion information. Practical implications We build an efficient multimodal sentiment analysis model for the social media contents during public emergencies. Originality/value We consider the images posted by online users during public emergencies on social platforms. The proposed method can present a novel scope for sentiment analysis during public emergencies and provide the decision support for the government when formulating policies in public emergencies.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"61 - 87"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41694711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Perspectives from a publishing ethics and research integrity team for required improvements 出版伦理和研究诚信团队对所需改进的看法

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-06-01 DOI: 10.2478/jdis-2023-0018

Sabina Alam, L. Wilson

引用次数: 0

Assessment of retracted papers, and their retraction notices, from a cancer journal associated with “paper mills” 一份与“造纸厂”相关的癌症杂志撤回论文及其撤回通知的评估

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-04-01 DOI: 10.2478/jdis-2023-0009

J. A. T. da Silva, Serhii Nazarovets

引用次数: 1

International visibility of Armenian domestic journals: the role of scientific diaspora 亚美尼亚国内期刊的国际知名度：散居国外的科学工作者的作用

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-04-01 DOI: 10.2478/jdis-2023-0011

E. Gzoyan, A. Mirzoyan, Anush Sargsyan, Mariam Yeghikyan, D. Maisano, Shushanik A. Sargsyan

{"title":"International visibility of Armenian domestic journals: the role of scientific diaspora","authors":"E. Gzoyan, A. Mirzoyan, Anush Sargsyan, Mariam Yeghikyan, D. Maisano, Shushanik A. Sargsyan","doi":"10.2478/jdis-2023-0011","DOIUrl":"https://doi.org/10.2478/jdis-2023-0011","url":null,"abstract":"Abstract Purpose Nearly 122 scientific journals are currently being published in Armenia—of which only six are indexed by WoS and/or Scopus databases. The majority of the national journals are published in the Armenian language, solely possessing abstracts written in English, although there are also English-language and multi-language journals with articles not only in Armenian but also in other foreign languages. The aim of this article is to study the visibility of the (non-indexed) national Armenian journals in the WoS database through citation analysis. In consideration of the existence of a relevant Armenian “diaspora” in the world, this article also attempts to estimate its impact in terms of citation statistics. Design/methodology/approach For this end, we have identified citations to the national/domestic Armenian journals in the WoS database in comparison with the share of citations received from “diaspora” researchers (researchers of Armenian origin born in foreign countries and those originally from Armenia who have emigrated to foreign countries). Findings Among the 116 Armenian domestic journals analyzed (not indexed by WoS), only 47 were found to be cited in WoS. Of these journals, almost 12% are citations by “diaspora” researchers, most of which concern Social Science and Humanities journals. Research limitations Although the surnames of Armenians end with -i(y)an, sometimes, the Diaspora Armenians, surnames are changed or modified or they are not ending with -i(y)an, in this case we may fail to identify them. Practical implications This study can help to build new, more deep and comprehensive relations with scientific diasporas. Originality/value This study offers a new understanding of multifaced research collaboration with scientific diasporas and their role in internationalization of domestic journals.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"93 - 117"},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44902267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Global trends in international research collaboration, 1980-2021 国际研究合作的全球趋势，1980-2021

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-04-01 DOI: 10.2478/jdis-2023-0015

D. Aksnes, G. Sivertsen

{"title":"Global trends in international research collaboration, 1980-2021","authors":"D. Aksnes, G. Sivertsen","doi":"10.2478/jdis-2023-0015","DOIUrl":"https://doi.org/10.2478/jdis-2023-0015","url":null,"abstract":"Abstract Purpose The aim of this study is to analyze the evolution of international research collaboration from 1980 to 2021. The study examines the main global patterns as well as those specific to individual countries, country groups, and different areas of research. Design/methodology/approach The study is based on the Web of Science Core collection database. More than 50 million publications are analyzed using co-authorship data. International collaboration is defined as publications having authors affiliated with institutions located in more than one country. Findings At the global level, the share of publications representing international collaboration has gradually increased from 4.7% in 1980 to 25.7% in 2021. The proportion of such publications within each country is higher and, in 2021, varied from less than 30% to more than 90%. There are notable disparities in the temporal trends, indicating that the process of internationalization has impacted countries in different ways. Several factors such as country size, income level, and geopolitics may explain the variance. Research limitations Not all international research collaboration results in joint co-authored scientific publications. International co-authorship is a partial indicator of such collaboration. Another limitation is that the applied full counting method does not take into account the number of authors representing in each country in the publication. Practical implications The study provides global averages, indicators, and concepts that can provide a useful framework of reference for further comparative studies of international research collaboration. Originality/value Long-term macro-level studies of international collaboration are rare, and as a novelty, this study includes an analysis by the World Bank’s division of countries into four income groups.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"26 - 42"},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43577441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Can first or last name uniqueness help to identify diaspora researchers from any country? 名字或姓氏的独特性是否有助于识别来自任何国家的散居研究人员?

Journal of data and information science (Warsaw, Poland) Pub Date : 2023-04-01 DOI: 10.2478/jdis-2023-0013

M. Thelwall

{"title":"Can first or last name uniqueness help to identify diaspora researchers from any country?","authors":"M. Thelwall","doi":"10.2478/jdis-2023-0013","DOIUrl":"https://doi.org/10.2478/jdis-2023-0013","url":null,"abstract":"Abstract Purpose Diaspora researchers work in one country but have ancestral origins in another, either through moves during a research career (mobile diaspora researchers) or by starting research in the target country (embedded diaspora researchers). Whilst mobile researchers might be tracked through affiliation changes in bibliometric databases, embedded researchers cannot. This article reports an evidence-based discussion of which countries’ diaspora researchers can be partially tracked using first or last names, addressing this limitation. Design/methodology/approach A frequency analysis of first and last names of authors of all Scopus journal articles 2001-2021 for 200 countries or regions. Findings There are great variations in the extent to which first or last names are uniquely national, from Monserrat (no unique first names) to Thailand (81% unique last names). Nevertheless, most countries have a subset of first or last names that are relatively unique. For the 50 countries with the most researchers, authors with relatively national names are always more likely to research their name-associated country, suggesting a continued national association. Lists of researchers’ first and last name frequencies and proportions are provided for 200 countries/regions. Research limitations Only one period is tracked (2001-2021) and no attempt was made to validate the ancestral origins of any researcher. Practical implications Simple name heuristics can be used to identify the international spread of a sample of most countries’ diaspora researchers, but some manual checks of individual names are needed to weed out false matches. This can supplement mobile researcher data from bibliometric databases. Originality/value This is the first attempt to list name associations for the authors of all countries and large regions, and to identify the countries for which diaspora researchers could be tracked by name.","PeriodicalId":92237,"journal":{"name":"Journal of data and information science (Warsaw, Poland)","volume":"8 1","pages":"1 - 25"},"PeriodicalIF":0.0,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41900076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0