{"title":"The Triple Helix of innovation as a double game involving domestic and foreign actors","authors":"Eustache Mêgnigbêto","doi":"10.2478/jdis-2024-0004","DOIUrl":"https://doi.org/10.2478/jdis-2024-0004","url":null,"abstract":"Purpose The collaboration relationships between innovation actors at a geographic level may be considered as grouping two separate layers, the domestic and the foreign. At the level of each layer, the relationships and the actors involved constitute a Triple Helix game. The paper distinguishes three levels of analysis: the global, grouping together all actors; the domestic, grouping together domestic actors; and the foreign, comprising only actors from partner countries. Design/methodology/approach Bibliographic records from the Web of Science for South Korea and West Africa, broken down by innovation actor and distinguishing domestic from international collaboration, are analyzed with game theory. The core, the Shapley value, and the nucleolus are computed at the three levels to measure the synergy between actors. Findings The synergy operates more in South Korea than in West Africa; the government is more present in West Africa than in South Korea; domestic actors create more synergy in South Korea, but foreign actors create more in West Africa; South Korea can consume all the foreign synergy, which is not the case for West Africa. Research limitations Research data are limited to publication records; techniques and methods used may be extended to other research outputs. Practical implications West African governments should increase their investment in science, technology, and innovation to benefit more from the synergy their innovation actors contributed at the foreign level. However, the results of the current study may not be sufficient to prove that greater investment will yield benefits from foreign synergies. 
Originality/value This paper uses game theory to assess innovation systems by computing the contribution of foreign actors to knowledge production at an area level. It proposes an indicator to this end.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"199 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139587951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A comparison of model choice strategies for logistic regression","authors":"Markku Karhunen","doi":"10.2478/jdis-2024-0001","DOIUrl":"https://doi.org/10.2478/jdis-2024-0001","url":null,"abstract":"Purpose The purpose of this study is to develop and compare model choice strategies in the context of logistic regression. Model choice means the choice of the covariates to be included in the model. Design/methodology/approach The study is based on Monte Carlo simulations. The methods are compared in terms of three measures of accuracy: specificity and two kinds of sensitivity. A loss function combining sensitivity and specificity is introduced and used for a final comparison. Findings The choice of method depends on how much the users emphasize sensitivity against specificity. It also depends on the sample size. For a typical logistic regression setting with a moderate sample size and a small to moderate effect size, either BIC, BICc or Lasso seems to be optimal. Research limitations Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data. Thus, more simulations are needed. Practical implications Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper. Alternatively, they could run their own simulations and calculate the loss function. Originality/value This is a systematic comparison of model choice algorithms and heuristics in the context of logistic regression. 
The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"24 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139587817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mapping the geography of editors-in-chief","authors":"György Csomós","doi":"10.2478/jdis-2024-0002","DOIUrl":"https://doi.org/10.2478/jdis-2024-0002","url":null,"abstract":"Purpose This study aims to explore the geography of editors-in-chief to demonstrate which countries exercise the highest-level decision-making in scholarly communication. In addition, the study seeks to investigate the potential relationships between the origin and nationality of academic publishers and the geography of editors-in-chief. Design/methodology/approach The analysis involves 11,915 journals listed in Web of Science’s Social Sciences Citation Index (SSCI) and Science Citation Index Expanded (SCIE). These journals employ 15,795 scholars as editors-in-chief. The geographical locations of the institutions the editors-in-chief are affiliated with were identified; then, the data were aggregated at the country level. Findings The results show that most editors-in-chief are located in countries of the Anglosphere, primarily the United States and the United Kingdom. In addition, most academic publishers and professional organizations that publish academic journals were found to be based in the United States and the United Kingdom, where most editors-in-chief are also based. Research limitations The analysis involves journals indexed in the Web of Science’s SCIE/SSCI databases, which are demonstrably biased toward the English language. Furthermore, the study only takes a snapshot of the geography of editors-in-chief for the year 2022, but it does not investigate trends. Research implications The study maps the highest-level decision-making in scholarly communication. 
Originality/value The study explores and maps the geography of editors-in-chief by using a massive dataset.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"11 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2023-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139066330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dimensionality reduction model based on integer planning for the analysis of key indicators affecting life expectancy","authors":"Wei Cui, Zhiqiang Xu, Ren Mu","doi":"10.2478/jdis-2023-0025","DOIUrl":"https://doi.org/10.2478/jdis-2023-0025","url":null,"abstract":"Purpose Exploring a dimensionality reduction model that can adeptly eliminate outliers and select the appropriate number of clusters is of profound theoretical and practical importance. Additionally, the interpretability of these models presents a persistent challenge. Design/methodology/approach This paper proposes two innovative dimensionality reduction models based on integer programming (DRMBIP). These models assess compactness through the correlation of each indicator with its class center, while separation is evaluated by the correlation between different class centers. In contrast to DRMBIP-p, the DRMBIP-v treats the threshold parameter as a variable, with the aim of optimally balancing compactness and separation. Findings Using data from the Global Health Observatory (GHO), this study investigates 141 indicators that influence life expectancy. The findings reveal that DRMBIP-p effectively reduces the dimensionality of data, ensuring compactness. It also maintains compatibility with other models. Additionally, DRMBIP-v finds the optimal result, showing exceptional separation. Visualization of the results reveals that all classes have a high compactness. Research limitations The DRMBIP-p requires the input of the correlation threshold parameter, which plays a pivotal role in the effectiveness of the final dimensionality reduction results. In the DRMBIP-v, treating the threshold parameter as a variable may emphasize either separation or compactness. This necessitates an artificial adjustment to the overflow component within the objective function. 
Practical implications The DRMBIP presented in this paper is adept at uncovering the primary geometric structures within high-dimensional indicators. Validated by life expectancy data, this paper demonstrates potential to assist data miners with the reduction of data dimensions. Originality/value To our knowledge, this is the first time that integer programming has been used to build a dimensionality reduction model with indicator filtering. It not only has applications in life expectancy, but also has obvious advantages in data mining work that requires precise class centers.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"59 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138507921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Text duplication of papers in four medical related fields","authors":"Ping Ni, Lianhui Shan, Yong Li, Xinying An","doi":"10.2478/jdis-2023-0024","DOIUrl":"https://doi.org/10.2478/jdis-2023-0024","url":null,"abstract":"Purpose To reveal the typical features of text duplication in papers from four medical fields: basic medicine, health management, pharmacology and pharmacy, and public health and preventive medicine. To analyze the reasons for duplication and provide suggestions for the management of medical academic misconduct. Design/methodology/approach In total, 2,469 representative Chinese journal papers were included in our research, which were submitted by researchers in 2020 and 2021. A plagiarism check was carried out using the Academic Misconduct Literature Check System (AMLC). We generated a corrected similarity index based on the AMLC general similarity index for further analysis. We compared the similarity indices of papers in four medical fields and revealed their trends over time; differences in similarity index between review and research articles were also analyzed according to the different fields. Further analysis of 143 papers suspected of plagiarism was also performed from the perspective of sections containing duplication and according to the field of research. Findings Papers in the field of pharmacology and pharmacy had the highest similarity index (8.67 ± 5.92%), which was significantly higher than that in other fields, except health management. The similarity index of review articles (9.77 ± 10.28%) was significantly higher than that of research articles (7.41 ± 6.26%). In total, 143 papers were suspected of plagiarism (5.80%) with similarity indices ≥ 15%; most were papers on health management (78, 54.55%), followed by public health and preventive medicine (38, 26.58%); 90.21% of the 143 papers had duplication in multiple sections, while only 9.79% had duplication in a single section. 
The distribution of sections with duplication varied among different fields; papers in pharmacology and pharmacy were more likely to have duplication in the data/methods and introduction/background sections, whereas papers in health management were more likely to contain duplication in the introduction/background or results/discussion sections. Different structures for papers in different fields may have caused these differences. Research limitations There were three limitations to our research. Firstly, we observed that a small number of papers had already been checked earlier. It is unknown who conducted that plagiarism check, as it can be part of other evaluations, such as applications for science and technology projects or awards. If the authors carried out the check, text with high similarity indices may have been excluded before submission, meaning the similarity index in our research may have been lower than the original value. Secondly, there were only four medical fields included in our research. Additional analysis on a wider scale is required in the future. Thirdly, only a general similarity index was calculated in our study; other similarity indices were not tested. Practical implications A comprehensive analysis of similarity indices in f","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135855416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research misconduct in hospitals is spreading: A bibliometric analysis of retracted papers from Chinese university-affiliated hospitals","authors":"Zi-han Yuan, Yi Liu","doi":"10.2478/jdis-2023-0022","DOIUrl":"https://doi.org/10.2478/jdis-2023-0022","url":null,"abstract":"Purpose The number of retracted papers from Chinese university-affiliated hospitals is increasing, which has raised much concern. The aim of this study is to analyze the retracted papers from university-affiliated hospitals in mainland China from 2000 to 2021. Design/methodology/approach Data for 1,031 retracted papers were identified from the Web of Science Core Collection database. The information of the hospitals involved was obtained from their official websites. We analyzed the chronological changes, journal distribution, discipline distribution and retraction reasons for the retracted papers. The grade and geographic locations of the hospitals involved were explored as well. Findings We found a rapid increase in the number of retracted papers, while the retraction time interval is decreasing. The main reasons for retraction are plagiarism/self-plagiarism (n=255), invalid data/images/conclusions (n=212), fake peer review (n=175) and honest error (n=163). The disciplines are mainly distributed in oncology (n=320), pharmacology & pharmacy (n=198) and research & experimental medicine (n=166). About 43.8% of the retracted papers were from hospitals affiliated with prestigious universities. Research limitations This study fails to differentiate between retractions due to honest error and retractions due to research misconduct. We believe that there is a fundamental difference between honest error retractions and misconduct retractions. Another limitation is that authors of the retracted papers have not been analyzed in this study. Practical implications This study provides a reference for addressing research misconduct in Chinese university-affiliated hospitals. 
We recommend that universities and hospitals educate all their staff about the basic norms of research integrity, punish authors of papers retracted for scientific misconduct, and reform the unreasonable evaluation system. Originality/value Based on the analysis of retracted papers, this study further analyzes the characteristics of the institutions behind retracted papers, which may deepen the research on retracted papers and provide a new perspective to understand the retraction phenomenon.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136059273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The need to develop tailored tools for improving the quality of thematic bibliometric analyses: Evidence from papers published in Sustainability and Scientometrics","authors":"Alvaro Cabezas-Clavijo, Yusnelkis Milanés-Guisado, Ruben Alba-Ruiz, Ángel M. Delgado-Vázquez","doi":"10.2478/jdis-2023-0021","DOIUrl":"https://doi.org/10.2478/jdis-2023-0021","url":null,"abstract":"Purpose The aim of this article is to explore up to seven parameters related to the methodological quality and reproducibility of thematic bibliometric research published in the two most productive journals in bibliometrics, Sustainability (a journal outside the discipline) and Scientometrics, the flagship journal in the field. Design/methodology/approach The study identifies the need for developing tailored tools for improving the quality of thematic bibliometric analyses, and presents a framework that can guide the development of such tools. A total of 508 papers from the 2019-2021 period are analysed, 77% published in Sustainability and 23% in Scientometrics. Findings An average of 2.6 shortcomings per paper was found for the whole sample, with an almost identical number of flaws in both journals. Sustainability has more flaws than Scientometrics in four of the seven parameters studied, while Scientometrics has more shortcomings in the remaining three variables. Research limitations The first limitation of this work is that it is a study of two scientific journals, so the results cannot be directly extrapolated to the set of thematic bibliometric analyses published in journals from all fields. Practical implications We propose the adoption of protocols, guidelines, and other similar tools, adapted to bibliometric practice, which could increase the thoroughness, transparency, and reproducibility of this type of research. 
Originality/value These results show considerable room for improvement in terms of the adequate use and breakdown of methodological procedures in thematic bibliometric research, both in journals in the Information Science area and journals outside the discipline.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136059277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The notion of dominant terminology in bibliometric research","authors":"Yves Fassin, Ronald Rousseau","doi":"10.2478/jdis-2023-0020","DOIUrl":"https://doi.org/10.2478/jdis-2023-0020","url":null,"abstract":"In this opinion paper, we introduce the expressions dominant terminology and dominant term in the quantitative studies of science, by analogy with the notion of dominant design in product development and innovation.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135734616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Peculiarities of gender disambiguation and ordering of non-English authors’ names for Economic papers beyond core databases","authors":"Olesya Mryglod, Serhii Nazarovets, Serhiy Kozmenko","doi":"10.2478/jdis-2023-0001","DOIUrl":"https://doi.org/10.2478/jdis-2023-0001","url":null,"abstract":"Purpose To supplement the quantitative portrait of Ukrainian Economics discipline with the results of gender and author ordering analysis at the level of individual authors, special methods of working with bibliographic data with a predominant share of non-English authors are used. The properties of gender mixing, the likelihood of male and female authors occupying the first position in the authorship list, as well as the arrangements of names are studied. Design/methodology/approach A data set containing bibliographic records related to Ukrainian journal publications in the field of Economics is constructed using Crossref metadata. Partial semi-automatic disambiguation of authors’ names is performed. First names, along with gender-specific ethnic surnames, are used for gender disambiguation required for further comparative gender analysis. Random reshuffling of data is used to determine the impact of gender correlations. To assess the level of alphabetization for our data set, both Latin and Cyrillic versions of names are taken into account. Findings The lack of well-structured metadata and the poor use of digital identifiers lead to numerous problems with the automation of bibliographic data pre-processing, especially in the case of publications by non-Western authors. The described stages for working with such specific data help to work at the level of authors and analyse, in particular, gender issues. Despite the larger number of female authors, gender equality is more likely to be reported at the individual level for the discipline of Ukrainian Economics. 
The tendencies towards collaborative or solo publications and gender mixing patterns are found to depend on the journal: differences are found for publications indexed in the Scopus and/or Web of Science databases. It has also been found that Ukrainian Economics research is characterized by a rather non-alphabetical order of authors. Research limitations Only partial disambiguation of authors’ names is performed, in a semi-automatic way. Gender labels can be derived only for authors listed with full first names or gender-specific last names. Practical implications The typical features of Ukrainian Economic discipline can be used to perform a comparison with other countries and disciplines, and to develop an informed assessment procedure at the national level. The proposed way of processing publication data can be borrowed to enrich metadata about other research disciplines, especially for non-English speaking countries. Originality/value To our knowledge, this is the first large-scale quantitative study of Ukrainian Economic discipline. The results obtained are valuable not only at the national level, but also contribute to general knowledge about Economic research, gender issues, and authors’ names ordering. An example of the use of Crossref data is provided, although this data source is still less used due to a number of drawbacks. 
Here, for the first time, attention is drawn to ","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135704714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Novel Approaches to the Development and Application of Informetric and Scientometric Tools","authors":"Giuseppe Catalano,Cinzia Daraio,Jacqueline Leta,Henk F. Moed,Giancarlo Ruocco,Xiaolin Zhang","doi":"10.2478/jdis-2020-0041","DOIUrl":"https://doi.org/10.2478/jdis-2020-0041","url":null,"abstract":"This is the second part of the Journal of Data and Information Science (JDIS) Special Issue on ISSI 2019, the 17th International Conference on Scientometrics and Informetrics, held in Rome on 2–5 September 2019. It includes 10 additional selected posters presented during the conference and largely expanded by the authors afterwards. The papers included in this volume have been grouped in three broad themes:","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"37 1","pages":"1-4"},"PeriodicalIF":1.5,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138557006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}