Vinod Kumar Chauhan , Anna Ledwoch , Alexandra Brintrup , Manuel Herrera , Vaggelis Giannikas , Goran Stojkovic , Duncan Mcfarlane
{"title":"Network science approach for identifying disruptive elements of an airline","authors":"Vinod Kumar Chauhan , Anna Ledwoch , Alexandra Brintrup , Manuel Herrera , Vaggelis Giannikas , Goran Stojkovic , Duncan Mcfarlane","doi":"10.1016/j.dsm.2023.04.001","DOIUrl":"https://doi.org/10.1016/j.dsm.2023.04.001","url":null,"abstract":"<div><p>Currently, flight delays are common and they propagate from an originating flight to connecting flights, leading to large disruptions in the overall schedule. These disruptions cause massive economic losses, affect airlines’ reputations, waste passengers’ time and money, and directly impact the environment. This study adopts a network science approach for solving the delay propagation problem by modeling and analyzing the flight schedules and historical operational data of an airline. We aim to determine the most disruptive airports, flights, flight-connections, and connection types in an airline network. Disruptive elements are influential or critical entities in an airline network. They are the elements that can either cause (airline schedules) or have caused (historical data) the largest disturbances in the network. An airline can improve its operations by avoiding delays caused by the most disruptive elements. The proposed network science approach for disruptive element analysis was validated using a case study of an operating airline. The analysis indicates that potential disruptive elements in a schedule of an airline are also actual disruptive elements in the historical data and they should be considered to improve operations. The airline network exhibits small-world effects and delays can propagate to any part of the network with a minimum of four delayed flights. Finally, we observed that passenger connections between flights are the most disruptive connection type. Therefore, the proposed methodology provides a tool for airlines to build robust flight schedules that reduce delays and propagation.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49749482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Forecasting hourly PM2.5 concentrations based on decomposition-ensemble-reconstruction framework incorporating deep learning algorithms","authors":"Peilei Cai, Chengyuan Zhang, Jian Chai","doi":"10.1016/j.dsm.2023.02.002","DOIUrl":"https://doi.org/10.1016/j.dsm.2023.02.002","url":null,"abstract":"<div><p>Accurate predictions of hourly PM<sub>2.5</sub> concentrations are crucial for preventing the harmful effects of air pollution. In this study, a new decomposition-ensemble framework incorporating the variational mode decomposition method (VMD), econometric forecasting method (autoregressive integrated moving average model, ARIMA), and deep learning techniques (convolutional neural networks (CNN) and temporal convolutional network (TCN)) was developed to model the data characteristics of hourly PM<sub>2.5</sub> concentrations. Taking the PM<sub>2.5</sub> concentration of Lanzhou, Gansu Province, China as the sample, the empirical results demonstrated that the developed decomposition-ensemble framework is significantly superior to the benchmarks with the econometric model, machine learning models, basic deep learning models, and traditional decomposition-ensemble models, within one-, two-, or three-step-ahead. This study verified the effectiveness of the new prediction framework to capture the data patterns of PM<sub>2.5</sub> concentration and can be employed as a meaningful PM<sub>2.5</sub> concentrations prediction tool.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49749479","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qing Zhu , Yinglin Ruan , Shan Liu , Sung-Byung Yang , Lin Wang , Jianhua Che
{"title":"Cross-border electronic commerce’s new path: from literature review to AI text generation","authors":"Qing Zhu , Yinglin Ruan , Shan Liu , Sung-Byung Yang , Lin Wang , Jianhua Che","doi":"10.1016/j.dsm.2022.12.001","DOIUrl":"https://doi.org/10.1016/j.dsm.2022.12.001","url":null,"abstract":"<div><p>Digitization, informatization, and Internet penetration have led to a significant rise in cross-border e-commerce (CBEC), attracting considerable interest from academia, government, and industry. This study employed a novel method combining automatic text generation technology and traditional bibliometric analysis to summarize and categorize the research on CBEC evolution from 2000 to 2021. Articles were selected and examined with a focus on four dimensions: customer, risk, supply chain, and platform. Contradictions in these dimensions were found to result in two major obstacles to CBEC development, namely, dataset sharing and platform scalability. These obstacles prevent research on cross-border platforms from moving beyond theory-based studies. Further research needs to examine how soft computing can be used to accelerate and remodel the global trade ecosystem.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49749901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Boluwaji A. Akinnuwesi , Kehinde A. Olayanju , Benjamin S. Aribisala , Stephen G. Fashoto , Elliot Mbunge , Moses Okpeku , Patrick Owate
{"title":"Application of support vector machine algorithm for early differential diagnosis of prostate cancer","authors":"Boluwaji A. Akinnuwesi , Kehinde A. Olayanju , Benjamin S. Aribisala , Stephen G. Fashoto , Elliot Mbunge , Moses Okpeku , Patrick Owate","doi":"10.1016/j.dsm.2022.10.001","DOIUrl":"https://doi.org/10.1016/j.dsm.2022.10.001","url":null,"abstract":"<div><p>Prostate cancer (PCa) symptoms are commonly confused with benign prostate hyperplasia (BPH), particularly in the early stages due to similarities between symptoms, and in some instances, underdiagnoses. Clinical methods have been utilized to diagnose PCa; however, at the full-blown stage, clinical methods usually present high risks of complicated side effects. Therefore, we proposed the use of support vector machine for early differential diagnosis of PCa (SVM-PCa-EDD). SVM was used to classify persons with and without PCa. We used the PCa dataset from the Kaggle Healthcare repository to develop and validate SVM model for classification. The PCa dataset consisted of 250 features and one class of features. Attributes considered in this study were age, body mass index (BMI), race, family history, obesity, trouble urinating, urine stream force, blood in semen, bone pain, and erectile dysfunction. The SVM-PCa-EDD was used for preprocessing the PCa dataset, specifically dealing with class imbalance, and for dimensionality reduction. After eliminating class imbalance, the area under the receiver operating characteristic (ROC) curve (AUC) of the logistic regression (LR) model trained with the downsampled dataset was 58.4%, whereas that of the AUC-ROC of LR trained with the class imbalance dataset was 54.3%. The SVM-PCa-EDD achieved 90% accuracy, 80% sensitivity, and 80% specificity. The validation of SVM-PCa-EDD using random forest and LR showed that SVM-PCa-EDD performed better in early differential diagnosis of PCa. The proposed model can assist medical experts in early diagnosis of PCa, particularly in resource-constrained healthcare settings and making further recommendations for PCa testing and treatment.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49765234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The influence of e-commerce live streaming affordance on consumer’s gift-giving and purchase intention","authors":"Yunfan Lu, Yucheng He, Yifei Ke","doi":"10.1016/j.dsm.2022.10.002","DOIUrl":"https://doi.org/10.1016/j.dsm.2022.10.002","url":null,"abstract":"<div><p>In e-commerce live streaming, sellers choose the most suitable streamers to endorse their products. The streamer introduces the main functions of the goods, organizes marketing activities, improves the consumers’ shopping experience, and finally facilitates transactions and obtains gifts. However, the formation mechanism of guanxi between streamers and consumers remain unclear. Based on affordance theory, this study uses structural equations to empirically study the decision-making mechanism of consumer gift-giving and purchase behavior in e-commerce live streaming. The study finds that affective affordance and cognitive affordance have positive impacts on swift guanxi; swift guanxi is an antecedent of consumers’ purchase intention and gift-giving intention.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49765240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparison of administrative and regulatory green technologies development between China and the U.S. based on patent analysis","authors":"Mengshu Liu, Ju’e Guo, Dan Bi","doi":"10.1016/j.dsm.2023.01.001","DOIUrl":"https://doi.org/10.1016/j.dsm.2023.01.001","url":null,"abstract":"<div><p>With the increasing importance of computer intelligence in the new round of the industrial revolution, administrative, regulatory, or design (ARD) green technology contributes to improving national technological competitiveness and promoting the transformation of green technology, which is becoming an important field under sustainable development goals. The U.S. and China ranked top two in terms of paper influence and patent applications in the field of ARD green technology. However, few comparative studies have been conducted in these two countries. This study presents the evolution and landscapes of ARD green technology between China and the U.S., focusing on comparing development priorities and technical layouts in each five-year plan period. According to the “International Patent Classification (IPC) Green Inventory” launched by the World Intellectual Property Organization (WIPO), we retrieved 69,412 patents published between 2001 and 2020 from the PatSnap database. Descriptive, content, and thematic network analyses were conducted using latent dirichlet allocation (LDA) and community detection algorithms. The results show that both China and the U.S. strategically focus on ARD green technology development. The technical topics in this field can be divided into three themes: data processing systems, traffic control systems, and building designs. The emphasis on technology research and development (R&D) differs between China and the U.S. There is also evidence that the U.S. has advantages in terms of technological innovation and capabilities. However, China has an advantage in terms of data volume, and the gap between China and the U.S. is gradually narrowing. We also highlight the contributions and limitations of this study.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49758594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Two-phase tabu search algorithm for solving Chinese high school timetabling problems under the new college entrance examination reform","authors":"Zhe Sun, Qinghua Wu","doi":"10.1016/j.dsm.2023.02.001","DOIUrl":"https://doi.org/10.1016/j.dsm.2023.02.001","url":null,"abstract":"<div><p>Upon the latest reform to the college entrance examination in China (i.e., Gaokao), high schools began implementing an optional class system. Under this scheme, students’ time slots become complex, thereby increasing the difficulty in formulating a suitable timetable from the available ones. To address this problem, the course-scheduling model was improved. On the basis of the original hard constraints, the “concurrent group” was considered, and the softer constraints were regarded as optimization goals, such as “teaching plans synchronously”, “no idle periods in the timetables of teachers”, and “evenly distributed lessons”. Given these soft constraints, the model becomes more practical. In this study, a two-phase tabu search algorithm was proposed to solve the problem. The proposed algorithm uses the characteristics of the graph coloring model to eliminate redundant calculations in the neighborhood search process, thereby effectively improving computational efficiency. Fifteen practical instances of different scales were selected for testing to verify the effectiveness of the algorithm. The proposed algorithm can formulate high-quality available timetables (The average satisfaction rate of soft constraints is more than 71%) in a short period.</p></div>","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49749484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"VERS UN FILTRE ÉTHIQUE POUR LES IA CHARGÉES DE LA VENTE ?","authors":"Thomas Michaud","doi":"10.36863/mds.a.25456","DOIUrl":"https://doi.org/10.36863/mds.a.25456","url":null,"abstract":"L’éthique de la vente est de plus en plus nécessaire pour garantir la réputation des entreprises auprès des clients, mais aussi pour assurer la fidélité des vendeurs à leurs employeurs. Les progrès de l’intelligence artificielle dans le secteur du marketing promettent de révolutionner les pratiques dans un futur proche. Il convient donc de s’interroger sur l’éventualité d’intégrer un filtre éthique à ces technologies. À l’image des trois lois de la robotique d’Asimov, il conviendrait de réguler les comportements des IA, afin d’orienter leurs analyses et leur rationalité artificielle vers un respect accru de l’humanité des clients et des écosystèmes.","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135107362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"L’IMPACT DE LA TRANSFORMATION NUMÉRIQUE DANS LES ADMINISTRATIONS PUBLIQUES AFRICAINES","authors":"Jean Babei, MOYO NZOLOLO","doi":"10.36863/mds.a.25559","DOIUrl":"https://doi.org/10.36863/mds.a.25559","url":null,"abstract":"La massification des technologies de l’information ne laisse personne indifférent. Des simples logiciels métiers aux progiciels de gestion intégrée, en passant par les réseaux sociaux numériques et l’intelligence artificielle; ces technologies sont devenues incontournables. L’objectif de cet article est d’analyser les perceptions des agents publics face au déploiement du numérique. Une étude a été conduite au Cameroun, auprès d’un échantillon d’agents publics. Une analyse en composantes principales a été réalisée. Elle a été suivie par des analyses unies variées. Les principaux résultats indiquent dans des proportions importantes que les technologies de l’information sont sources de transparence et de traçabilité; et qu’elles diminuent les comportements opportunistes. En revanche, ils sont responsables de troubles de santé et constituent une menace à la souveraineté des données.","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135213599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"COMMENT RENDRE LE MANAGEMENT ALGORITHMIQUE CAPACITANT ?","authors":"SOPHIA GALIÈRE","doi":"10.36863/mds.a.25594","DOIUrl":"https://doi.org/10.36863/mds.a.25594","url":null,"abstract":"Les outils de management algorithmique sont de plus en plus utilisés pour coordonner le travail, mais l'approche actuelle se concentre sur les opportunités de contrôle accru via le monitoring en temps réel et l'envoi automatique de directives personnalisées. Cet article propose des axes de réflexion visant à dépasser l'association du management algorithmique avec la déshumanisation du travail, en enjoignant les managers à reconnaître le pouvoir d’agir comme condition essentielle à la performance organisationnelle. Nos recommandations pour rendre le management capacitant incluent la promotion d'une approche délibérative de co-construction des outils IA, la promotion de la littératie algorithmique, et la mise en place de mécanismes de compensation du contrôle par des dispositifs relationnels pour apporter une assistance aux travailleurs.","PeriodicalId":100353,"journal":{"name":"Data Science and Management","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135319566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}