Machine learning with applications最新文献

筛选
英文 中文
ChatReview: A ChatGPT-enabled natural language processing framework to study domain-specific user reviews ChatReview:支持 ChatGPT 的自然语言处理框架,用于研究特定领域的用户评论
Machine learning with applications Pub Date : 2023-12-28 DOI: 10.1016/j.mlwa.2023.100522
Brittany Ho, Ta’Rhonda Mayberry, Khanh Linh Nguyen, Manohar Dhulipala, Vivek Krishnamani Pallipuram
{"title":"ChatReview: A ChatGPT-enabled natural language processing framework to study domain-specific user reviews","authors":"Brittany Ho,&nbsp;Ta’Rhonda Mayberry,&nbsp;Khanh Linh Nguyen,&nbsp;Manohar Dhulipala,&nbsp;Vivek Krishnamani Pallipuram","doi":"10.1016/j.mlwa.2023.100522","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100522","url":null,"abstract":"<div><p>Intelligent search engines including pre-trained generative transformers (GPT) have revolutionized the user search experience. Several fields including e-commerce, education, and hospitality are increasingly exploring GPT tools to study user reviews and gain critical insights to improve their service quality. However, massive user-review data and imprecise prompt engineering lead to biased, irrelevant, and impersonal search results. In addition, exposing user data to these search engines may pose privacy issues. Motivated by these factors, we present ChatReview, a ChatGPT-enabled natural language processing (NLP) framework that effectively studies domain-specific user reviews to offer relevant and personalized search results at multiple levels of granularity. The framework accomplishes this task using four phases including data collection, tokenization, query construction, and response generation. The data collection phase involves gathering domain-specific user reviews from public and private repositories. In the tokenization phase, ChatReview applies sentiment analysis to extract keywords and categorize them into various sentiment classes. This process creates a token repository that best describes the user sentiments for a given user-review data. In the query construction phase, the framework uses the token repository and domain knowledge to construct three types of ChatGPT prompts including explicit, implicit, and creative. In the response generation phase, ChatReview pipelines these prompts into ChatGPT to generate search results at varying levels of granularity. We analyze our framework using three real-world domains including education, local restaurants, and hospitality. We assert that our framework simplifies prompt engineering for general users to produce effective results while minimizing the exposure of sensitive user data to search engines. We also present a one-of-a-kind Large Language Model (LLM) peer assessment of the ChatReview framework. Specifically, we employ Google’s Bard to objectively and qualitatively analyze the various ChatReview outputs. Our Bard-based analyses yield over 90% satisfaction, establishing ChatReview as a viable survey analysis tool.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100522"},"PeriodicalIF":0.0,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000750/pdfft?md5=82dd36b16ed5d43b7a9134111f9ce072&pid=1-s2.0-S2666827023000750-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139109222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning nonlinear integral operators via recurrent neural networks and its application in solving integro-differential equations 通过递归神经网络学习非线性积分算子及其在求解积分微分方程中的应用
Machine learning with applications Pub Date : 2023-12-27 DOI: 10.1016/j.mlwa.2023.100524
Hardeep Bassi , Yuanran Zhu , Senwei Liang , Jia Yin , Cian C. Reeves , Vojtěch Vlček , Chao Yang
{"title":"Learning nonlinear integral operators via recurrent neural networks and its application in solving integro-differential equations","authors":"Hardeep Bassi ,&nbsp;Yuanran Zhu ,&nbsp;Senwei Liang ,&nbsp;Jia Yin ,&nbsp;Cian C. Reeves ,&nbsp;Vojtěch Vlček ,&nbsp;Chao Yang","doi":"10.1016/j.mlwa.2023.100524","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100524","url":null,"abstract":"<div><p>In this paper, we propose using LSTM-RNNs (Long Short-Term Memory-Recurrent Neural Networks) to learn and represent nonlinear integral operators that appear in nonlinear integro-differential equations (IDEs). The LSTM-RNN representation of the nonlinear integral operator allows us to turn a system of nonlinear integro-differential equations into a system of ordinary differential equations for which many efficient solvers are available. Furthermore, because the use of LSTM-RNN representation of the nonlinear integral operator in an IDE eliminates the need to perform a numerical integration in each numerical time evolution step, the overall temporal cost of the LSTM-RNN-based IDE solver can be reduced to <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msub><mrow><mi>n</mi></mrow><mrow><mi>T</mi></mrow></msub><mo>)</mo></mrow></mrow></math></span> from <span><math><mrow><mi>O</mi><mrow><mo>(</mo><msubsup><mrow><mi>n</mi></mrow><mrow><mi>T</mi></mrow><mrow><mn>2</mn></mrow></msubsup><mo>)</mo></mrow></mrow></math></span> if a <span><math><msub><mrow><mi>n</mi></mrow><mrow><mi>T</mi></mrow></msub></math></span>-step trajectory is to be computed. We illustrate the efficiency and robustness of this LSTM-RNN-based numerical IDE solver with a model problem. Additionally, we highlight the generalizability of the learned integral operator by applying it to IDEs driven by different external forces. As a practical application, we show how this methodology can effectively solve the Dyson’s equation for quantum many-body systems.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100524"},"PeriodicalIF":0.0,"publicationDate":"2023-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000774/pdfft?md5=e7dece6581dbf391df9d505308d5c9d8&pid=1-s2.0-S2666827023000774-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139109224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatiotemporal integration of GCN and E-LSTM networks for PM2.5 forecasting GCN 和 E-LSTM 网络的时空整合用于 PM2.5 预报
Machine learning with applications Pub Date : 2023-12-15 DOI: 10.1016/j.mlwa.2023.100521
Ali Kamali Mohammadzadeh , Halima Salah , Roohollah Jahanmahin , Abd E Ali Hussain , Sara Masoud , Yaoxian Huang
{"title":"Spatiotemporal integration of GCN and E-LSTM networks for PM2.5 forecasting","authors":"Ali Kamali Mohammadzadeh ,&nbsp;Halima Salah ,&nbsp;Roohollah Jahanmahin ,&nbsp;Abd E Ali Hussain ,&nbsp;Sara Masoud ,&nbsp;Yaoxian Huang","doi":"10.1016/j.mlwa.2023.100521","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100521","url":null,"abstract":"<div><p>PM<sub>2.5</sub>, inhalable particles, with a size of 2.5 micrometers or less, negatively impact the environment as well as our health. Monitoring PM<sub>2.5</sub> is critical to guard against extreme events by alerting people and initiating actions to alleviate PM<sub>2.5′</sub>s impacts. Developing PM<sub>2.5</sub> forecasting frameworks empowers the authorities to predict extremely polluted events in advance and gives them time to implement necessary strategies in advance (e.g., Action! Days). Understanding the spatiotemporal behavior of PM<sub>2.5</sub> and meteorological factors is of significance for having accurate predictions. This study utilizes EPA sensor data to quantify the PM<sub>2.5</sub> air quality index (AQI) and meteorological factors such as temperature over 2015–2019 across Michigan, USA. Here, a spatiotemporal deep neural structure is proposed through integrating graph convolutional neural (GCN) and exogenous long short-term memory (E-LSTM) networks to incorporate spatial and temporal patterns within PM<sub>2.5</sub> AQI and meteorological factors for predicting PM<sub>2.5</sub> AQI. Results illustrate that not only does our proposed framework outperform the traditional approaches such as LSTM and E-LSTM, but also it is robust against the network structure of EPA stations. The study's findings demonstrate that the integration of GCN with E-LSTM significantly enhances the accuracy of PM<sub>2.5</sub> AQI predictions compared to traditional models. This advancement indicates a promising direction for environmental monitoring, offering improved forecasting tools that can aid in timely and effective decision-making for air quality management and public health protection.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100521"},"PeriodicalIF":0.0,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000749/pdfft?md5=c4a01ec68a9b50044fed32950ca15dc5&pid=1-s2.0-S2666827023000749-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138839832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving top-N recommendations using batch approximation for weighted pair-wise loss 使用批量近似加权配对损失改进 Top-N 推荐
Machine learning with applications Pub Date : 2023-12-13 DOI: 10.1016/j.mlwa.2023.100520
Sofia Aftab, Heri Ramampiaro
{"title":"Improving top-N recommendations using batch approximation for weighted pair-wise loss","authors":"Sofia Aftab,&nbsp;Heri Ramampiaro","doi":"10.1016/j.mlwa.2023.100520","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100520","url":null,"abstract":"<div><p>In collaborative filtering, matrix factorization and collaborative metric learning are challenged by situations where non-preferred items may appear so close to a user in the feature embedding space that they lead to degrading the recommendation performance. We call such items ‘potential impostor’ risks. Addressing the issues with ‘potential impostor’ is important because it can result in inefficient learning and poor feature extraction. To achieve this, we propose a novel loss function formulation designed to enhance learning efficiency by actively identifying and addressing impostors, leveraging item associations and learning the distribution of negative items. This approach is crucial for models to differentiate between positive and negative items effectively, even when they are closely aligned in the feature space. Here, a loss function is generally an objective optimization function that is defined based on user–item interaction data, through either implicit or explicit feedback. The loss function essentially decides how well a recommendation algorithm performs. In this paper, we introduce and define the concept of ‘potential impostor’, highlighting its impact on learned representation quality and algorithmic efficiency. We tackle the limitations of non-metric methods, like the Weighted Approximate Rank Pairwise Loss (WARP) method, which struggles to capture item–item similarities, by using a ‘similarity propagation’ strategy with a new loss term. Similarly, we address fixed margin inefficiencies in Weighted Collaborative Metric Learning (WCML), through density distribution approximation. This moves potential impostors away from the margin for more robust learning. Additionally, we propose a large-scale batch approximation algorithm for increased detection of impostors, coupled with an active learning strategy for improved top-<span><math><mi>N</mi></math></span> recommendation performance. Our extensive empirical analysis across five major and diverse datasets demonstrates the effectiveness and feasibility of our methods, compared to existing techniques with respect to improving AUC, reducing impostor rate, and increasing the average distance metrics. More specifically, our evaluation shows that our two proposed methods outperform the existing state-of-the-art techniques, with an improvement of AUC by 3.5% and 3.7%, NDCG by 1.0% and 9.1% and HR by 1.3% and 3.6%, respectively. Similarly, the impostor rate is decreased by 35% and 18%, and their average distance is increased by 33% and 37%, respectively.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100520"},"PeriodicalIF":0.0,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000737/pdfft?md5=9ed329936dd4420c5fffd4c4464c6908&pid=1-s2.0-S2666827023000737-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138656881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive survey on machine learning applications for drilling and blasting in surface mining 露天采矿中钻孔和爆破的机器学习应用综合调查
Machine learning with applications Pub Date : 2023-12-11 DOI: 10.1016/j.mlwa.2023.100517
Venkat Munagala , Srikanth Thudumu , Irini Logothetis , Sushil Bhandari , Rajesh Vasa , Kon Mouzakis
{"title":"A comprehensive survey on machine learning applications for drilling and blasting in surface mining","authors":"Venkat Munagala ,&nbsp;Srikanth Thudumu ,&nbsp;Irini Logothetis ,&nbsp;Sushil Bhandari ,&nbsp;Rajesh Vasa ,&nbsp;Kon Mouzakis","doi":"10.1016/j.mlwa.2023.100517","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100517","url":null,"abstract":"<div><p>Drilling and blasting operations are pivotal for productivity and safety in hard rock surface mining. These operations are restricted due to complexities such as site-specific uncertainties, safety risks, and environmental and economic constraints. Machine Learning (ML) is a transformative approach to tackle these complexities resulting in significant cost reductions. ML applications can reduce overall blasting costs by up to 23% and decrease the amount of explosives by as much as 89% compared to traditional methods. This survey presents a comprehensive review of how ML can be applied to optimize drill and blast designs while accounting for its operational challenges. Our research highlights the difficulties in collecting quality site-specific data, the complexity of interpreting this data into insightful information, the selection of ML models relating to mining objectives, and the need for established methods to assess blast efficiency quantitatively. We provide a synthesis of ML model development practices in drilling and blasting and demonstrate the value of ML methodologies. Based on our survey, we present actionable recommendations for developing ML methodologies to improve safety, reduce costs, and enhance efficiency in drilling and blasting processes. This includes establishing standardized data schematics, multiobjective model optimization, and comprehensive evaluation metrics. These benefits can guide mine management and engineers to adopt ML techniques and improve on-ground operational practices. This survey aims to serve as a resource for both practitioners and researchers shaping the future research direction in ML applications for drilling and blasting practices.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100517"},"PeriodicalIF":0.0,"publicationDate":"2023-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000701/pdfft?md5=5f0687a7294aa89ab47bf61d3c8a02e7&pid=1-s2.0-S2666827023000701-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138577447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An inpatient fall risk assessment tool: Application of machine learning models on intrinsic and extrinsic risk factors 住院病人跌倒风险评估工具:将机器学习模型应用于内在和外在风险因素
Machine learning with applications Pub Date : 2023-12-08 DOI: 10.1016/j.mlwa.2023.100519
Sonia Jahangiri, Masoud Abdollahi, Rasika Patil, Ehsan Rashedi, Nasibeh Azadeh-Fard
{"title":"An inpatient fall risk assessment tool: Application of machine learning models on intrinsic and extrinsic risk factors","authors":"Sonia Jahangiri,&nbsp;Masoud Abdollahi,&nbsp;Rasika Patil,&nbsp;Ehsan Rashedi,&nbsp;Nasibeh Azadeh-Fard","doi":"10.1016/j.mlwa.2023.100519","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100519","url":null,"abstract":"<div><h3>Background</h3><p>This study aimed to identify the most impactful set of intrinsic and extrinsic fall risk factors and develop a data-driven inpatient fall risk assessment tool (FRAT).</p></div><div><h3>Methods</h3><p>The dataset used for the study comprised in-hospital fall records from 2012 to 2017. Four machine learning (ML) algorithms, Support Vector Machine (SVM), Random Forest (RF), Gradient Boosting (Gboost), and Deep Neural Network (DNN) were utilized to predict the inpatient fall risk level. To enhance the performance of the prediction models, two approaches were implemented, including (1) feature selection to identify the optimal feature set and (2) the development of three distinct shift-wise models. Furthermore, the optimal feature sets in the shift-wise models were extracted.</p></div><div><h3>Results</h3><p>According to the results, DNN outperformed other methods by reaching an accuracy, sensitivity, specificity, and AUC of 0.71, 0.8, 0.6, and 0.7, respectively, considering the full set of features. The performance of the models was further improved (by 3-5 %) by conducting a feature selection process for all models. Specifically, the DNN model achieved an accuracy of 0.74 while considering the optimal set of predictors. Moreover, the shift-wise RF models demonstrated higher accuracies (by 4-10 %) compared to the same model using a full feature set.</p></div><div><h3>Conclusions</h3><p>This study's outcome confirms ML models' compelling capability in developing an inpatient FRAT while considering intrinsic and extrinsic factors. The insight from such models could form a foundation to (1) monitor the inpatients’ fall risk, (2) identify the major factors involved in inpatient falls, and (3) create subject-specific self-care plans.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100519"},"PeriodicalIF":0.0,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000725/pdfft?md5=4a2d470f18c0e62a81481dcea8beab5f&pid=1-s2.0-S2666827023000725-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138582125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia 弗吉尼亚州诺福克市街头洪水的机器学习替代模型的比较
Machine learning with applications Pub Date : 2023-11-29 DOI: 10.1016/j.mlwa.2023.100518
Diana McSpadden , Steven Goldenberg , Binata Roy , Malachi Schram , Jonathan L. Goodall , Heather Richter
{"title":"A comparison of machine learning surrogate models of street-scale flooding in Norfolk, Virginia","authors":"Diana McSpadden ,&nbsp;Steven Goldenberg ,&nbsp;Binata Roy ,&nbsp;Malachi Schram ,&nbsp;Jonathan L. Goodall ,&nbsp;Heather Richter","doi":"10.1016/j.mlwa.2023.100518","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100518","url":null,"abstract":"<div><p>Low-lying coastal cities, exemplified by Norfolk, Virginia, face the challenge of street flooding caused by rainfall and tides, which strain transportation and sewer systems and can lead to personal and property damage. While high-fidelity, physics-based simulations provide accurate predictions of urban pluvial flooding, their computational complexity renders them unsuitable for real-time applications. Using data from Norfolk rainfall events between 2016 and 2018, this study compares the performance of a previous surrogate model based on a random forest algorithm with two deep learning models: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). The comparison of deep learning to the random forest algorithm is motivated by the desire to utilize a machine learning architecture that allows for the future inclusion of common uncertainty quantification techniques and the effective integration of relevant, multi-modal features.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100518"},"PeriodicalIF":0.0,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000713/pdfft?md5=0b7cf337c2324211c4a189837657851e&pid=1-s2.0-S2666827023000713-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138466741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considerations for adapting higher education technology courses for AI large language models: A critical review of the impact of ChatGPT 为人工智能大型语言模型调整高等教育技术课程的考虑因素:对 ChatGPT 影响的批判性评论
Machine learning with applications Pub Date : 2023-11-25 DOI: 10.1016/j.mlwa.2023.100513
Omar Tayan , Ali Hassan , Khaled Khankan , Sanaa Askool
{"title":"Considerations for adapting higher education technology courses for AI large language models: A critical review of the impact of ChatGPT","authors":"Omar Tayan ,&nbsp;Ali Hassan ,&nbsp;Khaled Khankan ,&nbsp;Sanaa Askool","doi":"10.1016/j.mlwa.2023.100513","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100513","url":null,"abstract":"<div><p>Following the very recent launch of the ChatGPT chatbot, numerous comments and speculations were posted concerning the potential aspects of society that are expected to benefit from this AI revolution. In particular, the education sector is considered as one of the primary domains affected by this application, the impact of which remains yet to be fully understood. Furthermore, many Higher Education institutions are required to get to terms with its impact on teaching and learning, and to clarify their stances on the use of ChatGPT software. This study was developed to investigate some critical case studies considered as relevant to the inevitable re-evaluation of educational aspects needed, ranging from academic missions to student and course learning outcomes and its ethical uses. Following a review of some of the pros and cons of ChatGPT in the higher educational sector, this paper shall demonstrate several case studies of early trials in teaching and learning assessments related to various specializations. Next, the ability of some well-known AI detector software and analyzed in terms of their capacity to successfully detect AI-generated content. Analysis shall be made of the foreseen impact on important aspects including challenges and benefits related to its use in course assessments as well as academic integrity and ethical use. The study concludes with a set of recommendations made from our findings and benchmarks obtained from top universities in order to assist faculty members and decision makers at Higher Education institutions concerning their response strategy and use of ChatGPT.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100513"},"PeriodicalIF":0.0,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266682702300066X/pdfft?md5=e38f604707a5a8e7810d6cdf19695918&pid=1-s2.0-S266682702300066X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138489575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning for an explainable cost prediction of medical insurance 用于医疗保险可解释成本预测的机器学习
Machine learning with applications Pub Date : 2023-11-24 DOI: 10.1016/j.mlwa.2023.100516
Ugochukwu Orji , Elochukwu Ukwandu
{"title":"Machine learning for an explainable cost prediction of medical insurance","authors":"Ugochukwu Orji ,&nbsp;Elochukwu Ukwandu","doi":"10.1016/j.mlwa.2023.100516","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100516","url":null,"abstract":"<div><p>Predictive modeling in healthcare continues to be an active actuarial research topic as more insurance companies aim to maximize the potential of Machine Learning (ML) approaches to increase their productivity and efficiency. In this paper, the authors deployed three regression-based ensemble ML models that combine variations of decision trees through Extreme Gradient Boosting (XGBoost), Gradient-boosting Machine (GBM), and Random Forest (RF) methods in predicting medical insurance costs. Explainable Artificial Intelligence (XAi) methods SHapley Additive exPlanations (SHAP) and Individual Conditional Expectation (ICE) plots were deployed to discover and explain the key determinant factors that influence medical insurance premium prices in the dataset. The dataset used comprised 986 records and is publicly available in the KAGGLE repository. The models were evaluated using four performance evaluation metrics, including R-squared (R<sup>2</sup>), Mean Absolute Error (MAE), Root Mean Squared Error (RMSE), and Mean Absolute Percentage Error (MAPE). The results show that all models produced impressive outcomes; however, the XGBoost model achieved a better overall performance although it also expanded more computational resources, while the RF model recorded a lesser prediction error and consumed far fewer computing resources than the XGBoost model. Furthermore, we compared the outcome of both XAi methods in identifying the key determinant features that influenced the PremiumPrices for each model and whereas both XAi methods produced similar outcomes, we found that the ICE plots showed in more detail the interactions between each variable than the SHAP analysis which seemed to be more high-level. It is the aim of the authors that the contributions of this study will help policymakers, insurers, and potential medical insurance buyers in their decision-making process for selecting the right policies that meet their specific needs.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"15 ","pages":"Article 100516"},"PeriodicalIF":0.0,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000695/pdfft?md5=fcd73094ae2ec3d7f5d01f086997c258&pid=1-s2.0-S2666827023000695-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138480780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting aggression in clinical treatment videos 在临床治疗视频中检测攻击性
Machine learning with applications Pub Date : 2023-11-22 DOI: 10.1016/j.mlwa.2023.100515
Walker S. Arce , Seth G. Walker , Jordan DeBrine , Benjamin S. Riggan , James E. Gehringer
{"title":"Detecting aggression in clinical treatment videos","authors":"Walker S. Arce ,&nbsp;Seth G. Walker ,&nbsp;Jordan DeBrine ,&nbsp;Benjamin S. Riggan ,&nbsp;James E. Gehringer","doi":"10.1016/j.mlwa.2023.100515","DOIUrl":"https://doi.org/10.1016/j.mlwa.2023.100515","url":null,"abstract":"<div><p>Many clinical spaces are outfitted with centralized video recording systems to monitor patient–client interactions. Considering the increasing interest in video-based machine learning methods, the potential of using these clinical recordings to automate observational data collection is apparent. To explore this, seven patients had videos of their functional assessment and treatment sessions annotated by coders trained by our clinical team. Commonly used clinical software has inherent limitations aligning behavioral and video data, so a custom software tool was employed to address this functionality gap. After developing a Canvas-based coder training course for this tool, a team of six trained coders annotated 82.33 h of data. Two machine learning approaches were considered, where both used a convolutional neural network as a video feature extractor. The first approach used a recurrent network as the classifier on the extracted features and the second used a Transformer architecture. Both models produced promising metrics indicating that the capability of detecting aggression from clinical videos is possible and generalizable. Model performance is directly tied to the feature extractor’s performance on ImageNet, where ConvNeXtXL produced the best performing models. This has applications in automating patient incident response to improve patient and clinician safety and could be directly integrated into existing video management systems for real-time analysis.</p></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"14 ","pages":"Article 100515"},"PeriodicalIF":0.0,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666827023000683/pdfft?md5=7be193e80aa9244b29f8609ccc55e9e6&pid=1-s2.0-S2666827023000683-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138355934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信