Sudhashree Sayenju, Ramazan S. Aygun, Bill Franks, Sereres Johnston, George Lee, Hansook Choi, Girish Modgil
{"title":"Quantifying Domain Knowledge in Large Language Models","authors":"Sudhashree Sayenju, Ramazan S. Aygun, Bill Franks, Sereres Johnston, George Lee, Hansook Choi, Girish Modgil","doi":"10.1109/CAI54212.2023.00091","DOIUrl":"https://doi.org/10.1109/CAI54212.2023.00091","url":null,"abstract":"Transformer based Large language models such as BERT, have demonstrated the ability to derive contextual information from the words surrounding it. However, when these models are applied in specific domains such as medicine, insurance, or scientific disciplines, publicly available models trained on general knowledge sources such as Wikipedia, it may not be as effective in inferring the appropriate context compared to domain-specific models trained on specialized corpora. Given the limited availability of training data for specific domains, pre-trained models can be fine-tuned via transfer learning using relatively small domain-specific corpora. However, there is currently no standardized method for quantifying the effectiveness of these domain-specific models in acquiring the necessary domain knowledge. To address this issue, we explore hidden layer embeddings and introduce domain_gain, a measure to quantify the ability of a model to infer the correct context. In this paper, we show how our measure could be utilized to determine whether words with multiple meanings are more likely to be associated with domain-related meanings rather than their colloquial meanings.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132646416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Blended Temperature Forecasting Model for Thailand Using Multiple Data Sources","authors":"Sukrit Jaidee, Walanchaporn Boon-Nontae, Weerayut Srithiam","doi":"10.1109/cai54212.2023.00141","DOIUrl":"https://doi.org/10.1109/cai54212.2023.00141","url":null,"abstract":"With the escalation in the cost of electricity, there has been a noticeable inclination towards the installation of solar photovoltaic (PV) systems in multiple regions across Thailand. The increase in PV installations has led to an electricity demand that fluctuates depending on the prevailing weather conditions, creating challenges in managing and regulating electricity demand. In order to support electricity regulators in managing the fluctuations, it is crucial to implement a solar power forecasting system for individual households. One of the critical variables in forecasting solar power generation, besides solar irradiance, is temperature. This study introduces a temperature prediction system for every geographic location in Thailand at a 10x magnification level, which provided an hourly temperature for each location in the country. The proposed model integrated input data from three open-source platforms, namely Meteostat, Weatherapi, and IBM Weather. Utilizing the capabilities of each input source, the deep learning model was employed. The system, powered by the proposed model, achieved a Mean Squared Error (MSE) of 1.17 °C when compared to the actual data acquired from the Meteorological Department of Thailand.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131287885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Structural health monitoring of steel moment frame buildings via sequence-based recurrent neural networks","authors":"Khashayar Heydarpour, Doeun Choe, Kyungyong Chung","doi":"10.1109/CAI54212.2023.00154","DOIUrl":"https://doi.org/10.1109/CAI54212.2023.00154","url":null,"abstract":"Signal-based damage detection has gained extensive attention in recent years due to its capability in improving the deficiencies of previous structural health monitoring methods. Deep learning, with high capabilities in feature learning, has emerged as a powerful tool for sequence classification. In this paper, sequence-based deep learning models using long short-term memory (LSTM) and gated recurrent units (GRU) networks are used to detect structural damages and damage locations applied to steel building structures. To propose an appropriate deep-learning method for structural health monitoring, sets of monitoring data from the IASC-ASCE benchmark building were used. The data was collected from 15 sensors to collect accelerations attached to a 4-story steel moment frame building. The data has been properly pre-processed, denoised, sliced, and normalized. K-fold cross-validation validation is performed. The networks are designed using various combinations of recurrent neural networks, such as LSTM and GRU. It is concluded that stacked multilayer bidirectional long short-term memory networks, with an accuracy of 98%, have a superior performance in detecting the presence and location of structural damage.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131304621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dudnyk Oleksii, Sokolovska Zoia, A. Gegov, Farzad Arabikhan
{"title":"Forecasting IT Industry trends using a Fuzzy Decision Support System","authors":"Dudnyk Oleksii, Sokolovska Zoia, A. Gegov, Farzad Arabikhan","doi":"10.1109/cai54212.2023.00086","DOIUrl":"https://doi.org/10.1109/cai54212.2023.00086","url":null,"abstract":"The Information Technology (IT) industry plays a vital role in the global economy and its significance will only continue to grow as digitalization accelerates. During times of increased entropy resulting from crises or military conflicts, the stable functioning of the IT industry is of paramount importance. This study explores the possibility of using a fuzzy Decision Support System (DSS) to forecast and predict how the IT industry in a specific country will operate.For the purpose of demonstration, we use self-developed FuzzyKIDE fuzzy DSS software platform, which employs fuzzy logic to provide a reasonable assessment even when the information is imprecise. We propose a DSS structure that utilizes a combined model of semantic networks and fuzzy implication rules. To demonstrate its effectiveness, we analyze Ukraine's IT industry at various time points and compare the results with historical data.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124406925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Dong Quan Vu, Y. Marnissi, S. Razakarivony, M. Nocture
{"title":"A constrained Langevin-adapted Particle Filter for Aircraft Engines’ Health Monitoring","authors":"Dong Quan Vu, Y. Marnissi, S. Razakarivony, M. Nocture","doi":"10.1109/CAI54212.2023.00087","DOIUrl":"https://doi.org/10.1109/CAI54212.2023.00087","url":null,"abstract":"We examine the application of particle filter in estimating performance indicators of an aircraft engine; these indicators are a crucial aspect in health monitoring and condition-based maintenance for aeronautics. This approach is flexible and not restricted by rigid assumptions often found in other methods; however, it poses three challenges in our context: (i) high computation cost: classical particle filters require a large number of particles, each of them calling a heavy model; (ii) non-observability: in our system, different system states might provide the same measurements; (iii) constraints: constraints on the estimation are required to be integrated dynamically. We propose a version of particle filter, based on Langevin dynamics, to resolve these challenges.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"32 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123284010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sang Mee Lee, M. Chung, Zero Kim, K. Lee, Da Eun Kim, Ji Su Kim
{"title":"Automatic segmentation and evaluation techniques for free flap in reconstruction surgery using deep learning","authors":"Sang Mee Lee, M. Chung, Zero Kim, K. Lee, Da Eun Kim, Ji Su Kim","doi":"10.1109/CAI54212.2023.00066","DOIUrl":"https://doi.org/10.1109/CAI54212.2023.00066","url":null,"abstract":"Postoperative free flap monitoring would be crucial to achieve the best outcomes of microsurgical reconstruction, for which meticulous clinical examination has served as the gold standard. Despite its high reliability, requirement of excessive consumption of manpower has been addressed a main drawback. To overcome this issue, this study aimed to develop a novel system for free flap monitoring, which is based on artificial intelligence (AI) using deep learning system. Photographs of postoperative appearance of free flaps were gathered retrospectively from patients who underwent free flap-based reconstruction between 2020 and 2021, and prospectively from those in 2022, and used with dividing into 80%:20% for model training and test. The development process proceeded with categorizing into two parts; flap segmentation model that can identify the flap region in the photographs, and flap status classification model that can evaluate the flap perfusion status based on its color. A total of 2,068 photographs of 433 patients were used. The most reliable model for flap segmentation was developed based on U-Net algorithm, achieving a Dice Coefficient of 0.972. For the flap status classification model, the Support Vector Machine algorithms was adopted, showing an accuracy of 0.926, from which the ratio of red pixels in the flap region was extracted as a quantitative indicator. Our results suggest that this novel AI-based model for flap segmentation and status classification could achieve reliable outcomes. A further upgraded system based on this model may provide optimal flap monitoring, keeping high accuracy and reducing medical staffs’ labor burden.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125286194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yu-Hsing Hsieh, Jia-Da Li, Yao Lee, Chu-Song Chen, LiFu Wu, S. H. Cheng
{"title":"Improved Contrastive Unpaired Translation for Metal Artifacts Reduction in Nasopharyngeal CT Images","authors":"Yu-Hsing Hsieh, Jia-Da Li, Yao Lee, Chu-Song Chen, LiFu Wu, S. H. Cheng","doi":"10.1109/CAI54212.2023.00152","DOIUrl":"https://doi.org/10.1109/CAI54212.2023.00152","url":null,"abstract":"Metal artifacts (MA) reduction is crucial for clinical application yet often lacks paired training data. Learning MA reduction from unpaired data and enforcing fidelity seems a trade-off. The study proposed an improved contrastive unpaired translation solution to address the issues and demonstrate its efficacy.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127661582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Affective Computing for Social Companion Robots Using Fine-grained Speech Emotion Recognition","authors":"Saransh Ahuja, Amir Shabani","doi":"10.1109/cai54212.2023.00146","DOIUrl":"https://doi.org/10.1109/cai54212.2023.00146","url":null,"abstract":"The increasing demand and diverse applications for social companion robots necessitate the development of more engaging and meaningful human-robot interactions and hence affective computing or emotion Al. In this paper, we propose a fine-grained speech emotion recognition using a state-of-the-art Deep Convolutional Neural Network trained on three-channel representations of speech signals to classify each emotion and also their intensity level. Experimental results on a publicly available dataset with intensity level (RAVEDESS) show that our method can effectively predict the users emotion and their intensity with 95.85±1.38% accuracy, a promising results towards empowering companion robots to be more affective and potentially be helpful in emotion regulations of their users.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127701447","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
V. Kaburlasos, C. Lytridis, G. Siavalas, Viktoria Nikoleta Tsakalidou, Christos Tsakmakis, Ioannis Kalathas, T. Pachidis, K. Rantos, K. Kalaboukas
{"title":"Skilled Agricultural Task Delivery by a Digital Twin","authors":"V. Kaburlasos, C. Lytridis, G. Siavalas, Viktoria Nikoleta Tsakalidou, Christos Tsakmakis, Ioannis Kalathas, T. Pachidis, K. Rantos, K. Kalaboukas","doi":"10.1109/CAI54212.2023.00159","DOIUrl":"https://doi.org/10.1109/CAI54212.2023.00159","url":null,"abstract":"Human intelligence is inherently associated with non-numerical data including structures, e.g. the human hand and other. Data semantics are represented here by an order relation in the context of the Lattice Computing (LC) paradigm. A well-defined inclusion measure function on partially (lattice) ordered data is used by a semi-autonomous Cyber-Physical System (CPS) for decision-making based on logic; in particular, an agricultural robot, or agrobot for short, is a CPS proposed for harvesting grapes. The proposed agrobot is driven remotely by a human using a Digital Twin (DT). Preliminary results are outlined, where an agrobot includes a mechanical hand mounted on a robotic arm. Cyber-security issues are also considered.","PeriodicalId":129324,"journal":{"name":"2023 IEEE Conference on Artificial Intelligence (CAI)","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128601481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}