Hybrid-LLM-GNN: integrating large language models and graph neural networks for enhanced materials property prediction†

Digital Discovery (2025), issue 2, pp. 376–383 · DOI: 10.1039/D4DD00199K
Impact Factor 6.2 · JCR Q1 (Chemistry, Multidisciplinary)

Youjia Li, Vishu Gupta, Muhammed Nur Talha Kilic, Kamal Choudhary, Daniel Wines, Wei-keng Liao, Alok Choudhary and Ankit Agrawal

Published: 2024-12-18 · PDF: https://pubs.rsc.org/en/content/articlepdf/2025/dd/d4dd00199k?page=search · Landing page: https://pubs.rsc.org/en/content/articlelanding/2025/dd/d4dd00199k
Citations: 0

Abstract

Graph-centric learning has attracted significant interest in materials informatics. Accordingly, a family of graph-based machine learning models, primarily utilizing Graph Neural Networks (GNNs), has been developed to provide accurate prediction of material properties. In recent years, Large Language Models (LLMs) have revolutionized existing scientific workflows that process text representations, thanks to their exceptional ability to utilize extensive common knowledge for understanding semantics. With the help of automated text representation tools, fine-tuned LLMs have demonstrated competitive prediction accuracy as standalone predictors. In this paper, we propose to integrate the insights from GNNs and LLMs to enhance both prediction accuracy and model interpretability. Inspired by the feature-extraction-based transfer learning study for the GNN model, we introduce a novel framework that extracts and combines GNN and LLM embeddings to predict material properties. In this study, we employed ALIGNN as the GNN model and utilized BERT and MatBERT as the LLM models. We evaluated the proposed framework in cross-property scenarios using seven properties. We find that the combined feature extraction approach using GNN and LLM outperforms the GNN-only approach in the majority of cases, with up to a 25% improvement in accuracy. We conducted a model explanation analysis through text erasure to interpret the model predictions by examining the contribution of different parts of the text representation.
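The pipeline the abstract describes, extracting embeddings from a pretrained GNN and an LLM for the same material, fusing them, and training a downstream property predictor, can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the mean-pooled BERT embedding, the Ridge regressor, and the premise that GNN embeddings (e.g., ALIGNN penultimate-layer features) and auto-generated text descriptions are already available are all assumptions made here for clarity.

```python
# Minimal sketch of the hybrid GNN+LLM feature-fusion idea described in the abstract.
# Assumptions: gnn_embs is an (n_samples, d_gnn) array exported from a pretrained GNN,
# texts are text descriptions of the same materials, y holds target property values.
import numpy as np
import torch
from transformers import AutoModel, AutoTokenizer
from sklearn.linear_model import Ridge
from sklearn.model_selection import train_test_split


def llm_embed(texts, model_name="bert-base-uncased", device="cpu"):
    """Mean-pooled last-hidden-state embeddings for a list of material descriptions."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name).to(device).eval()
    embs = []
    with torch.no_grad():
        for text in texts:
            inputs = tokenizer(text, return_tensors="pt", truncation=True).to(device)
            hidden = model(**inputs).last_hidden_state  # (1, seq_len, hidden_dim)
            embs.append(hidden.mean(dim=1).squeeze(0).cpu().numpy())
    return np.stack(embs)


def train_hybrid_predictor(gnn_embs, texts, y):
    """Concatenate GNN and LLM embeddings, then fit a simple downstream regressor."""
    llm_embs = llm_embed(texts)
    fused = np.concatenate([gnn_embs, llm_embs], axis=1)  # hybrid GNN+LLM features
    X_train, X_test, y_train, y_test = train_test_split(
        fused, y, test_size=0.2, random_state=0
    )
    model = Ridge(alpha=1.0).fit(X_train, y_train)
    return model, model.score(X_test, y_test)
```

The key design point illustrated here is that the two embedding spaces are kept frozen and only a lightweight predictor is trained on the concatenated features, which is the general shape of a feature-extraction-based transfer learning setup; the paper's actual predictor and fusion details may differ.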

[Graphical abstract image]
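The abstract's closing point, interpretation via text erasure, can be illustrated with a simple occlusion-style sketch: each sentence-level segment of a material's description is removed in turn, the hybrid prediction is recomputed, and the shift in the prediction is read as that segment's contribution. The sentence-level segmentation, the signed difference score, and the `embed_fn` hook (e.g., the `llm_embed` helper sketched above) are illustrative assumptions, not the paper's exact procedure.

```python
# Hedged sketch of an occlusion-style "text erasure" analysis: erase one segment of the
# description at a time, re-embed, re-predict with the hybrid model, and record how much
# the prediction moves relative to the full-text baseline.
import numpy as np


def erasure_contributions(model, gnn_emb, text, embed_fn):
    """Return (segment, delta) pairs for sentence-level segments of the description."""
    segments = [s.strip() for s in text.split(".") if s.strip()]
    base_feat = np.concatenate([gnn_emb, embed_fn([text])[0]])
    base_pred = model.predict(base_feat.reshape(1, -1))[0]
    contributions = []
    for i, seg in enumerate(segments):
        reduced = ". ".join(s for j, s in enumerate(segments) if j != i)
        feat = np.concatenate([gnn_emb, embed_fn([reduced])[0]])
        pred = model.predict(feat.reshape(1, -1))[0]
        contributions.append((seg, float(base_pred - pred)))  # contribution of erased segment
    return contributions
```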
