NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance

Huan-Yi Su, Ke Wu, Yu-Hao Huang, Wu-Jun Li
{"title":"NumLLM:对数字敏感的中文金融大语言模型","authors":"Huan-Yi Su, Ke Wu, Yu-Hao Huang, Wu-Jun Li","doi":"arxiv-2405.00566","DOIUrl":null,"url":null,"abstract":"Recently, many works have proposed various financial large language models\n(FinLLMs) by pre-training from scratch or fine-tuning open-sourced LLMs on\nfinancial corpora. However, existing FinLLMs exhibit unsatisfactory performance\nin understanding financial text when numeric variables are involved in\nquestions. In this paper, we propose a novel LLM, called numeric-sensitive\nlarge language model (NumLLM), for Chinese finance. We first construct a\nfinancial corpus from financial textbooks which is essential for improving\nnumeric capability of LLMs during fine-tuning. After that, we train two\nindividual low-rank adaptation (LoRA) modules by fine-tuning on our constructed\nfinancial corpus. One module is for adapting general-purpose LLMs to financial\ndomain, and the other module is for enhancing the ability of NumLLM to\nunderstand financial text with numeric variables. Lastly, we merge the two LoRA\nmodules into the foundation model to obtain NumLLM for inference. Experiments\non financial question-answering benchmark show that NumLLM can boost the\nperformance of the foundation model and can achieve the best overall\nperformance compared to all baselines, on both numeric and non-numeric\nquestions.","PeriodicalId":501372,"journal":{"name":"arXiv - QuantFin - General Finance","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance\",\"authors\":\"Huan-Yi Su, Ke Wu, Yu-Hao Huang, Wu-Jun Li\",\"doi\":\"arxiv-2405.00566\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, many works have proposed various financial large language models\\n(FinLLMs) by pre-training from scratch or fine-tuning open-sourced LLMs on\\nfinancial corpora. However, existing FinLLMs exhibit unsatisfactory performance\\nin understanding financial text when numeric variables are involved in\\nquestions. In this paper, we propose a novel LLM, called numeric-sensitive\\nlarge language model (NumLLM), for Chinese finance. We first construct a\\nfinancial corpus from financial textbooks which is essential for improving\\nnumeric capability of LLMs during fine-tuning. After that, we train two\\nindividual low-rank adaptation (LoRA) modules by fine-tuning on our constructed\\nfinancial corpus. One module is for adapting general-purpose LLMs to financial\\ndomain, and the other module is for enhancing the ability of NumLLM to\\nunderstand financial text with numeric variables. Lastly, we merge the two LoRA\\nmodules into the foundation model to obtain NumLLM for inference. 
Experiments\\non financial question-answering benchmark show that NumLLM can boost the\\nperformance of the foundation model and can achieve the best overall\\nperformance compared to all baselines, on both numeric and non-numeric\\nquestions.\",\"PeriodicalId\":501372,\"journal\":{\"name\":\"arXiv - QuantFin - General Finance\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuantFin - General Finance\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.00566\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuantFin - General Finance","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.00566","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0

Abstract

Recently, many works have proposed various financial large language models (FinLLMs), either by pre-training from scratch or by fine-tuning open-source LLMs on financial corpora. However, existing FinLLMs exhibit unsatisfactory performance in understanding financial text when numeric variables are involved in the questions. In this paper, we propose a novel LLM for Chinese finance, called the numeric-sensitive large language model (NumLLM). We first construct a financial corpus from financial textbooks, which is essential for improving the numeric capability of LLMs during fine-tuning. We then train two individual low-rank adaptation (LoRA) modules by fine-tuning on the constructed corpus: one adapts the general-purpose LLM to the financial domain, and the other enhances NumLLM's ability to understand financial text containing numeric variables. Finally, we merge the two LoRA modules into the foundation model to obtain NumLLM for inference. Experiments on a financial question-answering benchmark show that NumLLM boosts the performance of the foundation model and achieves the best overall performance among all baselines, on both numeric and non-numeric questions.
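The two-adapter recipe described above maps naturally onto standard tooling. Below is a minimal sketch of the merge step using the Hugging Face peft library; it is not the authors' released code, and the model path, adapter paths, adapter names, and equal merging weights are all illustrative assumptions (the linear combination additionally assumes both adapters share the same rank).

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Hypothetical identifiers; the paper does not specify these paths.
BASE = "path/to/foundation-model"
model = AutoModelForCausalLM.from_pretrained(BASE, torch_dtype=torch.float16)
tokenizer = AutoTokenizer.from_pretrained(BASE)

# Attach the two independently fine-tuned LoRA modules:
# one for financial-domain adaptation, one for numeric capability.
model = PeftModel.from_pretrained(model, "path/to/lora-financial", adapter_name="fin")
model.load_adapter("path/to/lora-numeric", adapter_name="num")

# Linearly combine the two adapters (equal weights assumed here; the
# paper's merging rule may differ) and fold the result into the base
# weights so inference needs no adapter machinery.
model.add_weighted_adapter(adapters=["fin", "num"], weights=[1.0, 1.0],
                           adapter_name="numllm", combination_type="linear")
model.set_adapter("numllm")
model = model.merge_and_unload()  # plain Transformers model for inference

Folding both adapters into the foundation model's weights matches the paper's final step and removes the adapter overhead at inference time.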