{"title":"Large language models for ingredient substitution in food recipes using supervised fine-tuning and direct preference optimization","authors":"Thevin Senath , Kumuthu Athukorala , Ransika Costa , Surangika Ranathunga , Rishemjit Kaur","doi":"10.1016/j.nlp.2025.100177","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we address the challenge of recipe personalization through ingredient substitution. We use Large Language Models (LLMs) to build an ingredient substitution system that predicts plausible substitute ingredients within a given recipe context. Since LLMs have barely been applied to this task, we carry out an extensive set of experiments to determine the best LLM, prompt, and fine-tuning setup. We further experiment with methods such as multi-task learning, two-stage fine-tuning, and Direct Preference Optimization (DPO). The experiments are conducted on the publicly available Recipe1MSub corpus. The best results are produced by the Mistral7-Base LLM after fine-tuning and DPO, outperforming the strong baseline available for the same corpus with a Hit@1 score of 22.04. Although the LLM results lag behind the baseline on other metrics such as Hit@3 and Hit@10, we believe this research represents a promising step towards enabling personalized and creative culinary experiences through LLM-based ingredient substitution.</div></div>","PeriodicalId":100944,"journal":{"name":"Natural Language Processing Journal","volume":"12 ","pages":"Article 100177"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Language Processing Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949719125000536","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
In this paper, we address the challenge of recipe personalization through ingredient substitution. We use Large Language Models (LLMs) to build an ingredient substitution system that predicts plausible substitute ingredients within a given recipe context. Since LLMs have barely been applied to this task, we carry out an extensive set of experiments to determine the best LLM, prompt, and fine-tuning setup. We further experiment with methods such as multi-task learning, two-stage fine-tuning, and Direct Preference Optimization (DPO). The experiments are conducted on the publicly available Recipe1MSub corpus. The best results are produced by the Mistral7-Base LLM after fine-tuning and DPO, outperforming the strong baseline available for the same corpus with a Hit@1 score of 22.04. Although the LLM results lag behind the baseline on other metrics such as Hit@3 and Hit@10, we believe this research represents a promising step towards enabling personalized and creative culinary experiences through LLM-based ingredient substitution.
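The Hit@k figures quoted in the abstract can be read as follows: a test example counts as a hit at k when the ground-truth substitute appears among the model's top-k ranked candidates, and the score is the percentage of hits over the test set. A minimal sketch of this evaluation, with illustrative function names and toy data not taken from the paper:

```python
def hit_at_k(ranked_candidates, gold_substitute, k):
    """Return True if the gold substitute appears in the top-k predictions."""
    return gold_substitute in ranked_candidates[:k]

def mean_hit_at_k(predictions, golds, k):
    """Average Hit@k (as a percentage) over a set of test examples."""
    hits = sum(hit_at_k(ranked, gold, k)
               for ranked, gold in zip(predictions, golds))
    return 100.0 * hits / len(golds)

# Toy example: two recipes, each with a ranked list of substitute candidates.
preds = [["margarine", "coconut oil", "ghee"],      # gold "margarine": hit at k=1
         ["honey", "agave syrup", "maple syrup"]]   # gold "maple syrup": hit only at k=3
golds = ["margarine", "maple syrup"]
print(mean_hit_at_k(preds, golds, 1))  # 50.0
print(mean_hit_at_k(preds, golds, 3))  # 100.0
```

Under this reading, a Hit@1 of 22.04 means the model's single top-ranked substitute matched the reference substitute in roughly 22% of test cases.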