{"title":"结合语言注释和CRF选择从食谱中提取配料名称","authors":"Thierry Hamon, N. Grabar","doi":"10.1145/2506023.2506035","DOIUrl":null,"url":null,"abstract":"Nutrition and diet have direct and considerable impact on our well-being and health. This field attracts researchers from different areas, such as medicine, nutrition and epidemiology, computer sciences, artificial intelligence and natural language processing (NLP).We process the recipes with NLP methods in order to automatically identify ingredient names within recipes. We propose a hybrid system based on linguistic enrichment of the recipes and selection of the relevant ingredient names with a CRF method. Semantic resources have been specifically built for processing two kinds of information: exact (e.g. quantity expressed in grams or liters, durations expressed in minutes or days) and fuzzy (e.g. quantities expressed in chouilla (smidgeon) and louche (ladle), durations sequenced with aprèes, ensuite, alors que (the, after that, while)). The experiments are performed with French-language textual data. The results demonstrate that the proposed method is useful for searching and managing the recipes.","PeriodicalId":91851,"journal":{"name":"CEA'13 : proceedings of the 5th International Workshop on Multimedia for Cooking & Eating Activities : October 21, 2013, Barcelona, Spain. Workshop on Multimedia for Cooking and Eating Activities (5th : 2013 : Barcelona, Spain)","volume":"107 1","pages":"63-68"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Extraction of ingredient names from recipes by combining linguistic annotations and CRF selection\",\"authors\":\"Thierry Hamon, N. Grabar\",\"doi\":\"10.1145/2506023.2506035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nutrition and diet have direct and considerable impact on our well-being and health. This field attracts researchers from different areas, such as medicine, nutrition and epidemiology, computer sciences, artificial intelligence and natural language processing (NLP).We process the recipes with NLP methods in order to automatically identify ingredient names within recipes. We propose a hybrid system based on linguistic enrichment of the recipes and selection of the relevant ingredient names with a CRF method. Semantic resources have been specifically built for processing two kinds of information: exact (e.g. quantity expressed in grams or liters, durations expressed in minutes or days) and fuzzy (e.g. quantities expressed in chouilla (smidgeon) and louche (ladle), durations sequenced with aprèes, ensuite, alors que (the, after that, while)). The experiments are performed with French-language textual data. The results demonstrate that the proposed method is useful for searching and managing the recipes.\",\"PeriodicalId\":91851,\"journal\":{\"name\":\"CEA'13 : proceedings of the 5th International Workshop on Multimedia for Cooking & Eating Activities : October 21, 2013, Barcelona, Spain. Workshop on Multimedia for Cooking and Eating Activities (5th : 2013 : Barcelona, Spain)\",\"volume\":\"107 1\",\"pages\":\"63-68\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CEA'13 : proceedings of the 5th International Workshop on Multimedia for Cooking & Eating Activities : October 21, 2013, Barcelona, Spain. Workshop on Multimedia for Cooking and Eating Activities (5th : 2013 : Barcelona, Spain)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2506023.2506035\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CEA'13 : proceedings of the 5th International Workshop on Multimedia for Cooking & Eating Activities : October 21, 2013, Barcelona, Spain. Workshop on Multimedia for Cooking and Eating Activities (5th : 2013 : Barcelona, Spain)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2506023.2506035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
营养和饮食对我们的幸福和健康有着直接而可观的影响。该领域吸引了来自不同领域的研究人员,如医学、营养学和流行病学、计算机科学、人工智能和自然语言处理(NLP)。我们使用NLP方法处理食谱,以便自动识别食谱中的成分名称。我们提出了一种基于语言丰富的食谱和基于CRF方法的相关配料名称选择的混合系统。语义资源专门用于处理两种类型的信息:精确的(例如以克或升表示的数量,以分钟或天表示的持续时间)和模糊的(例如以chouilla (smidgeon)和louche (ladle)表示的数量,以aprures, ensuite, alors que(在那之后,while)排序的持续时间)。实验是用法语文本数据进行的。结果表明,该方法可用于菜谱的搜索和管理。
Extraction of ingredient names from recipes by combining linguistic annotations and CRF selection
Nutrition and diet have direct and considerable impact on our well-being and health. This field attracts researchers from different areas, such as medicine, nutrition and epidemiology, computer sciences, artificial intelligence and natural language processing (NLP).We process the recipes with NLP methods in order to automatically identify ingredient names within recipes. We propose a hybrid system based on linguistic enrichment of the recipes and selection of the relevant ingredient names with a CRF method. Semantic resources have been specifically built for processing two kinds of information: exact (e.g. quantity expressed in grams or liters, durations expressed in minutes or days) and fuzzy (e.g. quantities expressed in chouilla (smidgeon) and louche (ladle), durations sequenced with aprèes, ensuite, alors que (the, after that, while)). The experiments are performed with French-language textual data. The results demonstrate that the proposed method is useful for searching and managing the recipes.