{"title":"从现实世界文本中获取知识:一些经验教训","authors":"F. Gomez","doi":"10.1109/TAI.1994.346490","DOIUrl":null,"url":null,"abstract":"In recent years, natural language processing (NLP) has experienced a dramatic research shift by focusing on the processing of real-world texts, rather than on restricted domains. This shift of focus has been an acid test for the core components of NLP, namely parsing and semantic interpretation. For the last few years, we have been working on knowledge acquisition from texts (F. Gomez, 1985; F. Gomez and C. Segani, 1989). The research started as a set of theoretical ideas and, then, gradually we built a system that embodies the theory. The system could be at first called a \"toy system\", that works in restricted domains. Recently, we have extended every component of the system to handle \"real-world\" texts. The model has been implemented in a program that reads unedited texts from The World Book Encyclopedia (1994), and acquires new concepts and conceptual relations about topics dealing with the dietary habits of animals, their classifications and habitats. The program is also able to answer an ample set of questions about the database that it has automatically acquired. Some of the major lessons that derive from the research are presented.<<ETX>>","PeriodicalId":262014,"journal":{"name":"Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge acquisition from real-world texts: some lessons learned\",\"authors\":\"F. Gomez\",\"doi\":\"10.1109/TAI.1994.346490\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, natural language processing (NLP) has experienced a dramatic research shift by focusing on the processing of real-world texts, rather than on restricted domains. This shift of focus has been an acid test for the core components of NLP, namely parsing and semantic interpretation. For the last few years, we have been working on knowledge acquisition from texts (F. Gomez, 1985; F. Gomez and C. Segani, 1989). The research started as a set of theoretical ideas and, then, gradually we built a system that embodies the theory. The system could be at first called a \\\"toy system\\\", that works in restricted domains. Recently, we have extended every component of the system to handle \\\"real-world\\\" texts. The model has been implemented in a program that reads unedited texts from The World Book Encyclopedia (1994), and acquires new concepts and conceptual relations about topics dealing with the dietary habits of animals, their classifications and habitats. The program is also able to answer an ample set of questions about the database that it has automatically acquired. Some of the major lessons that derive from the research are presented.<<ETX>>\",\"PeriodicalId\":262014,\"journal\":{\"name\":\"Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-11-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TAI.1994.346490\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Sixth International Conference on Tools with Artificial Intelligence. TAI 94","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TAI.1994.346490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
近年来,自然语言处理(NLP)的研究经历了一个巨大的转变,将重点放在现实世界文本的处理上,而不是局限于有限的领域。这种焦点的转移是对NLP的核心组件,即解析和语义解释的严峻考验。在过去的几年里,我们一直致力于从文本中获取知识(F. Gomez, 1985;F. Gomez和C. Segani, 1989)。研究从一套理论思想开始,逐渐构建了一个体现理论的体系。该系统最初可以被称为“玩具系统”,在有限的领域中工作。最近,我们扩展了系统的每个组件来处理“真实世界”的文本。该模型已在一个程序中实施,该程序读取《世界图书百科全书》(The World Book Encyclopedia, 1994)中未编辑的文本,并获得有关动物饮食习惯、分类和栖息地等主题的新概念和概念关系。该程序还能够回答关于它自动获取的数据库的大量问题。本文提出了从研究中得出的一些主要经验教训。
Knowledge acquisition from real-world texts: some lessons learned
In recent years, natural language processing (NLP) has experienced a dramatic research shift by focusing on the processing of real-world texts, rather than on restricted domains. This shift of focus has been an acid test for the core components of NLP, namely parsing and semantic interpretation. For the last few years, we have been working on knowledge acquisition from texts (F. Gomez, 1985; F. Gomez and C. Segani, 1989). The research started as a set of theoretical ideas and, then, gradually we built a system that embodies the theory. The system could be at first called a "toy system", that works in restricted domains. Recently, we have extended every component of the system to handle "real-world" texts. The model has been implemented in a program that reads unedited texts from The World Book Encyclopedia (1994), and acquires new concepts and conceptual relations about topics dealing with the dietary habits of animals, their classifications and habitats. The program is also able to answer an ample set of questions about the database that it has automatically acquired. Some of the major lessons that derive from the research are presented.<>