将语料库特定的语义信息整合到问答上下文中

Ontologies and Information Systems for the Semantic Web Pub Date : 2008-10-30 DOI:10.1145/1458484.1458497

Protima Banerjee, Hyoil Han

{"title":"将语料库特定的语义信息整合到问答上下文中","authors":"Protima Banerjee, Hyoil Han","doi":"10.1145/1458484.1458497","DOIUrl":null,"url":null,"abstract":"In today's environment of information overload, Question Answering (QA) is a critically important research area for the Semantic Web. In order for humans to make effective use of the expansive information sources available to us, we require automated tools to help us make sense of large amounts of data. Within this framework, Question Context plays an important role. We define Question Context to be an semantic structure that can be used to enrich queries so that the user's information need is better represented. This paper describes the theoretical foundations of a novel approach that uses statistical language modeling techniques to create Question Context and to then integrate it into the Information Retrieval stage of QA. We base our approach on two established language modeling methods - the Aspect Model, which is the basis of Probabilistic Latent Semantic Analysis (PLSA) and Relevance-Based Language Models. Our approach proposes an Aspect-Based Relevance Language Model as the Question Context Model, and our methodology incorporates corpus-specific semantic concepts into the QA process. Words from the most heavily relevant aspects are then incorporated into the query. We present some interesting preliminary qualitative results that show the potential usefulness of the Question Context Model to both the first (IR) and second (Intelligent Information Processing) stages of QA.","PeriodicalId":363359,"journal":{"name":"Ontologies and Information Systems for the Semantic Web","volume":"390 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Incorporation of corpus-specific semantic information into question answering context\",\"authors\":\"Protima Banerjee, Hyoil Han\",\"doi\":\"10.1145/1458484.1458497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In today's environment of information overload, Question Answering (QA) is a critically important research area for the Semantic Web. In order for humans to make effective use of the expansive information sources available to us, we require automated tools to help us make sense of large amounts of data. Within this framework, Question Context plays an important role. We define Question Context to be an semantic structure that can be used to enrich queries so that the user's information need is better represented. This paper describes the theoretical foundations of a novel approach that uses statistical language modeling techniques to create Question Context and to then integrate it into the Information Retrieval stage of QA. We base our approach on two established language modeling methods - the Aspect Model, which is the basis of Probabilistic Latent Semantic Analysis (PLSA) and Relevance-Based Language Models. Our approach proposes an Aspect-Based Relevance Language Model as the Question Context Model, and our methodology incorporates corpus-specific semantic concepts into the QA process. Words from the most heavily relevant aspects are then incorporated into the query. We present some interesting preliminary qualitative results that show the potential usefulness of the Question Context Model to both the first (IR) and second (Intelligent Information Processing) stages of QA.\",\"PeriodicalId\":363359,\"journal\":{\"name\":\"Ontologies and Information Systems for the Semantic Web\",\"volume\":\"390 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ontologies and Information Systems for the Semantic Web\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/1458484.1458497\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ontologies and Information Systems for the Semantic Web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1458484.1458497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

摘要

在当今信息过载的环境下，问答(QA)是语义网一个非常重要的研究领域。为了使人类能够有效地利用我们可用的大量信息源，我们需要自动化工具来帮助我们理解大量的数据。在这个框架中，问题语境起着重要的作用。我们将问题上下文定义为可用于丰富查询的语义结构，以便更好地表示用户的信息需求。本文描述了一种新方法的理论基础，该方法使用统计语言建模技术来创建问题上下文，然后将其集成到QA的信息检索阶段。我们的方法基于两种已建立的语言建模方法-方面模型，这是概率潜在语义分析(PLSA)和基于相关性的语言模型的基础。我们的方法提出了一个基于方面的相关语言模型作为问题上下文模型，我们的方法将语料库特定的语义概念纳入QA过程。然后将最相关方面的单词合并到查询中。我们提出了一些有趣的初步定性结果，显示了问题上下文模型在QA的第一(IR)和第二(智能信息处理)阶段的潜在有用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Incorporation of corpus-specific semantic information into question answering context

In today's environment of information overload, Question Answering (QA) is a critically important research area for the Semantic Web. In order for humans to make effective use of the expansive information sources available to us, we require automated tools to help us make sense of large amounts of data. Within this framework, Question Context plays an important role. We define Question Context to be an semantic structure that can be used to enrich queries so that the user's information need is better represented. This paper describes the theoretical foundations of a novel approach that uses statistical language modeling techniques to create Question Context and to then integrate it into the Information Retrieval stage of QA. We base our approach on two established language modeling methods - the Aspect Model, which is the basis of Probabilistic Latent Semantic Analysis (PLSA) and Relevance-Based Language Models. Our approach proposes an Aspect-Based Relevance Language Model as the Question Context Model, and our methodology incorporates corpus-specific semantic concepts into the QA process. Words from the most heavily relevant aspects are then incorporated into the query. We present some interesting preliminary qualitative results that show the potential usefulness of the Question Context Model to both the first (IR) and second (Intelligent Information Processing) stages of QA.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ontologies and Information Systems for the Semantic Web

自引率

0.00%

发文量