RIAO Conference最新文献

筛选
英文 中文
Deriving implicit user feedback from partial URLs for effective web page retrieval 从部分url获取隐式用户反馈,实现有效的网页检索
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937083
Rongmei Li, T. V. D. Weide
{"title":"Deriving implicit user feedback from partial URLs for effective web page retrieval","authors":"Rongmei Li, T. V. D. Weide","doi":"10.5555/1937055.1937083","DOIUrl":"https://doi.org/10.5555/1937055.1937083","url":null,"abstract":"User click-throughs provide a search context for understanding the user need of complex information. This paper re-examines the effectiveness of this approach when based on partial clicked data using the language modeling framework. We expand the original query by topical terms derived from clicked Web pages and enhance early precision via a more compact document representation. Since our URLs of Web pages are stripped, we first reconstruct them at different levels based on different collections. Our experimental results on the GOV2 test collection and AOL query log show improvement by 31.7% and 28.3% significantly in statMAP for two sources of reconstruction and 153 ad-hoc queries. Our model also outperforms pseudo relevance feedback.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131380010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Answer type validation in question answering systems 问答系统中的答案类型验证
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937058
Arnaud Grappy, Brigitte Grau
{"title":"Answer type validation in question answering systems","authors":"Arnaud Grappy, Brigitte Grau","doi":"10.5555/1937055.1937058","DOIUrl":"https://doi.org/10.5555/1937055.1937058","url":null,"abstract":"In open domain question-answering systems, numerous questions wait for answers of an explicit type. For example, the question \"Which president succeeded Jacques Chirac?\" requires an instance of president as answer. The method we present in this article aims at verifying that an answer given by a system corresponds to the given type. This verification is done by combining criteria provided by different methods dedicated to verify the appropriateness between an answer and a type. The first types of criteria are statistical and compute the presence rate of both the answer and the type in documents, other criteria rely on named entity recognizers and the last criteria are based on the use of Wikipedia.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127999533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Squido, a SaaS web mining system for professionals Squido,一个面向专业人士的SaaS网络挖掘系统
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937115
François Pouilloux, Louis-Marc Perez
{"title":"Squido, a SaaS web mining system for professionals","authors":"François Pouilloux, Louis-Marc Perez","doi":"10.5555/1937055.1937115","DOIUrl":"https://doi.org/10.5555/1937055.1937115","url":null,"abstract":"Web information overload is a common issue for knowledge workers. In this application paper, we describe how Squido, a SaaS Web Mining system developed by IXXO, enables knowledge workers to get more value from their Web research. \u0000 \u0000We present Squido's main innovations, an overview of its features and market, as well as experimental results comparing the efficiency of the crawl strategies available in the product.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132345750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving tag recommendation using social networks 使用社交网络改进标签推荐
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937077
Adam Rae, Börkur Sigurbjörnsson, R. V. Zwol
{"title":"Improving tag recommendation using social networks","authors":"Adam Rae, Börkur Sigurbjörnsson, R. V. Zwol","doi":"10.5555/1937055.1937077","DOIUrl":"https://doi.org/10.5555/1937055.1937077","url":null,"abstract":"In this paper we address the task of recommending additional tags to partially annotated media objects, in our case images. We propose an extendable framework that can recommend tags using a combination of different personalised and collective contexts. We combine information from four contexts: (1) all the photos in the system, (2) a user's own photos, (3) the photos of a user's social contacts, and (4) the photos posted in the groups of which a user is a member. Variants of methods (1) and (2) have been proposed in previous work, but the use of (3) and (4) is novel. \u0000 \u0000For each of the contexts we use the same probabilistic model and Borda Count based aggregation approach to generate recommendations from different contexts into a unified ranking of recommended tags. We evaluate our system using a large set of real-world data from Flickr. We show that by using personalised contexts we can significantly improve tag recommendation compared to using collective knowledge alone. We also analyse our experimental results to explore the capabilities of our system with respect to a user's social behaviour.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133891121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 111
Linguistic information extraction for job ads (SIRE project) 面向招聘广告的语言信息提取(SIRE项目)
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937114
Romain Loth, D. Battistelli, François-Régis Chaumartin, Hugues de Mazancourt, J. Minel, Axelle Vinckx
{"title":"Linguistic information extraction for job ads (SIRE project)","authors":"Romain Loth, D. Battistelli, François-Régis Chaumartin, Hugues de Mazancourt, J. Minel, Axelle Vinckx","doi":"10.5555/1937055.1937114","DOIUrl":"https://doi.org/10.5555/1937055.1937114","url":null,"abstract":"As a text, each job advertisement expresses rich information about the occupation at hand, such as competence needs (i.e. required degrees, field knowledge, task expertise or technical skills). To facilitate the access to this information, the SIRE project conducted a corpus based study of how to articulate HR expert ontologies with modern semi-supervised information extraction techniques. An adaptive semantic labeling framework is developed through a parallel work on retrieval rules and on latent semantic lexicons of terms and jargon phrases. In its operational stage, our prototype will collect online job ads and index their content into detailed RDF triples compatible with applications ranging from enhanced job search to automated labor-market analysis.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115561277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Boiling down information retrieval test collections 简化信息检索测试集合
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937066
T. Sakai, T. Mitamura
{"title":"Boiling down information retrieval test collections","authors":"T. Sakai, T. Mitamura","doi":"10.5555/1937055.1937066","DOIUrl":"https://doi.org/10.5555/1937055.1937066","url":null,"abstract":"Constructing large-scale test collections is costly and time-consuming, and a few relevance assessment methods have been proposed for constructing \"minimal\" information retrieval test collections that may still provide reliable experimental results. In contrast to building up such test collections, we take existing test collections constructed through the traditional pooling approach and empirically investigate whether they can be \"boiled down.\" More specifically, we report on experiments with test collections from both NT-CIR and TREC to investigate the effect of reducing both the topic set size and the pool depth on the outcome of a statistical significance test between two systems, starting with (approximately) 100 topics and depth-100 pools. We define cost (of manual relevance assessment) as the pool depth multiplied by the topic set size, and error as a system pair whose outcome of statistical significance testing differs from the original result based on the full test collection. Our main findings are: (a) Cost and the number of errors are negatively correlated, and any attempt at substantially reducing cost introduces some errors; (b) The NTCIR-7 IR4QA and the TREC 2004 robust track test collections all yield a comparable and considerable number of errors in response to cost reduction, and this is true despite the fact that the TREC relevance assessments relied on more than twice as many runs as the NTCIR ones; (c) Using 100 topics with depth-30 pools generally yields fewer errors than using 30 topics with depth-100 pools; and (d) Even with depth-100 pools, using fewer than 100 topics results in false alarms, i.e. two systems are declared significantly different even though the full topic set would declare otherwise.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124449596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A generic framework for the integration of heterogeneous metadata standards into a multimedia information retrieval system 将异构元数据标准集成到多媒体信息检索系统中的通用框架
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937073
Sébastien Laborie, Ana-Maria Manzat, F. Sèdes
{"title":"A generic framework for the integration of heterogeneous metadata standards into a multimedia information retrieval system","authors":"Sébastien Laborie, Ana-Maria Manzat, F. Sèdes","doi":"10.5555/1937055.1937073","DOIUrl":"https://doi.org/10.5555/1937055.1937073","url":null,"abstract":"The number and the heterogeneity of multimedia contents handled by information systems are increasing steeply. These contents are indexed in order to produce some metadata that are used during the retrieval process. However, several existing metadata standards can be used for describing the multimedia contents and choosing a particular one does not cover all the metadata features. A solution is the mixing of these standards and formats, but this does not ensure interoperability. To overcome this problem, we have proposed a generic metadata framework that could encapsulate the most common metadata standards. In this paper, we present the validation of this framework in the context of the LINDO project.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"46 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120940197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Essential summarizer: innovative automatic text summarization software in twenty languages Essential summarizer:创新的自动文本摘要软件,支持20种语言
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937111
Abderrafih Lehmam
{"title":"Essential summarizer: innovative automatic text summarization software in twenty languages","authors":"Abderrafih Lehmam","doi":"10.5555/1937055.1937111","DOIUrl":"https://doi.org/10.5555/1937055.1937111","url":null,"abstract":"With the advent of electronic textual documents following the fulgurating development of data processing, there are now pressing needs to extract useful and reusable information from text. It is thus quite natural to address the problem of the overabundance of digital textual information. The technology of automatic text summarization, along with other solutions in the area of text mining, tries to remedy this by providing easier access to essential information, in condensed form and for better potential reuse. Through a specific process, this technology makes it possible to analyze a text in order to extract only efficient information for reuse in view of precise goals, saving time and enhancing productivity. We have developed an automatic summarization software called Essential Summarizer, with an approach based on linguistic techniques to perform semantic analysis of written text. This innovative application is very fast and produces summaries tailored to the user's needs in twenty languages.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126927727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Know.Right.Now: a technical documentation system for dynamically publishing personalized content Know.Right.Now:用于动态发布个性化内容的技术文档系统
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937113
M. Manago, Ralph Traphöner, Bruno Defude
{"title":"Know.Right.Now: a technical documentation system for dynamically publishing personalized content","authors":"M. Manago, Ralph Traphöner, Bruno Defude","doi":"10.5555/1937055.1937113","DOIUrl":"https://doi.org/10.5555/1937055.1937113","url":null,"abstract":"Know.Right.Now is a technical documentation system that is used to dynamically publish personalized content. Depending on the user (i.e. according to his/her declared profile), the task that has to be performed and the product that the user is interested in, the system generates personalized content on-demand. The system is applied to the management of service literature at a large international company that publishes vast amounts of technical documents.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131456484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Can Esculape cure the complex of œdipe in the medical domain? 在医学领域可以脱皮治疗œdipe的复合体吗?
RIAO Conference Pub Date : 2010-04-28 DOI: 10.5555/1937055.1937060
M. Embarek, Olivier Ferret
{"title":"Can Esculape cure the complex of œdipe in the medical domain?","authors":"M. Embarek, Olivier Ferret","doi":"10.5555/1937055.1937060","DOIUrl":"https://doi.org/10.5555/1937055.1937060","url":null,"abstract":"In this article, we present Esculape, a question-answering system for French dedicated to family doctors and built from œdipe, an open-domain system. Esculape adds to œdipe the capability to exploit the concepts and relations of a domain model, the medical domain in the present case. Although a large number of resources exist in this domain (UMLS, MeSH ...), it is not possible to rely only on them, and more specifically on the relations they contain, to answer questions. We show how this difficulty can be overcome by learning linguistic patterns for identifying relations and applying them to extract answers.","PeriodicalId":120472,"journal":{"name":"RIAO Conference","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134598693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信