International Workshop On Research Issues in Digital Libraries最新文献

On the science of search: statistical approaches, evaluation, optimisation 关于搜索的科学:统计方法，评估，优化

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364745

S. Robertson

引用次数: 1

How to compose a complex document recognition system 如何组成一个复杂的文档识别系统

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364759

H. Fujisawa

引用次数: 0

Finding an answer to a question 寻找问题的答案

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364751

Brigitte Grau

{"title":"Finding an answer to a question","authors":"Brigitte Grau","doi":"10.1145/1364742.1364751","DOIUrl":"https://doi.org/10.1145/1364742.1364751","url":null,"abstract":"The huge quantity of available electronic information leads to a growing need for users to have tools able to be precise and selective. These kinds of tools have to provide answers to requests quite rapidly without requiring the user to explore each document, to reformulate her request or to seek for the answer inside documents. From that viewpoint, finding an answer consists not only in finding relevant documents but also in extracting relevant parts. This leads us to express the question-answering problem in terms of an information retrieval problem that can be solved using natural language processing (NLP) approaches. In my talk, I will focus on defining what a \"good\" answer is, and how a system can find it.\u0000 A good answer has to give the required piece of information. However, it is not sufficient; it also has both to be presented within its context of interpretation and to be justified in order to give a user means to evaluate if the answer fits her needs and is appropriate.\u0000 One can view searching an answer to a question as a reformulation problem: according to what is asked, find one of the different linguistic expressions of the answer in all candidate sentences. Within this framework, interlingual question-answering can also be seen as another kind of linguistic variation. The answer phrasing can be considered as an affirmative reformulation of the question, partly or totally, which entails the definition of models that match with sentences containing the answer. According to the different approaches, the kinds of model and the matching criteria greatly differ. It can consist in building a structured representation that makes explicit the semantic relations between the concepts of the question and that is compared to a similar representation of sentences. As this approach requires a syntactic parser and a semantic knowledge base, which are not always available in all the languages, systems often apply a less formal approach based on a similarity measure between a passage and the question and answers are extracted from highest scored passages. Similarity involves different criteria: question terms and their linguistic variations in passages, syntactic proximity, answer type. We will see that, in such an approach, justifications can be envisioned by using text themselves, considered as depositories of semantic knowledge. I will focus on the approach the LIR group of LIMSI has taken for its monolingual and bilingual systems.","PeriodicalId":287514,"journal":{"name":"International Workshop On Research Issues in Digital Libraries","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124011340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Information retrieval and digital libraries: lessons of research 信息检索与数字图书馆:研究的经验教训

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364743

Karen Spärck Jones

引用次数: 5

Open source search and research 开源搜索和研究

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364748

M. Beigbeder, Wray L. Buntine, Wai Gen Yee

引用次数: 1

Digital audiovisual repositories: an introduction 数字音像资源库:介绍

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364753

Richard Wright

引用次数: 2

From CLIR to CLIE: some lessons in NTCIR evaluation 从CLIR到CLIE: NTCIR评价的几点启示

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364762

Hsin-Hsi Chen

引用次数: 0

Shallow syntax analysis in Sanskrit guided by semantic nets constraints 语义网约束下的梵文浅语法分析

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364750

G. Huet

{"title":"Shallow syntax analysis in Sanskrit guided by semantic nets constraints","authors":"G. Huet","doi":"10.1145/1364742.1364750","DOIUrl":"https://doi.org/10.1145/1364742.1364750","url":null,"abstract":"We present the state of the art of a computational platform for the analysis of classical Sanskrit. The platform comprises modules for phonology, morphology, segmentation and shallow syntax analysis, organized around a structured lexical database. It relies on the Zen toolkit for finite state automata and transducers, which provides data structures and algorithms for the modular construction and execution of finite state machines, in a functional framework.\u0000 Some of the layers proceed in bottom-up synthesis mode - for instance, noun and verb morphological modules generate all inflected forms from stems and roots listed in the lexicon. Morphemes are assembled through internal sandhi, and the inflected forms are stored with morphological tags in dictionaries usable for lemmatizing. These dictionaries are then compiled into transducers, implementing the analysis of external sandhi, the phonological process which merges words together by euphony. This provides a tagging segmenter, which analyses a sentence presented as a stream of phonemes and produces a stream of tagged lexical entries, hyperlinked to the lexicon.\u0000 The next layer is a syntax analyser, guided by semantic nets constraints expressing dependencies between the word forms. Finite verb forms demand semantic roles, according to valency patterns depending on the voice (active, passive) of the form and the governance (transitive, etc) of the root. Conversely, noun/adjective forms provide actors which may fill those roles, provided agreement constraints are satisfied. Tool words are mapped to transducers operating on tagged streams, allowing the modeling of linguistic phenomena such as coordination by abstract interpretation of actor streams. The parser ranks the various interpretations (matching actors with roles) with penalties, and returns to the user the minimum penalty analyses, for final validation of ambiguities. The whole platform is organized as a Web service, allowing the piecewise tagging of a Sanskrit text.","PeriodicalId":287514,"journal":{"name":"International Workshop On Research Issues in Digital Libraries","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131164032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 38

Toward a common semantics between media and languages 媒体和语言之间的共同语义

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364755

C. Fluhr, G. Grefenstette, Adrian Daniel Popescu

引用次数: 2

Multilingual information access: the contribution of evaluation 多语种信息获取:评价的贡献

International Workshop On Research Issues in Digital Libraries Pub Date : 2006-12-12 DOI: 10.1145/1364742.1364761

C. Peters

{"title":"Multilingual information access: the contribution of evaluation","authors":"C. Peters","doi":"10.1145/1364742.1364761","DOIUrl":"https://doi.org/10.1145/1364742.1364761","url":null,"abstract":"Since evaluation of cross-language information retrieval systems began at TREC in 1997 and NTCIR in 1998 and, in particular, with the launch of the Cross-Language Evaluation Forum (CLEF) in 2000, considerable progress has been made in this particular sector of IR. Advances can be considered in two stages. The first stage regarded in particular the development of text retrieval systems from simple so-called \"bilingual\" systems in which a query in one language is used to search a document collection in another to truly \"multilingual\" retrieval systems where a query in one language can find relevant results from a collection of documents in multiple languages. In the second stage, the focus was no longer just on multilingual document retrieval but was diversified to include different kinds of text retrieval across languages (e.g multilingual question answering) and retrieval on different kinds of media (e.g. collections containing images or speech). However, although the results from the research perspective have been interesting, there has been little real take-up by the applications communities. In the paper we describe the results achieved by CLEF over the years and propose a third stage for multilingual system evaluation which gives far more attention to questions regarding usability and user satisfaction but also provides ways for the results achieved to be transferred to the operational context.","PeriodicalId":287514,"journal":{"name":"International Workshop On Research Issues in Digital Libraries","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128263844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2