Proceedings Eighth Symposium on String Processing and Information Retrieval: latest publications

Speeding-up Hirschberg and Hunt-Szymanski LCS algorithms

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989737

M. Crochemore, C. Iliopoulos, Y. Pinzón

Citations: 16

A model for the representation and focussed retrieval of structured documents based on fuzzy aggregation

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989746

G. Kazai, M. Lalmas, T. Roelleke

Citations: 29

Design of a graphical user interface for focussed retrieval of structured documents

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989775

F. Crestani, P. de la Fuente, J. Vegas

Citations: 4

Using semantics for paragraph selection in question answering systems

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989765

J. Vicedo

Citations: 1

Musical sequence comparison for melodic and rhythmic similarities

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989744

T. Kadota, Masahiro Hirao, A. Ishino, M. Takeda, A. Shinohara, F. Matsuo

Citations: 10

Fast categorisation of large document collections

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989757

Vaughan R. Shanks, H. Williams

Abstract: As the volume of data stored online increases, careful management of large document collections becomes increasingly important. Categorisation is one important document management technique. It has been effectively employed on the Web, where links to documents are maintained in topic or interest areas in, for example, the manually categorised Yahoo! hierarchy. The drawback of manual categorisation is that it is practical only for small numbers of documents, is not scalable, and relies on the subjective judgement of human assessors. Automatic categorisation has been shown to be an accurate alternative to manual categorisation. In automatic categorisation, documents are processed and automatically assigned to pre-defined categories that represent an interest or topic area. We propose and investigate heuristics for fast categorisation of large collections of documents that are focused on selecting a minimal set of representative features from uncategorised documents. We show that these new heuristics are accurate (in some cases more accurate than the baseline techniques) and also permit more than three-fold reductions in processing time for categorising large collections.

Citations: 14

Compaction techniques for nextword indexes

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989735

D. Bahle, H. Williams, J. Zobel

Citations: 19

Re-store: a system for compressing, browsing, and searching large documents

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989752

Alistair Moffat, R. Wan

Citations: 18

An efficient bottom-up distance between trees

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989761

G. Valiente

Citations: 127

A documental database query language

Proceedings Eighth Symposium on String Processing and Information Retrieval Pub Date : 2001-11-13 DOI: 10.1109/SPIRE.2001.989772

N. Brisaboa, Miguel R. Penabad, Á. Places, F. J. Rodríguez

Abstract: This work presents a natural-language-based technique to build user interfaces to query document databases through the web. We call this technique Bounded Natural Language (BNL). Interfaces based on BNL are useful to query document databases containing only structured data, only text, or both. That is, the underlying formalism of BNL can integrate restrictions over structured and non-structured data (such as text). Interfaces using BNL can be programmed ad hoc for any document database, but in this paper we present a system with an ontology-based architecture in which the user interface is automatically generated by a software module (User Interface Generator) capable of reading and following the ontology. This ontology is a conceptualization of the database model, which uses a label in natural language for any concept in the ontology. Each label represents the usual name for a concept in the real world. The ontology includes general concepts, useful when the user is interested in documents in any corpus in the database, and specific concepts, useful when the user is interested in a specific corpus. That is, databases can store one or more corpora of documents, and queries can be issued either over the whole database or over a specific corpus. The ontology guides the execution of the User Interface Generator and other software modules in such a way that a change in the database does not require changes in the program code, because the whole system runs following the ontology. That is, if a modification in the database schema occurs, only the ontology must be changed, and the User Interface Generator will produce a new and different user interface adapted to the new database.

Citations: 8