Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications: Latest Articles
{"title":"Knowledge discovery from users Web-page navigation","authors":"C. Shahabi, A. Zarkesh, Jafar Adibi, Vishal Shah","doi":"10.1109/RIDE.1997.583692","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583692","url":null,"abstract":"The authors propose to detect users' navigation paths to the advantage of Web site owners. First, they explain the design and implementation of a profiler which captures a client's selected links and page order, accurate page viewing time and cache references, using a Java based remote agent. The information captured by the profiler is then utilized by a knowledge discovery technique to cluster users with similar interests. They introduce a novel path clustering method based on the similarity of the history of user navigation. This approach is capable of capturing the interests of the user which could persist through several subsequent hypertext link selections. Finally, they evaluate their path clustering technique via a simulation study on a sample WWW site. They show that, depending on the level of inserted noise, they can recover the correct clusters with an average error margin of 10%-27%.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130188440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"PRIME-GC. A medical information retrieval prototype on the Web","authors":"M. Mechkour, P. Mulhem, F. Fourel, C. Berrut","doi":"10.1109/RIDE.1997.583681","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583681","url":null,"abstract":"The authors describe a prototype, PRIME, of a multimedia medical information retrieval system. The documents managed by PRIME are patient records, which are composed of administrative data, textual reports, and magnetic resonance images. PRIME is developed on top of the object oriented DBMS O₂ and its interface can be any WWW navigator (Netscape, Internet Explorer, etc.). The retrieval engine of PRIME is based on Sowa's (1984, 1991) conceptual graph formalism structures and operations.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130917309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On disconnected browsing of distributed information","authors":"A. Joshi, S. Weerawarana, E. Houstis","doi":"10.1109/RIDE.1997.583714","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583714","url":null,"abstract":"The software and protocols associated with information browsing systems are largely designed with static access points and wired networks in mind; HTTP and the Web are a case in point. Static hosts are connected to wired, high bandwidth networks, and are capable of transmitting and receiving large amounts of data without significant delays. As such, the size and format of the data files being received by the browser/client has never been a concern. However, this causes problems when information access is desired on mobile hosts, since data transmission over a wireless network is much slower than on a wired network. Mobile computers are also relatively resource-poor, compared to their desktop counterparts. This fact is ignored by HTTP servers, and large data files are transmitted to computers that cannot properly display them. Also, mobile computers operate in constantly changing network environments. It is possible for a mobile computer to become temporarily disconnected from a network when it changes base stations or goes out of range of a base station. A mobile host may also doze off to preserve battery power and thus be disconnected. The information browsing system and protocol associated with mobile computers should thus be able to tolerate the fault of temporary disconnection. The article addresses these problems in the context of Web browsing from a mobile host. It investigates an efficient model for browsing and describes the design of a smart Web browsing application which performs transactions based on the user's available resources and manages disconnection.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"27 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120995416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Caching of large database objects in Web servers","authors":"D. Jadav, Monish Gupta, M. Lakshmi","doi":"10.1109/RIDE.1997.583688","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583688","url":null,"abstract":"The popularity of the World Wide Web has been increasing at an exponential rate of late. As such growth was unanticipated, the infrastructure is increasingly experiencing problems. The combination of increased network bandwidth demand and overloaded servers results in increased data retrieval latency for the end-user. Caching data at appropriate points in the Web helps alleviate this problem. Almost all previous and existing Web servers use a flat file approach to store data, with use of database management systems (DBMSs) rudimentary, if extant at all. Storing pages in a file system may result in faster retrieval, but storing them in a DBMS gives the user greater administrative control. The use of a DBMS in a Web server, and the concomitant implication of frequently changing data, complicates the caching problem in Web-based applications. The Illustra Object Relational DBMS provides a flexible and user-friendly environment for building Web applications where all the server data is stored in the DBMS. The authors develop a caching scheme for large objects in the Web DataBlade module of the Illustra ORDBMS. Implementation details and preliminary performance results are presented.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"401 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116029117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A dynamic migration algorithm for a distributed memory-based file management system","authors":"J. Griffioen, T. A. Anderson, Y. Breitbart","doi":"10.1109/RIDE.1997.583722","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583722","url":null,"abstract":"Conventional migration strategies attempt to evenly balance the load across all available server machines. The paper discusses why conventional migration approaches are not necessarily appropriate for distributed memory based file systems and presents an alternative approach that spreads data (possibly unevenly) across as few machines as possible and involves other available machines only as needed. The main advantage of our approach is that it keeps the system minimally distributed, thereby reducing the failure rate among servers, the communication overhead among servers, the time needed to compute data relocation, distributed addressing costs, and the probability of unanticipated migrations (e.g., caused by, and an inconvenience to, returning users).","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133185971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"CRITIKAL: client-server rule induction technology for industrial knowledge acquisition from large databases","authors":"A. Al-Attar","doi":"10.1109/RIDE.1997.583703","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583703","url":null,"abstract":"This paper describes a European Commission funded project to develop and demonstrate an advanced client-server induction system capable of supporting efficient, effective data mining of very large databases in business environments. The project combines the induction technology of Attar Software, the HPC application enabling expertise of the Universities of Southampton and Stuttgart, and the user-pull of two very different large database users: GEHE, the German pharmaceutical wholesaler, and Lloyds-TSB, the British banking group.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123156078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The DBInspector project","authors":"P. Stofella","doi":"10.1109/RIDE.1997.583704","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583704","url":null,"abstract":"This paper provides an overview of the DBInspector Esprit project, part of the HPCN PCI/Capri initiative. The project focused on the use of High Performance Database and 3D Data Visualisation technologies to construct a data mining environment for the anti-money-laundering activities performed by the Ufficio Italiano dei Cambi (UIC). UIC, part of the Italian central bank, is in charge of the analysis of the Financial Flows Archive, a centralised database holding information on the transactions of the entire Italian financial system.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117127337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"WIDE-a distributed architecture for workflow management","authors":"S. Ceri, P. Grefen, G. Sanchez","doi":"10.1109/RIDE.1997.583705","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583705","url":null,"abstract":"This paper presents the distributed architecture of the WIDE workflow management system. We show how distribution and scalability are obtained by the use of a distributed object model, a client/server architecture, and a distributed workflow server architecture. Specific attention is paid to the extended transaction support and active rule support subarchitectures.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124580949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DAFS: data mining file server","authors":"Iain McLaren, E. Babb, J. Bocca","doi":"10.1109/RIDE.1997.583701","DOIUrl":"https://doi.org/10.1109/RIDE.1997.583701","url":null,"abstract":"The report gives an overview of the DAFS Esprit project (no. 20169). The objective of the DAFS project is to build a parallel data mining file server. Current data mining tools often suffer from limited functionality and poor cost performance on data sets greater than 1 GByte. The DAFS project aims to overcome these problems through the close integration of data mining and database technologies on a scalable parallel platform.","PeriodicalId":177468,"journal":{"name":"Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125532867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}