Rapid prototyping of pattern mining problems isomorphic to boolean lattices
Frédéric Flouvat, F. D. Marchi, Jean-Marc Petit
2008 Second International Conference on Research Challenges in Information Science, 2008-06-03
DOI: 10.1109/RCIS.2008.4632104 (https://doi.org/10.1109/RCIS.2008.4632104)
Interesting pattern mining is an important family of data mining problems with applications in many domains. In this paper, we focus on the special class of pattern mining problems known to be "representable as sets". The main contribution of this paper is to exploit the common theoretical background of these problems at the implementation level, by providing efficient data structures for boolean lattice representation and several implementations of well-known algorithms. In this way, these problems can be implemented with minimal effort: programmers do not have to deal with low-level code, since customized data structures and algorithms are available for free. A toolkit, called iZi, has been devised and applied to several problems such as itemset mining, constraint mining in relational databases, and query rewriting in data integration systems. According to our first results, the programs obtained using our toolkit offer a very good tradeoff between performance and development simplicity. Some methodological guidelines are also provided to guide programmers at both the theoretical level and the code level.
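
To make the idea of a problem "representable as sets" concrete, here is a minimal sketch of a generic levelwise search over the boolean lattice of subsets of a set of items. It is not iZi's API: the names `levelwise`, `is_interesting`, and `support` are hypothetical, and the sketch assumes that the predicate is anti-monotone (every subset of an interesting pattern is interesting). Frequent itemset mining is used as the instance, with the predicate supplied by the caller and the lattice traversal kept generic.

```python
from itertools import combinations
from typing import Callable, FrozenSet, Iterable, List, Tuple

Pattern = FrozenSet[str]


def levelwise(
    items: Iterable[str],
    is_interesting: Callable[[Pattern], bool],
) -> Tuple[List[Pattern], List[Pattern]]:
    """Explore the subset lattice bottom-up, one level (pattern size) at a time.

    Assumes is_interesting is anti-monotone. Returns the non-empty interesting
    patterns and the negative border (minimal non-interesting patterns).
    """
    theory: List[Pattern] = []
    negative_border: List[Pattern] = []
    level: List[Pattern] = [frozenset([i]) for i in sorted(set(items))]

    while level:
        kept = set()
        for pattern in level:
            if is_interesting(pattern):
                kept.add(pattern)
            else:
                # Every proper subset was kept earlier, so this one is minimal.
                negative_border.append(pattern)
        theory.extend(sorted(kept, key=sorted))

        # Candidate generation: join two kept patterns that differ by one item,
        # then prune any candidate with a subset of the current size not kept.
        candidates = set()
        for a, b in combinations(kept, 2):
            c = a | b
            if len(c) == len(a) + 1 and all(
                frozenset(s) in kept for s in combinations(c, len(a))
            ):
                candidates.add(c)
        level = sorted(candidates, key=sorted)

    return theory, negative_border


# Instance: frequent itemsets over a toy transaction database (made-up data).
transactions = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b"}]
min_support = 2


def support(pattern: Pattern) -> int:
    """Number of transactions containing every item of the pattern."""
    return sum(1 for t in transactions if pattern <= t)


theory, border = levelwise(
    {"a", "b", "c"}, lambda p: support(p) >= min_support
)
print([sorted(p) for p in theory])
print([sorted(p) for p in border])
```

Only the predicate is specific to itemset mining. Under the same assumption, another problem of this class could reuse `levelwise` unchanged by supplying its own items and predicate, which is the kind of separation the abstract describes.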