A. Gisbrecht, B. Hammer, Frank-Michael Schleif, Xibin Zhu
{"title":"生物医学数据分析的加速核聚类","authors":"A. Gisbrecht, B. Hammer, Frank-Michael Schleif, Xibin Zhu","doi":"10.1109/CIBCB.2011.5948460","DOIUrl":null,"url":null,"abstract":"The increasing size and complexity of modern data sets turns modern data mining techniques to indispensable tools when inspecting biomedical data sets. Thereby, dedicated data formats and detailed information often cause the need for problem specific similarities or dissimilarities instead of the standard Euclidean norm. Therefore, a number of clustering techniques which rely on similarities or dissimilarities only have recently been proposed. In this contribution, we review some of the most popular dissimilarity based clustering techniques and we discuss possibilities how to get around the usually squared complexity of the models due to their dependency on the full dissimilarity matrix. We evaluate the techniques on two benchmarks from the biomedical domain.","PeriodicalId":395505,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Accelerating kernel clustering for biomedical data analysis\",\"authors\":\"A. Gisbrecht, B. Hammer, Frank-Michael Schleif, Xibin Zhu\",\"doi\":\"10.1109/CIBCB.2011.5948460\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The increasing size and complexity of modern data sets turns modern data mining techniques to indispensable tools when inspecting biomedical data sets. Thereby, dedicated data formats and detailed information often cause the need for problem specific similarities or dissimilarities instead of the standard Euclidean norm. Therefore, a number of clustering techniques which rely on similarities or dissimilarities only have recently been proposed. In this contribution, we review some of the most popular dissimilarity based clustering techniques and we discuss possibilities how to get around the usually squared complexity of the models due to their dependency on the full dissimilarity matrix. We evaluate the techniques on two benchmarks from the biomedical domain.\",\"PeriodicalId\":395505,\"journal\":{\"name\":\"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIBCB.2011.5948460\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIBCB.2011.5948460","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Accelerating kernel clustering for biomedical data analysis
The increasing size and complexity of modern data sets turns modern data mining techniques to indispensable tools when inspecting biomedical data sets. Thereby, dedicated data formats and detailed information often cause the need for problem specific similarities or dissimilarities instead of the standard Euclidean norm. Therefore, a number of clustering techniques which rely on similarities or dissimilarities only have recently been proposed. In this contribution, we review some of the most popular dissimilarity based clustering techniques and we discuss possibilities how to get around the usually squared complexity of the models due to their dependency on the full dissimilarity matrix. We evaluate the techniques on two benchmarks from the biomedical domain.