A procedure for meaningful unsupervised clustering and its application for solvent classification

Yaroslava Pushkarova, Y. Kholin
{"title":"A procedure for meaningful unsupervised clustering and its application for solvent classification","authors":"Yaroslava Pushkarova, Y. Kholin","doi":"10.2478/s11532-014-0514-6","DOIUrl":null,"url":null,"abstract":"AbstractArtificial neural networks have proven to be a powerful tool for solving classification problems. Some difficulties still need to be overcome for their successful application to chemical data. The use of supervised neural networks implies the initial distribution of patterns between the pre-determined classes, while attribution of objects to the classes may be uncertain. Unsupervised neural networks are free from this problem, but do not always reveal the real structure of data. Classification algorithms which do not require a priori information about the distribution of patterns between the pre-determined classes and provide meaningful results are of special interest. This paper presents an approach based on the combination of Kohonen and probabilistic networks which enables the determination of the number of classes and the reliable classification of objects. This is illustrated for a set of 76 solvents based on nine characteristics. The resulting classification is chemically interpretable. The approach proved to be also applicable in a different field, namely in examining the solubility of C60 fullerene. The solvents belonging to the same group demonstrate similar abilities to dissolve C60. This makes it possible to estimate the solubility of fullerenes in solvents for which there are no experimental data\n","PeriodicalId":9888,"journal":{"name":"Central European Journal of Chemistry","volume":"35 1","pages":"594-603"},"PeriodicalIF":0.0000,"publicationDate":"2014-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Central European Journal of Chemistry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/s11532-014-0514-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

AbstractArtificial neural networks have proven to be a powerful tool for solving classification problems. Some difficulties still need to be overcome for their successful application to chemical data. The use of supervised neural networks implies the initial distribution of patterns between the pre-determined classes, while attribution of objects to the classes may be uncertain. Unsupervised neural networks are free from this problem, but do not always reveal the real structure of data. Classification algorithms which do not require a priori information about the distribution of patterns between the pre-determined classes and provide meaningful results are of special interest. This paper presents an approach based on the combination of Kohonen and probabilistic networks which enables the determination of the number of classes and the reliable classification of objects. This is illustrated for a set of 76 solvents based on nine characteristics. The resulting classification is chemically interpretable. The approach proved to be also applicable in a different field, namely in examining the solubility of C60 fullerene. The solvents belonging to the same group demonstrate similar abilities to dissolve C60. This makes it possible to estimate the solubility of fullerenes in solvents for which there are no experimental data
一种有意义的无监督聚类方法及其在溶剂分类中的应用
摘要人工神经网络已被证明是解决分类问题的有力工具。它们要成功地应用于化学数据,还需要克服一些困难。监督神经网络的使用意味着预先确定的类别之间模式的初始分布,而对象归属于类别可能是不确定的。无监督神经网络不存在这个问题,但并不总能揭示数据的真实结构。不需要预先确定的类之间的模式分布的先验信息并提供有意义的结果的分类算法是特别有趣的。本文提出了一种基于Kohonen和概率网络相结合的方法,可以确定目标的类数和可靠分类。这是基于九种特性的一组76种溶剂的例子。由此产生的分类在化学上是可以解释的。该方法被证明也适用于另一个领域,即检测C60富勒烯的溶解度。同属一类的溶剂溶解C60的能力相似。这使得估计富勒烯在没有实验数据的溶剂中的溶解度成为可能
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
审稿时长
3 months
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信