{"title":"新的相似度函数","authors":"H. Yazdani, D. Ortiz-Arroyo, H. Kwasnicka","doi":"10.1109/ICAIPR.2016.7585210","DOIUrl":null,"url":null,"abstract":"In data science, there are some parameters that affect the accuracy of selected algorithms, regardless of their type. Type of data objects, membership assignments, and distance or similarity functions are the most important parameters that provide or not a proper environment for learning algorithms. The paper evaluates similarity functions as fundamental keys for membership assignments. The issues on conventional similarity functions are discussed in this paper. The paper introduces Weighted Feature Distance (WFD), and Prioritized Weighted Feature Distance (PWFD) to cover diversity in feature spaces. Most of the conventional distance functions compare data objects on vector space where any dominant feature may massively skew the final results. WFD functions perform better in supervised and unsupervised methods by comparing data objects on their feature spaces in addition to covering similarity on vector space. Prioritized Weighted Feature Distance (PWFD) works as same as WFD with ability to give priorities to desirable features. The accuracy of proposed functions are compared with other similarity functions on some data sets. Promising results show that the proposed functions work better than the other methods presented in this literature.","PeriodicalId":127231,"journal":{"name":"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"New similarity functions\",\"authors\":\"H. Yazdani, D. Ortiz-Arroyo, H. Kwasnicka\",\"doi\":\"10.1109/ICAIPR.2016.7585210\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In data science, there are some parameters that affect the accuracy of selected algorithms, regardless of their type. Type of data objects, membership assignments, and distance or similarity functions are the most important parameters that provide or not a proper environment for learning algorithms. The paper evaluates similarity functions as fundamental keys for membership assignments. The issues on conventional similarity functions are discussed in this paper. The paper introduces Weighted Feature Distance (WFD), and Prioritized Weighted Feature Distance (PWFD) to cover diversity in feature spaces. Most of the conventional distance functions compare data objects on vector space where any dominant feature may massively skew the final results. WFD functions perform better in supervised and unsupervised methods by comparing data objects on their feature spaces in addition to covering similarity on vector space. Prioritized Weighted Feature Distance (PWFD) works as same as WFD with ability to give priorities to desirable features. The accuracy of proposed functions are compared with other similarity functions on some data sets. Promising results show that the proposed functions work better than the other methods presented in this literature.\",\"PeriodicalId\":127231,\"journal\":{\"name\":\"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIPR.2016.7585210\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIPR.2016.7585210","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In data science, there are some parameters that affect the accuracy of selected algorithms, regardless of their type. Type of data objects, membership assignments, and distance or similarity functions are the most important parameters that provide or not a proper environment for learning algorithms. The paper evaluates similarity functions as fundamental keys for membership assignments. The issues on conventional similarity functions are discussed in this paper. The paper introduces Weighted Feature Distance (WFD), and Prioritized Weighted Feature Distance (PWFD) to cover diversity in feature spaces. Most of the conventional distance functions compare data objects on vector space where any dominant feature may massively skew the final results. WFD functions perform better in supervised and unsupervised methods by comparing data objects on their feature spaces in addition to covering similarity on vector space. Prioritized Weighted Feature Distance (PWFD) works as same as WFD with ability to give priorities to desirable features. The accuracy of proposed functions are compared with other similarity functions on some data sets. Promising results show that the proposed functions work better than the other methods presented in this literature.