{"title":"Big Data Mining: Managing the Costs of Data Mining","authors":"Jaya R Ganasan","doi":"10.1109/ICTKE47035.2019.8966806","DOIUrl":null,"url":null,"abstract":"The amount of data collected and stored in various industries has grown exponentially in the last decade. Data is collected and stored from industries consisting of large consumers such as telecommunications, banking or financial sectors. Further, given the advent of cloud computing and software availability in the cloud being cheaper, smaller industries are utilizing data storage for competitive advantage. Companies increasingly rely on analysis of huge amounts of data to gain a strategic advantage, improving on product quality and providing better services to their end users be it the employee, consumer or customer. A combination of statistical techniques and file management tools once sufficed for analyzing mounds of data. The costs of analysis are often charged out at very high rates for companies that require data analysis and the output is dependent very much on analyzing the correct attributes within large databases to ensure the data analyzed provides the relevant result. The most known technique or tools are the subject of the growing field of knowledge discovery in databases (KDD) [1]. Using business process data mapping (BPDM) to define the targeted data along with the process of knowledge discovery mapping in the database may provide a more targeted approach with much lest costs expended.","PeriodicalId":442255,"journal":{"name":"2019 17th International Conference on ICT and Knowledge Engineering (ICT&KE)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 17th International Conference on ICT and Knowledge Engineering (ICT&KE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTKE47035.2019.8966806","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The amount of data collected and stored in various industries has grown exponentially in the last decade. Data is collected and stored from industries consisting of large consumers such as telecommunications, banking or financial sectors. Further, given the advent of cloud computing and software availability in the cloud being cheaper, smaller industries are utilizing data storage for competitive advantage. Companies increasingly rely on analysis of huge amounts of data to gain a strategic advantage, improving on product quality and providing better services to their end users be it the employee, consumer or customer. A combination of statistical techniques and file management tools once sufficed for analyzing mounds of data. The costs of analysis are often charged out at very high rates for companies that require data analysis and the output is dependent very much on analyzing the correct attributes within large databases to ensure the data analyzed provides the relevant result. The most known technique or tools are the subject of the growing field of knowledge discovery in databases (KDD) [1]. Using business process data mapping (BPDM) to define the targeted data along with the process of knowledge discovery mapping in the database may provide a more targeted approach with much lest costs expended.