Big Data Mining and Analytics最新文献

筛选
英文 中文
Effect of Feature Selection on the Prediction of Direct Normal Irradiance 特征选择对直接法向辐照度预测的影响
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-07-18 DOI: 10.26599/BDMA.2022.9020003
Mohamed Khalifa Boutahir;Yousef Farhaoui;Mourade Azrour;Imad Zeroual;Ahmad El Allaoui
{"title":"Effect of Feature Selection on the Prediction of Direct Normal Irradiance","authors":"Mohamed Khalifa Boutahir;Yousef Farhaoui;Mourade Azrour;Imad Zeroual;Ahmad El Allaoui","doi":"10.26599/BDMA.2022.9020003","DOIUrl":"https://doi.org/10.26599/BDMA.2022.9020003","url":null,"abstract":"Solar radiation is capable of producing heat, causing chemical reactions, or generating electricity. Thus, the amount of solar radiation at different times of the day must be determined to design and equip all solar systems. Moreover, it is necessary to have a thorough understanding of different solar radiation components, such as Direct Normal Irradiance (DNI), Diffuse Horizontal Irradiance (DHI), and Global Horizontal Irradiance (GHI). Unfortunately, measurements of solar radiation are not easily accessible for the majority of regions on the globe. This paper aims to develop a set of deep learning models through feature importance algorithms to predict the DNI data. The proposed models are based on historical data of meteorological parameters and solar radiation properties in a specific location of the region of Errachidia, Morocco, from January 1, 2017, to December 31, 2019, with an interval of 60 minutes. The findings demonstrated that feature selection approaches play a crucial role in forecasting of solar radiation accurately when compared with the available data.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 4","pages":"309-317"},"PeriodicalIF":13.6,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9832761/09832772.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68067554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Total Contents 总目录
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-07-18
{"title":"Total Contents","authors":"","doi":"","DOIUrl":"https://doi.org/","url":null,"abstract":"","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 4","pages":"I-II"},"PeriodicalIF":13.6,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9832761/09832764.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68067884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
$tautext{JOWL}$: A Systematic Approach to Build and Evolve a Temporal OWL 2 Ontology Based on Temporal JSON Big Data $tautext{JOWL}$:一种基于时态JSON大数据构建和演化时态OWL2本体的系统方法
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-07-18 DOI: 10.26599/BDMA.2021.9020019
Zouhaier Brahmia;Fabio Grandi;Rafik Bouaziz
{"title":"$tautext{JOWL}$: A Systematic Approach to Build and Evolve a Temporal OWL 2 Ontology Based on Temporal JSON Big Data","authors":"Zouhaier Brahmia;Fabio Grandi;Rafik Bouaziz","doi":"10.26599/BDMA.2021.9020019","DOIUrl":"https://doi.org/10.26599/BDMA.2021.9020019","url":null,"abstract":"Nowadays, ontologies, which are defined under the OWL 2 Web Ontology Language (OWL 2), are being used in several fields like artificial intelligence, knowledge engineering, and Semantic Web environments to access data, answer queries, or infer new knowledge. In particular, ontologies can be used to model the semantics of big data as an enabling factor for the deployment of intelligent analytics. Big data are being widely stored and exchanged in JavaScript Object Notation (JSON) format, in particular by Web applications. However, JSON data collections lack explicit semantics as they are in general schema-less, which does not allow to efficiently leverage the benefits of big data. Furthermore, several applications require bookkeeping of the entire history of big data changes, for which no support is provided by mainstream Big Data management systems, including Not only SQL (NoSQL) database systems. In this paper, we propose an approach, named \u0000<tex>$tau text{JOWL}$</tex>\u0000 (temporal OWL 2 from temporal JSON), which allows users (i) to automatically build a temporal OWL 2 ontology of data, following the Closed World Assumption (CWA), from temporal JSON-based big data, and (ii) to manage its incremental maintenance accommodating the evolution of these data, in a temporal and multi-schema environment.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 4","pages":"271-281"},"PeriodicalIF":13.6,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9832761/09832769.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68067551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Internet of Things in the Health Sector: Toward Minimizing Energy Consumption 物联网在卫生领域的应用:实现能源消耗最小化
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-07-18 DOI: 10.26599/BDMA.2021.9020031
Mohammed Moutaib;Tarik Ahajjam;Mohammed Fattah;Yousef Farhaoui;Badraddine Aghoutane;Moulhime El Bekkali
{"title":"Application of Internet of Things in the Health Sector: Toward Minimizing Energy Consumption","authors":"Mohammed Moutaib;Tarik Ahajjam;Mohammed Fattah;Yousef Farhaoui;Badraddine Aghoutane;Moulhime El Bekkali","doi":"10.26599/BDMA.2021.9020031","DOIUrl":"https://doi.org/10.26599/BDMA.2021.9020031","url":null,"abstract":"The Internet of Things (IoT) is currently reflected in the increase in the number of connected objects, that is, devices with their own identity and computing and communication capacities. IoT is recognized as one of the most critical areas for future technologies, gaining worldwide attention. It applies to many areas, where it has achieved success, such as healthcare, where a patient is monitored using nodes and lightweight sensors. However, the powerful functions of IoT in the medical field are based on communication, analysis, processing, and management of data autonomously without any manual intervention, which presents many difficulties, such as energy consumption. However, these issues significantly slow down the development and rapid deployment of this technology. The main causes of wasted energy from connected objects include collisions that occur when two or more nodes send data simultaneously and the leading cause of data retransmission that occurs when a collision occurs or when data are not received correctly due to channel fading. The distance between nodes is one of the factors influencing energy consumption. In this article, we have proposed direct communication between nodes to avoid collision domains, which will help reduce data retransmission. The results show that the distribution can ensure the performance of the system under general conditions compared to the centralization and to the existing works.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 4","pages":"302-308"},"PeriodicalIF":13.6,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9832761/09832765.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68067553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Predicting Students' Final Performance Using Artificial Neural Networks 用人工神经网络预测学生的期末成绩
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-07-18 DOI: 10.26599/BDMA.2021.9020030
Tarik Ahajjam;Mohammed Moutaib;Haidar Aissa;Mourad Azrour;Yousef Farhaoui;Mohammed Fattah
{"title":"Predicting Students' Final Performance Using Artificial Neural Networks","authors":"Tarik Ahajjam;Mohammed Moutaib;Haidar Aissa;Mourad Azrour;Yousef Farhaoui;Mohammed Fattah","doi":"10.26599/BDMA.2021.9020030","DOIUrl":"https://doi.org/10.26599/BDMA.2021.9020030","url":null,"abstract":"Artificial Intelligence (AI) is based on algorithms that allow machines to make decisions for humans. This technology enhances the users' experience in various ways. Several studies have been conducted in the field of education to solve the problem of student orientation and performance using various Machine Learning (ML) algorithms. The main goal of this article is to predict Moroccan students' performance in the region of Guelmim Oued Noun using an intelligent system based on neural networks, one of the best data mining techniques that provided us with the best results.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 4","pages":"294-301"},"PeriodicalIF":13.6,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9832761/09832763.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68067883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Influencing Factors and Clustering Characteristics of COVID-19: A Global Analysis 新冠肺炎疫情影响因素及聚集特征的全球分析
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-07-18 DOI: 10.26599/BDMA.2022.9020010
Tianlong Zheng;Chunli Zhang;Yueting Shi;Debao Chen;Sheng Liu
{"title":"Influencing Factors and Clustering Characteristics of COVID-19: A Global Analysis","authors":"Tianlong Zheng;Chunli Zhang;Yueting Shi;Debao Chen;Sheng Liu","doi":"10.26599/BDMA.2022.9020010","DOIUrl":"https://doi.org/10.26599/BDMA.2022.9020010","url":null,"abstract":"The unprecedented coronavirus disease 2019 (COVID-19) pandemic is still raging (in year 2021) in many countries worldwide. Various response strategies to study the characteristics and distributions of the virus in various regions of the world have been developed to assist in the prevention and control of this epidemic. Descriptive statistics and regression analysis on COVID-19 data from different countries were conducted in this study to compare and evaluate various regression models. Results showed that the extreme random forest regression (ERFR) model had the best performance, and factors such as population density, ozone, median age, life expectancy, and Human Development Index (HDI) were relatively influential on the spread and diffusion of COVID-19 in the ERFR model. In addition, the epidemic clustering characteristics were analyzed through the spectral clustering algorithm. The visualization results of spectral clustering showed that the geographical distribution of global COVID-19 pandemic spread formation was highly clustered, and its clustering characteristics and influencing factors also exhibited some consistency in distribution. This study aims to deepen the understanding of the international community regarding the global COVID-19 pandemic to develop measures for countries worldwide to mitigate potential large-scale outbreaks and improve the ability to respond to such public health emergencies.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 4","pages":"318-338"},"PeriodicalIF":13.6,"publicationDate":"2022-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9832761/09832767.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68068250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimal Dependence of Performance and Efficiency of Collaborative Filtering on Random Stratified Subsampling 随机分层子采样对协同滤波性能和效率的最优依赖性
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-06-09 DOI: 10.26599/BDMA.2021.9020032
Samin Poudel;Marwan Bikdash
{"title":"Optimal Dependence of Performance and Efficiency of Collaborative Filtering on Random Stratified Subsampling","authors":"Samin Poudel;Marwan Bikdash","doi":"10.26599/BDMA.2021.9020032","DOIUrl":"https://doi.org/10.26599/BDMA.2021.9020032","url":null,"abstract":"Dropping fractions of users or items judiciously can reduce the computational cost of Collaborative Filtering (CF) algorithms. The effect of this subsampling on the computing time and accuracy of CF is not fully understood, and clear guidelines for selecting optimal or even appropriate subsampling levels are not available. In this paper, we present a Density-based Random Stratified Subsampling using Clustering (DRSC) algorithm in which the desired Fraction of Users Dropped (FUD) and Fraction of Items Dropped (FID) are specified, and the overall density during subsampling is maintained. Subsequently, we develop simple models of the Training Time Improvement (TTI) and the Accuracy Loss (AL) as functions of FUD and FID, based on extensive simulations of seven standard CF algorithms as applied to various primary matrices from MovieLens, Yahoo Music Rating, and Amazon Automotive data. Simulations show that both TTI and a scaled AL are bi-linear in FID and FUD for all seven methods. The TTI linear regression of a CF method appears to be same for all datasets. Extensive simulations illustrate that TTI can be estimated reliably with FUD and FID only, but AL requires considering additional dataset characteristics. The derived models are then used to optimize the levels of subsampling addressing the tradeoff between TTI and AL. A simple sub-optimal approximation was found, in which the optimal AL is proportional to the optimal Training Time Reduction Factor (TTRF) for higher values of TTRF, and the optimal subsampling levels, like optimal FID/(1–FID), are proportional to the square root of TTRF.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 3","pages":"192-205"},"PeriodicalIF":13.6,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9793354/09793360.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67848636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Deep Feature Learning for Intrinsic Signature Based Camera Discrimination 基于特征识别的摄像头识别的深度特征学习
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-06-09 DOI: 10.26599/BDMA.2022.9020006
Chaity Banerjee;Tharun Kumar Doppalapudi;Eduardo Pasiliao;Tathagata Mukherjee
{"title":"Deep Feature Learning for Intrinsic Signature Based Camera Discrimination","authors":"Chaity Banerjee;Tharun Kumar Doppalapudi;Eduardo Pasiliao;Tathagata Mukherjee","doi":"10.26599/BDMA.2022.9020006","DOIUrl":"https://doi.org/10.26599/BDMA.2022.9020006","url":null,"abstract":"In this paper we consider the problem of “end-to-end” digital camera identification by considering sequence of images obtained from the cameras. The problem of digital camera identification is harder than the problem of identifying its analog counterpart since the process of analog to digital conversion smooths out the intrinsic noise in the analog signal. However it is known that identifying a digital camera is possible by analyzing the camera's intrinsic sensor artifacts that are introduced into the images/videos during the process of photo/video capture. It is known that such methods are computationally intensive requiring expensive pre-processing steps. In this paper we propose an end-to-end deep feature learning framework for identifying cameras using images obtained from them. We conduct experiments using three custom datasets: the first containing two cameras in an indoor environment where each camera may observe different scenes having no overlapping features, the second containing images from four cameras in an outdoor setting but where each camera observes scenes having overlapping features and the third containing images from two cameras observing the same checkerboard pattern in an indoor setting. Our results show that it is possible to capture the intrinsic hardware signature of the cameras using deep feature representations in an end-to-end framework. These deep feature maps can in turn be used to disambiguate the cameras from each another. Our system is end-to-end, requires no complicated pre-processing steps and the trained model is computationally efficient during testing, paving a way to have near instantaneous decisions for the problem of digital camera identification in production environments. Finally we present comparisons against the current state-of-the-art in digital camera identification which clearly establishes the superiority of the end-to-end solution.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 3","pages":"206-227"},"PeriodicalIF":13.6,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9793354/09793358.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68010339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Call for Papers: Special Issue on Big Data Computing for Internet of Things and Utility and Cloud Computing 论文征集:物联网大数据计算与公用事业与云计算特刊
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-06-09 DOI: 10.26599/BDMA.2022.9020011
{"title":"Call for Papers: Special Issue on Big Data Computing for Internet of Things and Utility and Cloud Computing","authors":"","doi":"10.26599/BDMA.2022.9020011","DOIUrl":"https://doi.org/10.26599/BDMA.2022.9020011","url":null,"abstract":"","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 3","pages":"270-270"},"PeriodicalIF":13.6,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9793354/09792624.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68010340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
$p$-Norm Broad Learning for Negative Emotion Classification in Social Networks 社交网络中消极情绪分类的$p$-范数广义学习
IF 13.6 1区 计算机科学
Big Data Mining and Analytics Pub Date : 2022-06-09 DOI: 10.26599/BDMA.2022.9020008
Guanghao Chen;Sancheng Peng;Rong Zeng;Zhongwang Hu;Lihong Cao;Yongmei Zhou;Zhouhao Ouyang;Xiangyu Nie
{"title":"$p$-Norm Broad Learning for Negative Emotion Classification in Social Networks","authors":"Guanghao Chen;Sancheng Peng;Rong Zeng;Zhongwang Hu;Lihong Cao;Yongmei Zhou;Zhouhao Ouyang;Xiangyu Nie","doi":"10.26599/BDMA.2022.9020008","DOIUrl":"https://doi.org/10.26599/BDMA.2022.9020008","url":null,"abstract":"Negative emotion classification refers to the automatic classification of negative emotion of texts in social networks. Most existing methods are based on deep learning models, facing challenges such as complex structures and too many hyperparameters. To meet these challenges, in this paper, we propose a method for negative emotion classification utilizing a Robustly Optimized BERT Pretraining Approach (RoBERTa) and \u0000<tex>$p$</tex>\u0000-norm Broad Learning (\u0000<tex>$p$</tex>\u0000-BL). Specifically, there are mainly three contributions in this paper. Firstly, we fine-tune the RoBERTa to adapt it to the task of negative emotion classification. Then, we employ the fine-tuned RoBERTa to extract features of original texts and generate sentence vectors. Secondly, we adopt \u0000<tex>$p$</tex>\u0000-BL to construct a classifier and then predict negative emotions of texts using the classifier. Compared with deep learning models, \u0000<tex>$p$</tex>\u0000-BL has advantages such as a simple structure that is only 3-layer and fewer parameters to be trained. Moreover, it can suppress the adverse effects of more outliers and noise in data by flexibly changing the value of \u0000<tex>$p$</tex>\u0000. Thirdly, we conduct extensive experiments on the public datasets, and the experimental results show that our proposed method outperforms the baseline methods on the tested datasets.","PeriodicalId":52355,"journal":{"name":"Big Data Mining and Analytics","volume":"5 3","pages":"245-256"},"PeriodicalIF":13.6,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/8254253/9793354/09793355.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68010342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信