{"title":"Data Augmentation and Transfer Learning Approaches Applied to Facial Expressions Recognition","authors":"Enrico Randellini, Leonardo Rigutini, Claudio Saccà","doi":"10.5121/csit.2021.111912","DOIUrl":"https://doi.org/10.5121/csit.2021.111912","url":null,"abstract":"The face expression is the first thing we pay attention to when we want to understand a person’s state of mind. Thus, the ability to recognize facial expressions in an automatic way is a very interesting research field. In this paper, because the small size of available training datasets, we propose a novel data augmentation technique that improves the performances in the recognition task. We apply geometrical transformations and build from scratch GAN models able to generate new synthetic images for each emotion type. Thus, on the augmented datasets we fine tune pretrained convolutional neural networks with different architectures. To measure the generalization ability of the models, we apply extra-database protocol approach, namely we train models on the augmented versions of training dataset and test them on two different databases. The combination of these techniques allows to reach average accuracy values of the order of 85% for the InceptionResNetV2 model.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"1118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116068034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Intelligent System to Improve Athlete Depression and Eating Disorder using Artificial Intelligence and Big Data Analysis","authors":"Xuannuo Chen, Yu Sun","doi":"10.5121/csit.2021.111910","DOIUrl":"https://doi.org/10.5121/csit.2021.111910","url":null,"abstract":"The inspiration for the creation of this app stemmed from the deeply rooted history of eating disorders in sports, particularly in sports that emphasize appearance and muscularity which often includes gymnastics, figure skating, dance, and diving [1]. All three sports require rapid rotation in the air which automatically results in the necessity of a more stringent weight requirement. Eating disorders can also be aggravated by sports who focus on individual performances rather than team-oriented like basketball or soccer [5]. According to research, up to thirteen percent of all athletes have, or are currently suffering from a form of eating disorder such as anorexia [2] and bulimia [3]. In the National Collegiate Athletic Association, it is estimated that up to sixteen percent of male athletes and forty-five percent of female athletes have been diagnosed with an eating disorder.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"90 21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129844645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Mitigation Techniques to Overcome Data Harm in Model Building for ML","authors":"A. Arslan","doi":"10.5121/csit.2021.111916","DOIUrl":"https://doi.org/10.5121/csit.2021.111916","url":null,"abstract":"Given the impact of Machine Learning (ML) on individuals and the society, understanding how harm might be occur throughout the ML life cycle becomes critical more than ever. By offering a framework to determine distinct potential sources of downstream harm in ML pipeline, the paper demonstrates the importance of choices throughout distinct phases of data collection, development, and deployment that extend far beyond just model training. Relevant mitigation techniques are also suggested for being used instead of merely relying on generic notions of what counts as fairness.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129252351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Improved Framework for C-V2X Systems with Data Integration and Identity-based Authentication","authors":"Rui Huang","doi":"10.5121/csit.2021.111905","DOIUrl":"https://doi.org/10.5121/csit.2021.111905","url":null,"abstract":"Current trends of autonomous driving apply the hybrid use of on-vehicle and roadside smart devices to perform collaborative data sensing and computing, so as to achieve a comprehensive and stable decision making. The integrated system is usually named as C-V2X. However, several challenges have significantly hindered the development and adoption of such systems. For example, the difficulty of accessing multiple data protocols of multiple devices at the bottom layer, and the centralized deployment of computing arithmetic power. Therefore, this work proposes a novel framework for the design of C-V2X systems. First, a highly aggregated architecture is designed with fully integration with multiple traffic data resources. Then a multilevel information fusion model is designed based on multi-sensors in vehicle-road coordination. The model can fit different detection environments, detection mechanisms, and time frames. Finally, a lightweight and efficient identity-based authentication method is given. The method can realize bidirectional authentication between end devices and edge gateways.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132503607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The use of Big Data in Machine Learning Algorithm","authors":"Yew Kee Wong","doi":"10.5121/csit.2021.111911","DOIUrl":"https://doi.org/10.5121/csit.2021.111911","url":null,"abstract":"In the information era, enormous amounts of data have become available on hand to decision makers. Big data refers to datasets that are not only big, but also high in variety and velocity, which makes them difficult to handle using traditional tools and techniques. Due to the rapid growth of such data, solutions need to be studied and provided in order to handle and extract value and knowledge from these datasets. Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention. Such minimal human intervention can be provided using big data analytics, which is the application of advanced analytics techniques on big data. This paper aims to analyse some of the different machine learning algorithms and methods which can be applied to big data analysis, as well as the opportunities provided by the application of big data analytics in various decision making domains.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"211 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114223495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Unsupervised Named Entity Recognition for Hi-Tech Domain","authors":"Abinaya Govindan, Gyan Ranjan, Amit Verma","doi":"10.5121/csit.2021.111917","DOIUrl":"https://doi.org/10.5121/csit.2021.111917","url":null,"abstract":"This paper presents named entity recognition as a multi-answer QA task combined with contextual natural-language-inference based noise reduction. This method allows us to use pre-trained models that have been trained for certain downstream tasks to generate unsupervised data, reducing the need for manual annotation to create named entity tags with tokens. For each entity, we provide a unique context, such as entity types, definitions, questions and a few empirical rules along with the target text to train a named entity model for the domain of our interest. This formulation (a) allows the system to jointly learn NER-specific features from the datasets provided, and (b) can extract multiple NER-specific features, thereby boosting the performance of existing NER models (c) provides business-contextualized definitions to reduce ambiguity among similar entities. We conducted numerous tests to determine the quality of the created data, and we find that this method of data generation allows us to obtain clean, noise-free data with minimal effort and time. This approach has been demonstrated to be successful in extracting named entities, which are then used in subsequent components.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132759410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ednaldo de Souza Vilela, Filipe Dias, Marcos B. L. Dalmau
{"title":"Development of Administration Professional Competences in Brazilian Public Universities: A Multicase Study in Florianópolis","authors":"Ednaldo de Souza Vilela, Filipe Dias, Marcos B. L. Dalmau","doi":"10.5121/csit.2021.111903","DOIUrl":"https://doi.org/10.5121/csit.2021.111903","url":null,"abstract":"The article aims to investigate how the development of competences applied to the professional formation of the egress administrator of public municipal higher education institutions in the Florianópolis region occurs under perspective of teachers and coordinators of the bachelor's degree in administration course. For this, a qualitative and documentary research was carried out, using a structured questionnaire applied to 20 people as a data collection instrument, including 2 course coordinators and 18 professors from the studied institutions who teach the subjects whose contents are related to professional formation from the administrator. The results show that the new national curriculum guidelines encourage the development of competences. In this context, despite the effort to comply with such devices, there is some misalignment between the teaching plans and the pedagogical project of the course. Difficulties in implementing formation based on competence and lack of institutional stimuli are also perceived.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"33 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120892102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"E-Teaching and E-Learning in Crisis Situations: Their Effect on New Directions of Thinking in Higher Education","authors":"N. Davidovitch, R. Wadmany","doi":"10.5121/csit.2021.111908","DOIUrl":"https://doi.org/10.5121/csit.2021.111908","url":null,"abstract":"The COVID-19 year was a difficult and challenging year in all areas of life. The academic world as well was compelled, in a matter of days, to shift from face-to-face learning on campus to e-Learning from a distance, with no adequate preparation. Despite the difficulties generated by e-Learning and students’ many complaints, the Israeli Council for Higher Education and institutions of higher education are preparing for a new era, where online courses will constitute an integral part of studies. The purpose of the study was to examine the attitude of lecturers and students to the benefits and shortcomings of e-teaching with its various aspects from a systemic, multi-institutional perspective. The study included 2,015 students and 223 lecturers from different academic institutions: universities, academic colleges of education, academic colleges of engineering, and private colleges.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130288835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated Chinese Essay Scoring using Pre-Trained Language Models","authors":"Lulu Dong, Lin Li, Hongchao Ma, Yeling Liang","doi":"10.5121/csit.2021.111901","DOIUrl":"https://doi.org/10.5121/csit.2021.111901","url":null,"abstract":"Automated Essay Scoring (AES) aims to assign a proper score to an essay written by a given prompt, which is a significant application of Natural Language Processing (NLP) in the education area. In this work, we focus on solving the Chinese AES problem by Pre-trained Language Models (PLMs) including state-of-the-art PLMs BERT and ERNIE. A Chinese essay dataset has been built up in this work, by which we conduct extensive AES experiments. Our PLMs-based AES models acquire 68.70% in Quadratic Weighted Kappa (QWK), which outperform classic feature-based linear regression AES model. The results show that our methods effectively alleviate the dependence on manual features and improve the portability of AES models. Furthermore, we acquire well-performed AES models with a limited scale of the dataset, which solves the lack of datasets in Chinese AES.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127683333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Intelligent Data-Driven Analytics System for Operation Management, Budgeting, and Resource Allocation using Machine Learning and Data Analytics","authors":"Dele Fei, Yu Sun","doi":"10.5121/csit.2021.111904","DOIUrl":"https://doi.org/10.5121/csit.2021.111904","url":null,"abstract":"This is a data science project for a manufacturing company in China [1]. The task was to forecast the likelihood that each product would need repair or service by a technician in order to forecast how often the products would need to be serviced after they were installed. That forecast could then be used to estimate the correct price for selling a product warranty [2]. The underlying forecast model in the R Programming language for all of the companies products is established. In addition, an interactive web app using R Shiny is developed so the business could see the forecast and recommended warranty price for each of their products and customer types [3]. The user can select a product and customer type and input the number of products and the web app displays charts and tables that show the probability of the product needing service over time, the forecasted costs of service, along with potential income and the recommended warranty price.","PeriodicalId":193651,"journal":{"name":"NLP Techniques and Applications","volume":"655 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116094786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}