{"title":"Augmenting Image Aesthetic Assessment with Diverse Deep Features","authors":"Rui Lin","doi":"10.1145/3508259.3508264","DOIUrl":"https://doi.org/10.1145/3508259.3508264","url":null,"abstract":"With the increasing prevalence of digital images, automatically assessing the aesthetic quality of photos could benefit many real-world applications. While many previous methods have produced binary classification results, this paper proposes a model to produce regression results with high accuracy. The proposed model exploits global visual information such as color palette, saturation, and clarity, as well as deep features like blur maps, saliency maps, and scene information to augment the DenseNet architecture. The augmented DenseNet, when evaluated on the AVA dataset, outperformed the current state-of-the-art methods, achieving an accuracy of 88.65% on the 10% subset and a Spearman's rank correlation coefficient of 0.5802 on the full dataset. Comparison of the augmented DenseNet and the DenseNet baseline also demonstrate the effectiveness of the proposed methods of augmentation.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122753330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Y. Indrianti, Sasmoko, S. R. Manalu, Queensy Lovenia Kerrin Waromi
{"title":"Literature Review Profiles of Specialization in Education and Profession as the basis for the development of Artificial Intelligence Website","authors":"Y. Indrianti, Sasmoko, S. R. Manalu, Queensy Lovenia Kerrin Waromi","doi":"10.1145/3508259.3508296","DOIUrl":"https://doi.org/10.1145/3508259.3508296","url":null,"abstract":"Profiles of specialization in education and profession are currently heavily influenced by the development of industry 4.0. The era of disruption is marked by massive changes due to innovations that change business systems and arrangements to newer levels. In various studies, there will be many professions that are extinct but there will also be many new professions that will be born. This study aims to conduct a literature review of profiles of specialization in education and profession that are relevant to industry 4.0 trends which will then be developed in a measurement through an artificial intelligence-based website. The research method used is a bibliometric approach through scientific studies of various library sources using VOSViewer with cartographic overlay techniques and density visualization maps, to represent sequences and their relationships. While the method used to develop website profiles of specialization in education and profession based on artificial intelligence is to use the waterfall method. The results of the literature review found 10 words with a high level of bibliometric correlation as the basis for developing artificial intelligence-based websites.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131896521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Hybrid Recommender System based on DeepCF and Wide Linear Model for K12 Online Education","authors":"Tuanji Gong, Xuan-xia Yao, Kate N. Sirota","doi":"10.1145/3508259.3508279","DOIUrl":"https://doi.org/10.1145/3508259.3508279","url":null,"abstract":"Compared to traditional classroom education, online education has the advantage of personalized teaching and is able to significantly improve learning efficiency and outcomes. K12 online education refers to online education serving students across 12 grade levels, from primary school to high school. As there are a plethora of courses offered on online education platforms, how to recommend proper courses for students is a serious challenge. In order to resolve this problem, this study proposes a hybrid recommendation model that combines the deep collaborative filtering (DeepCF) model and the wide linear model. Together, in the hybrid model, these models can integrate course features, student features, and side information. The DeepCF model learns low-dimension latent representations for both courses and students and integrates them into matrix factorization to predict ratings. The wide linear model uses a factorization machine to design and select features automatically. The hybrid model can achieve good performance and alleviate the problem of sparse features and cold start. Experimental results demonstrate that compared with the collaborative filter model, the hybrid model achieved a significant improvement with a 12.7% relative increase in AUC metric.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124844593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Given Name Sasmoko, Y. Indrianti, Davy Ronald Hermanus
{"title":"LINEN ASSESSMENT APPS: Artificial Intelligence Integration to develop Android-based applications that measure the \"literacy, numeracy, entrepreneurial mindset\" capacity of students in Indonesia","authors":"Given Name Sasmoko, Y. Indrianti, Davy Ronald Hermanus","doi":"10.1145/3508259.3508295","DOIUrl":"https://doi.org/10.1145/3508259.3508295","url":null,"abstract":"The LINEN Assessment application is an application developed to measure the capacity of students in Indonesia in terms of literacy, numeracy and entrepreneurial mindset. To be able to get an automatic profiling picture and be able to provide more accurate predictions, the application was developed based on artificial intelligence. This study aims to provide an overview of the integration of artificial intelligence that is carried out to be applied in the process of developing the LINEN Assessment application. The research method used is the Neuroresearch research method. Through the three stages in this method, namely exploratory, explanatory and confirmatory research, the research process is carried out to produce a research construct as the basis for compiling a standard and valid LINEN Assessment instrument. The application development stage itself uses the waterfall method, specifically through the agile development method. The results of the study are illustrated by the presence of several diagrams showing the integration process carried out. The description of the LINEN Assessment application also shows that the LINEN Assessment Apps were developed with the concept of self-assessment and diagnostic-assessment to equip students with fundamental skills and competencies that are important for their future. One of these important foundations is when students have basic skills, namely literacy, numeracy and skills that refer to innovative creative power, namely entrepreneurial mindset skills. Students can independently use smartphones to explore their own profiles and explore learning experiences through the LINEN Assessment application.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125566491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"COVID-19 Diagnosis Using Chest X-ray Images via Classification and Object Detection","authors":"Kenji Yoshitsugu, Y. Nakamoto","doi":"10.1145/3508259.3508268","DOIUrl":"https://doi.org/10.1145/3508259.3508268","url":null,"abstract":"We diagnose the symptoms and the localization of the affected area in COVID-19 cases/patients using chest x-ray images provided by the Kaggle competition. By training and predicting symptoms and the localization of the affected area using the YOLOv5 object detection algorithm, we obtained a low accuracy of approximately 20%. However, we improved the accuracy to approximately 80% by using the image classification model Keras / EfficientNetB7, in addition to YOLOv5. Although it is difficult to detect visually ambiguous objects such as pneumonia, we believe that we can improve the accuracy by training/predicting symptoms using the image classification model and the localization of the affected area using the object detection algorithm.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121596012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reconfigurable Hybrid Model Convolutional Stage – Infinity Laplacian Applied to Depth Completion","authors":"V. Lazcano, F. Calderero","doi":"10.1145/3508259.3508275","DOIUrl":"https://doi.org/10.1145/3508259.3508275","url":null,"abstract":"Convolutional networks are the current approach that presents the best performance in many applications. The principal critic of classical models is that they are hand-crafted by the designer. Still, the proposed architecture of a convolutional network is also hand-crafted, for example, the number of layers. Depth map completion is crucial for computer vision due to its applications in different fields such as video games or autonomous vehicles. Depth maps are acquired by a sensor or obtained by a stereo algorithm and present a lack of information due to occlusions or sensor misinterpretation. In this paper, we offer a reconfigurable hybrid model to interpolate depth maps. This model consists of a convolutional stage (SC1) pipeline, interpolation model, and convolutional stage (SC2). The convolutional stage input is a color reference image of the scene, creating a color features map as input for the next step. The interpolation model is the infinity Laplacian. We interpolated the incomplete depth map solving the Infinity Laplacian in a Manifold. Then, the completed depth map is processed again by the last convolutional stage. In this pipeline, we used a fixed number of convolutional filters, but we can interchange convolutional steps, i.e., interchange SC1 by SC2, reconfiguring the computing sequence. We estimated the parameters of the convolutional filter and the Infinity Laplacian using Particle Swarm Optimization (PSO). Our proposal obtained MSE=1.315 in the KITTI depth completion suite outperforming some contemporaneous methods.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132156414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"DECA: DoD Enterprise Cloud Architecture Concept for Cloud-Based Cross Domain Solutions","authors":"Leonardo Aguilera, D. Jacobson","doi":"10.1145/3508259.3508283","DOIUrl":"https://doi.org/10.1145/3508259.3508283","url":null,"abstract":"The Department of Defense (DoD) battlefield exists both in cyber and the physical world. Information sharing is a top priority for the DoD in support of our warfighters and allies. To maintain military technological advantage and superiority, access to information and the capacity to process it are critical components for empowering the warfighter for mission success. The volume of information shared has increased exponentially, necessitating the development of a DoD enterprise cloud capable of sustaining, and supporting strategic worldwide DoD missions through effective information sharing. However, the existing U.S. Government cloud design does not support enterprise use, and legacy software and hardware applications such as a Cross Domain Solution (CDS) will need to be re-architected, certified, accredited, and authorized for future enterprise cloud use. A CDS is a requirement for information sharing in both unclassified and classified systems and information transmission from one system to another, but there must also be a DoD enterprise cloud structure to leverage the CDS in the U.S. Government cloud. The purpose of this research is to explore the future possibilities of using an enterprise cloud CDS and to present a conceptual design for a DoD enterprise cloud architecture that will save the DoD time and money in the certification process while also allowing efficient information sharing across multiple DoD Command and Control (C2) systems. To have this architecture design approved and accredited by the DoD for future use, we adhere to the Federal Risk and Authorization Management Program (FedRAMP), a process required for federal agency cloud deployments and the National Institute of Standards and Technology (NIST) standards. We use existing systems from across the DoD and allies found in the open literature as a baseline.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122175968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sahith Kurapati, Om Shreenidhi BN, Saksham Gupta, Kalp Abhinn Aghada, Jayashree Rangareddy
{"title":"Colorizing Black and White Videos with Damping Factor and Pixel Tolerance: Reducing Flickering Effect","authors":"Sahith Kurapati, Om Shreenidhi BN, Saksham Gupta, Kalp Abhinn Aghada, Jayashree Rangareddy","doi":"10.1145/3508259.3508274","DOIUrl":"https://doi.org/10.1145/3508259.3508274","url":null,"abstract":"Colored videos are more attractive to audiences compared to black and white videos. This can be seen through recent adaptions of old black and white movies into their colored versions. Given the relevance in the industry, this project tackles the problem of colorizing videos effectively. Previous attempts on the same topic could not do this efficiently as they always encountered the vital issue of a “flickering effect.” The said flickering effect is observed when the frames are colored individually and then reassembled to form a video. To solve this issue, we introduce two parameters, namely, a damping factor and a pixel tolerance. We have also fine-tuned the parameters for the most optimal result through experimentation. Finally, using custom metrics, we showcase the efficiency and effectiveness of this method.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126595863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep-Learning-Based Diagnosis of Cassava Leaf Diseases Using Vision Transformer","authors":"Li Zhuang","doi":"10.1145/3508259.3508270","DOIUrl":"https://doi.org/10.1145/3508259.3508270","url":null,"abstract":"Viral diseases are major causes leading to the poor yields of cassava, which is the second-largest source of food carbohydrates in Africa. As symptoms of these diseases can usually be identified by inspecting cassava leafs, visual diagnosis of cassava leaf diseases is of significant importance in food security and agriculture development. Considering the shortage of qualified agricultural experts, automatic approaches for the image-based detection of cassava leaf diseases are in great demand. In this paper, on the basis of Vision Transformer, we propose a deep learning method to identify the type of viral disease in a cassava leaf image. The image dataset of cassava leaves is provided by the Makerere Artificial Intelligence Lab in a Kaggle competition, consisting of 4 subtypes of diseases and healthy cassava leaves. Our results show that Vision-Transformer-based model can effectively achieve an excellent performance regarding the classification of cassava leaf diseases. After applying the K-Fold cross validation technique, our model reaches a categorization accuracy 0.9002 on the private test set. This score ranks top 3% in the leaderboard, and can get a silver medal prize in the Kaggle competition. Our method can be applied for the identification of diseased plants, and potentially prevent the irreparable damage of crops.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126677408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Combined Spatial Features Recognition Method for Similar Video Action Automatic Classification","authors":"Ling Wang, Yuanhao Mei, Shiru Gao, T. Zhou","doi":"10.1145/3508259.3508260","DOIUrl":"https://doi.org/10.1145/3508259.3508260","url":null,"abstract":"Due to the differences between general actions are obvious, these actions are easily recognized by traditional methods which are input with RGB data. But for similar actions, the differences between them are subtle. Especially when the background moves violently or the light is unstable, RGB data is not robust, and recognizing similar actions remains difficult. To improve the recognition accuracy of similar actions, the skeleton-based features and the difference between similar actions are focused on. In this paper, we proposed the FAU-Bi-LSTM algorithm, which could accurately recognize four types of similar activities (walking, running, riding bike, climbing stairs) based on the relative angle feature and relative distance feature and feature attribute unit, and these features could preserve the spatial difference information. The experiment results show that the FAU-Bi-LSTM algorithm could recognize these four types of similar activities by their SDC(Spatial-Difference-Contained) features and has a better performance in similar activities recognition.","PeriodicalId":259099,"journal":{"name":"Proceedings of the 2021 4th Artificial Intelligence and Cloud Computing Conference","volume":"1973 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130099414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}