{"title":"Analysing crypto news sentiment to predict bitcoin prices","authors":"Abhishek Kumar","doi":"10.2139/ssrn.3913652","DOIUrl":"https://doi.org/10.2139/ssrn.3913652","url":null,"abstract":"The purpose of this paper is to analyse the ability of news content to predict cryptocurrency markets. The role of news announcement is central to pricing and revisions in pricing of any asset. Right from the 16th century news of ships arriving at ports with tradable goods resulted in the fluctuation of local market prices. In the digital age where TV screen flashes breaking news at microsecond frequency, the influence on the prices has never been more profound. Using news articles to predict markets has always been a bottleneck for traders, the major issue being transforming words and semantics to financial numbers. The meteoric rise of natural language processing in other fields has finally made this task possible for humans. Some common ways include transforming raw texts to bags of words, one hot encoding or advanced word embeddings and feeding them to ML models which is attempted in this paper.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129732079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Escaping the 'Impossibility of Fairness': From Formal to Substantive Algorithmic Fairness","authors":"Ben Green","doi":"10.2139/ssrn.3883649","DOIUrl":"https://doi.org/10.2139/ssrn.3883649","url":null,"abstract":"In the face of compounding crises of social and economic inequality, many have turned to algorithmic decision-making to achieve greater fairness in society. As these efforts intensify, reasoning within the burgeoning field of “algorithmic fairness” increasingly shapes how fairness manifests in practice. This paper interrogates whether algorithmic fairness provides the appropriate conceptual and practical tools for enhancing social equality. I argue that the dominant, “formal” approach to algorithmic fairness is ill-equipped as a framework for pursuing equality, as its narrow frame of analysis generates restrictive approaches to reform. In light of these shortcomings, I propose an alternative: a “substantive” approach to algorithmic fairness that centers opposition to social hierarchies and provides a more expansive analysis of how to address inequality. This substantive approach enables more fruitful theorizing about the role of algorithms in combatting oppression. The distinction between formal and substantive algorithmic fairness is exemplified by each approach’s responses to the “impossibility of fairness” (an incompatibility between mathematical definitions of algorithmic fairness). While the formal approach requires us to accept the “impossibility of fairness” as a harsh limit on efforts to enhance equality, the substantive approach allows us to escape the “impossibility of fairness” by suggesting reforms that are not subject to this false dilemma and that are better equipped to ameliorate conditions of social oppression.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114437009","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Real-Time Prediction of Online False Information Purveyors and their Characteristics","authors":"Anil R. Doshi, S. Raghavan, W. Schmidt","doi":"10.2139/ssrn.3725919","DOIUrl":"https://doi.org/10.2139/ssrn.3725919","url":null,"abstract":"Disinformation, misinformation, and other 'fake news' - collectively false information - is quick and inexpensive to create and distribute in our increasingly digital and connected world. Identifying false information early and cost effectively can offset some of those operational advantages. In this paper, we develop light-weight machine learning models that utilize (1) a novel data set tracking browsing behavior and (2) domain registration data that is available for all websites when they are established. Using only the domain registration data, we develop and demonstrate a machine learning classifier that identifies domains, at the time the domain is registered, that will go on to produce false information. We then combine this data with our browsing data and develop a machine learning classifier that identifies false information domains whose content is most associated with higher levels of consumption. Finally, we use our data to identify false information domains that will cease operations after an event of interest, in our case the 2016 U.S. presidential election. We theorize that the last category involves actors seeking primarily to manipulate perceptions and outcomes of that event.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127746937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Learning, Predictability, and Optimal Portfolio Returns","authors":"M. Babiak, Jozef Baruník","doi":"10.2139/ssrn.3688577","DOIUrl":"https://doi.org/10.2139/ssrn.3688577","url":null,"abstract":"We study optimal dynamic portfolio choice of a long-horizon investor who uses deep learning methods to predict equity returns when forming optimal portfolios. The results show statistically and economically significant out-of-sample portfolio benefits of deep learning as measured by high certainty equivalent returns and Sharpe ratios. Return predictability via deep learning generates substantially improved portfolio performance across different subsamples, particularly the recession periods. These gains are robust to including transaction costs, short-selling and borrowing constraints.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133275898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Impute, Select, Decision Tree and Naïve Bayes (ISE-DNC): An Ensemble Learning Approach to Classify the Lung Cancer","authors":"S. Bhanumathi, N. ChandrashekaraS","doi":"10.2139/ssrn.3667438","DOIUrl":"https://doi.org/10.2139/ssrn.3667438","url":null,"abstract":"In this work, we have introduced a hybrid novel approach to classify the lung cancer data using ensemble learning. According to this approach, first of all, we present data preprocessing model where missing values are imputed with the help of knn. Later, we incorporated filtering-based feature selection to reduce the feature dimension. Later, decision tree and Naive Bayes classifiers are used to create the ensemble learner. Finally, voting based decisions are made to classify the data. The proposed approach is represented as ISE-DNC (Impute, Select, Decision Tree and Naive Bayes) classifier. The proposed approach is implemented on two lung cancer public datasets which are obtained from the UCI repository. The experimental study shows that the proposed approach achieves 96.87% and 89.78% of classification accuracy for lung cancer and thoracic surgery dataset.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"205 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121234078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"AI and Algorithmic Bias: Source, Detection, Mitigation and Implications","authors":"Runshan Fu, Yan Huang, Param Vir Singh","doi":"10.2139/ssrn.3681517","DOIUrl":"https://doi.org/10.2139/ssrn.3681517","url":null,"abstract":"Artificial intelligence (AI) and machine learning (ML) algorithms are widely used throughout our economy in making decisions that have far-reaching impacts on employment, education, access to credit, and other areas. Initially considered neutral and fair, ML algorithms have recently been found increasingly biased, creating and perpetuating structural inequalities in society. With the rising concerns about algorithmic bias, a growing body of literature attempts to understand and resolve the issue of algorithmic bias. In this tutorial, we discuss five important aspects of algorithmic bias. We start with its definition and the notions of fairness policy makers, practitioners, and academic researchers have used and proposed. Next, we note the challenges in identifying and detecting algorithmic bias given the observed decision outcome, and we describe methods for bias detection. We then explain the potential sources of algorithmic bias and review several bias-correction methods. Finally, we discuss how agents’ strategic behavior may lead to biased societal outcomes, even when the algorithm itself is unbiased. We conclude by discussing open questions and future research directions.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124314075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Addressing Big Data and AI Challenges: A Taxonomy and Why the GDPR Cannot Provide a One-size-fits-all Solution","authors":"M. Tzanou","doi":"10.2139/ssrn.3654119","DOIUrl":"https://doi.org/10.2139/ssrn.3654119","url":null,"abstract":"The paper challenges the assumption that data privacy frameworks in general and the GDPR in particular can provide an appropriate regulatory solution for big data. It argues that in order to be able to properly reflect on regulatory approaches that grasp with big data challenges, closer attention should be paid to these particular challenges. In this respect, this chapter makes three distinct contributions to the debate regarding regulatory approaches to big data: First, it develops a taxonomy of big data challenges that allows a comprehensive overview of the issues at stake. Second, it examines the capabilities and limitations of the GDPR to address the risks identified in the proposed taxonomy. Third, it offers some suggestions on the pathways that regulators should be considering when approaching big data and AI.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115223315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Machine Learning and IoT based Real Time Parking System: Challenges and Implementation","authors":"R. Gupta, Geeta Rani","doi":"10.2139/ssrn.3563377","DOIUrl":"https://doi.org/10.2139/ssrn.3563377","url":null,"abstract":"There is a tremendous increase in number of vehicles in last two decades. So, it becomes important to make effective use of technology to enable hassle free parking at public and/or private places. In traditional parking systems, drivers face difficulty in finding available parking slots. These systems ignore the fact of parking the vehicles on roads, time management in peak hours, wrong parking of a vehicle in a parking slot. Moreover, the traditional systems require more human intervention in a parking zone. To deal with above said issues, there is an urgent requirement of developing Smart Parking Systems. In this manuscript, the authors propose a Smart Parking System based on IoT and Machine learning techniques to answer the real time management of parking and uncertainties. The proposed solution utilizes smart sensors, cloud computing and cyber physical system. Development of graphical user interface for administrator and end-user is a major challenge as it requires to ensure smooth monitoring, control and security of parking system. Moreover, it needs to establish effortless coordination with an end-user. The proposed system is successful in smartly addressing the challenges such as indicating status of parking slot well in advance to end-user, use of reserved and unreserved parking slots, wrong parking, unauthorized parking, real time analysis of free and occupied slots, detecting multiple objects in a parking slot such as bike in car slot, fault detection in one or more components and traffic management during peak hours. The system minimizes the human intervention and saves time, money and energy.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122576861","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A. Supriya, Chiluka Venkat, Aliketti Deepak, GV Hari Prasad
{"title":"Underwater Fish Images Classification by Deep Neural Network","authors":"A. Supriya, Chiluka Venkat, Aliketti Deepak, GV Hari Prasad","doi":"10.2139/ssrn.3622031","DOIUrl":"https://doi.org/10.2139/ssrn.3622031","url":null,"abstract":"Live fish recognition is one of the most crucial elements of fisheries survey applications where the vast amount of data is rapidly acquired. Different from general scenarios, challenges to underwater image recognition are posted by poor image quality, uncontrolled objects and environment, and difficulty in acquiring representative samples. In addition, most existing feature extraction techniques are hindered from automation due to involving human supervision. Toward this end, we propose an underwater fish recognition framework that consists of a fully unsupervised feature learning technique and an error-resilient classifier. In this development we used the deep neural network for development of system module. Neural network will provide the better accuracy under different conditions of input images and different targets. Experiments show that the proposed framework achieves high accuracy on both public and self-collected underwater fish images with high uncertainty and class imbalance.","PeriodicalId":189628,"journal":{"name":"InfoSciRN: Machine Learning (Sub-Topic)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133435376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}