{"title":"Session details: Authentication and Intrusion Detection","authors":"Sam Bretheim","doi":"10.1145/3252888","DOIUrl":"https://doi.org/10.1145/3252888","url":null,"abstract":"","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"127 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128028295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Malware Analysis of Imaged Binary Samples by Convolutional Neural Network with Attention Mechanism","authors":"Hiromu Yakura, S. Shinozaki, R. Nishimura, Y. Oyama, Jun Sakuma","doi":"10.1145/3128572.3140457","DOIUrl":"https://doi.org/10.1145/3128572.3140457","url":null,"abstract":"This paper presents a method to extract important byte sequences in malware samples by application of convolutional neural network (CNN) to images converted from binary data. This method, by combining a technique called the attention mechanism into CNN, enables calculation of an \"attention map,\" which shows regions having higher importance for classification in the image. The extracted region with higher importance can provide useful information for human analysts who investigate the functionalities of unknown malware samples. Results of our evaluation experiment using malware dataset show that the proposed method provides higher classification accuracy than a conventional method. Furthermore, analysis of malware samples based on the calculated attention map confirmed that the extracted sequences provide useful information for manual analysis.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115432580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Malware Classification and Class Imbalance via Stochastic Hashed LZJD","authors":"Edward Raff, Charles K. Nicholas","doi":"10.1145/3128572.3140446","DOIUrl":"https://doi.org/10.1145/3128572.3140446","url":null,"abstract":"There are currently few methods that can be applied to malware classification problems which don't require domain knowledge to apply. In this work, we develop our new SHWeL feature vector representation, by extending the recently proposed Lempel-Ziv Jaccard Distance. These SHWeL vectors improve upon LZJD's accuracy, outperform byte n-grams, and allow us to build efficient algorithms for both training (a weakness of byte n-grams) and inference (a weakness of LZJD). Furthermore, our new SHWeL method also allows us to directly tackle the class imbalance problem, which is common for malware-related tasks. Compared to existing methods like SMOTE, SHWeL provides significantly improved accuracy while reducing algorithmic complexity to O(N). Because our approach is developed without the use of domain knowledge, it can be easily re-applied to any new domain where there is a need to classify byte sequences.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133994936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An Early Warning System for Suspicious Accounts","authors":"Hassan Halawa, M. Ripeanu, K. Beznosov, Baris Coskun, Meizhu Liu","doi":"10.1145/3128572.3140455","DOIUrl":"https://doi.org/10.1145/3128572.3140455","url":null,"abstract":"In the face of large-scale automated cyber-attacks to large online services, fast detection and remediation of compromised accounts are crucial to limit the spread of new attacks and to mitigate the overall damage to users, companies, and the public at large. We advocate a fully automated approach based on machine learning to enable large-scale online service providers to quickly identify potentially compromised accounts. We develop an early warning system for the detection of suspicious account activity with the goal of quick identification and remediation of compromised accounts. We demonstrate the feasibility and applicability of our proposed system in a four month experiment at a large-scale online service provider using real-world production data encompassing hundreds of millions of users. We show that - even using only login data, features with low computational cost, and a basic model selection approach - around one out of five accounts later flagged as suspicious are correctly predicted a month in advance based on one week's worth of their login activity.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121211586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Generating Look-alike Names For Security Challenges","authors":"Shuchu Han, Yifan Hu, S. Skiena, Baris Coskun, Meizhu Liu, Hong Qin, Jaime Perez","doi":"10.1145/3128572.3140441","DOIUrl":"https://doi.org/10.1145/3128572.3140441","url":null,"abstract":"Motivated by the need to automatically generate behavior-based security challenges to improve user authentication for web services, we consider the problem of large-scale construction of realistic-looking names to serve as aliases for real individuals. We aim to use these names to construct security challenges, where users are asked to identify their real contacts among a presented pool of names. We seek these look-alike names to preserve name characteristics like gender, ethnicity, and popularity, while being unlinkable back to the source individual, thereby making the real contacts not easily guessable by attackers. To achive this, we introduce the technique of distributed name embeddings, representing names in a high-dimensional space such that distance between name components reflects the degree of cultural similarity between these strings. We present different approaches to construct name embeddings from contact lists observed at a large web-mail provider, and evaluate their cultural coherence. We demonstrate that name embeddings strongly encode gender and ethnicity, as well as name popularity. We applied this algorithm to generate imitation names in email contact list challenge. Our controlled user study verified that the proposed technique reduced the attacker's success rate to 26.08%, indistinguishable from random guessing, compared to a success rate of 62.16% from previous name generation algorithms. Finally, we use these embeddings to produce an open synthetic name resource of 1 million names for security applications, constructed to respect both cultural coherence and U.S. census name frequencies.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116347279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Linear Regression Against Training Data Poisoning","authors":"Chang Liu, Bo Li, Yevgeniy Vorobeychik, Alina Oprea","doi":"10.1145/3128572.3140447","DOIUrl":"https://doi.org/10.1145/3128572.3140447","url":null,"abstract":"The effectiveness of supervised learning techniques has made them ubiquitous in research and practice. In high-dimensional settings, supervised learning commonly relies on dimensionality reduction to improve performance and identify the most important factors in predicting outcomes. However, the economic importance of learning has made it a natural target for adversarial manipulation of training data, which we term poisoning attacks. Prior approaches to dealing with robust supervised learning rely on strong assumptions about the nature of the feature matrix, such as feature independence and sub-Gaussian noise with low variance. We propose an integrated method for robust regression that relaxes these assumptions, assuming only that the feature matrix can be well approximated by a low-rank matrix. Our techniques integrate improved robust low-rank matrix approximation and robust principle component regression, and yield strong performance guarantees. Moreover, we experimentally show that our methods significantly outperform state of the art both in running time and prediction error.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134431218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Beyond Big Data: What Can We Learn from AI Models?: Invited Keynote","authors":"Aylin Caliskan","doi":"10.1145/3128572.3140452","DOIUrl":"https://doi.org/10.1145/3128572.3140452","url":null,"abstract":"My research involves the heavy use of machine learning and natural language processing in novel ways to interpret big data, develop privacy and security attacks, and gain insights about humans and society through these methods. I do not use machine learning only as a tool but I also analyze machine learning models? internal representations to investigate how the artificial intelligence perceives the world. This work [3] has been recently featured in Science where I showed that societal bias exists at the construct level of machine learning models, namely semantic space word embeddings which are dictionaries for machines to understand language. When I use machine learning as a tool to uncover privacy and security problems, I characterize and quantify human behavior in language, including programming languages, by coming up with a linguistic fingerprint for each individual. By extracting linguistic features from natural language or programming language texts of humans, I show that humans have unique linguistic fingerprints since they all learn language on an individual basis. Based on this finding, I can de-anonymize humans that have written certain text, source code, or even executable binaries of compiled code [2, 4, 5]. This is a serious privacy threat for individuals that would like to remain anonymous, such as activists, programmers in oppressed regimes, or malware authors. Nevertheless, being able to identify authors of malicious code enhances security. On the other hand, identifying authors can be used to resolve copyright disputes or detect plagiarism. The methods in this realm [1] have been used to identify so called doppelgängers to link the accounts that belong to the same identities across platforms, especially underground forums that are business platforms for cyber criminals. By analyzing machine learning models? internal representation and linguistic human fingerprints, I am able to uncover facts about the world, society, and the use of language, which have implications for privacy, security, and fairness in machine learning.","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"408 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129254182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Lightning Round","authors":"David Mandell Freeman","doi":"10.1145/3252887","DOIUrl":"https://doi.org/10.1145/3252887","url":null,"abstract":"","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114555657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Session details: Defense against Poisoning","authors":"Luis Mu?oz-Gonz?lez","doi":"10.1145/3252889","DOIUrl":"https://doi.org/10.1145/3252889","url":null,"abstract":"","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129200830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Differentially Private Noisy Search with Applications to Anomaly Detection (Abstract)","authors":"D. M. Bittner, A. Sarwate, R. Wright","doi":"10.1145/3128572.3140456","DOIUrl":"https://doi.org/10.1145/3128572.3140456","url":null,"abstract":"We consider the problem of privacy-sensitive anomaly detection - screening to detect individuals, behaviors, areas, or data samples of high interest. What defines an anomaly is context-specific; for example, a spoofed rather than genuine user attempting to log in to a web site, a fraudulent credit card transaction, or a suspicious traveler in an airport. The unifying assumption is that the number of anomalous points is quite small with respect to the population, so that deep screening of all individual data points would potentially be time-intensive, costly, and unnecessarily invasive of privacy. Such privacy violations can raise concerns due sensitive nature of data being used, raise fears about violations of data use agreements, and make people uncomfortable with anomaly detection methods. Anomaly detection is well studied, but methods to provide anomaly detection along with privacy are less well studied. Our overall goal in this research is to provide a framework for identifying anomalous data while guaranteeing quantifiable privacy in a rigorous sense. Once identified, such anomalies could warrant further data collection and investigation, depending on the context and relevant policies. In this research, we focus on privacy protection during the deployment of anomaly detection. Our main contribution is a differentially private access mechanism for finding anomalies using a search algorithm based on adaptive noisy group testing. To achieve this, we take as our starting point the notion of group testing [1], which was most famously used to screen US military draftees for syphilis during World War II. In group testing, individuals are tested in groups to limit the number of tests. Using multiple rounds of screenings, a small number of positive individuals can be detected very efficiently. Group testing has the added benefit of providing privacy to individuals through plausible deniability - since the group tests use aggregate data, individual contributions to the test are masked by the group. We follow on these concepts by demonstrating a search model utilizing adaptive queries on aggregated group data. Our work takes the first steps toward strengthening and formalizing these privacy concepts by achieving differential privacy [2]. Differential privacy is a statistical measure of disclosure risk that captures the intuition that an individual's privacy is protected if the results of a computation have at most a very small and quantifiable dependence on that individual's data. In the last decade, there hpractical adoption underway by high-profile companies such as Apple, Google, and Uber. In order to make differential privacy meaningful in the context of a task that seeks to specifically identify some (anomalous) individuals, we introduce the notion of anomaly-restricted differential privacy. 
Using ideas from information theory, we show that noise can be added to group query results in a way that provides differential privacy for non-anomalous indi","PeriodicalId":318259,"journal":{"name":"Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133481622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
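The two ingredients of the mechanism - Laplace noise on group aggregates for differential privacy, and adaptive halving over groups for search - can be sketched as follows. Per-round epsilon accounting and the paper's anomaly-restricted privacy definition are elided; the function names and parameters are assumptions for illustration.

```python
# Sketch: differentially private noisy group testing via binary search.
import numpy as np

rng = np.random.default_rng(3)

def noisy_count(flags: np.ndarray, epsilon: float) -> float:
    """Laplace mechanism on a counting query (sensitivity 1)."""
    return flags.sum() + rng.laplace(scale=1.0 / epsilon)

def find_anomaly(flags: np.ndarray, epsilon: float) -> int:
    """Search for a single anomalous index using only noisy group counts."""
    lo, hi = 0, len(flags)
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if noisy_count(flags[lo:mid], epsilon) > 0.5:
            hi = mid  # noisy evidence the anomaly lies in the left half
        else:
            lo = mid
    return lo

population = np.zeros(1024); population[417] = 1  # one anomalous individual
# Single noisy queries can mislead; a real system would repeat queries or
# calibrate noise across rounds to control the failure probability.
guess = find_anomaly(population, epsilon=1.0)
```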