yoU aRe a Liar://A Unified Framework for Cross-Testing URL Parsers

2022 IEEE Security and Privacy Workshops (SPW) Pub Date : 2022-05-01 DOI:10.1109/spw54247.2022.9833883

Dashmeet Kaur Ajmani, Igibek Koishybayev, A. Kapravelos

{"title":"yoU aRe a Liar://A Unified Framework for Cross-Testing URL Parsers","authors":"Dashmeet Kaur Ajmani, Igibek Koishybayev, A. Kapravelos","doi":"10.1109/spw54247.2022.9833883","DOIUrl":null,"url":null,"abstract":"A variety of attacks, including phishing, remote-code execution, server-side request forgery, and hostname redirection, are delivered to users over the web. The beginning of most of the web exploits is an innocent-looking URL. Malformed or misinterpreted URLs can lead to remote code execution attacks as well. The IETF and WHATWG standards organizations define the components of a URL and act as an implementation guide for URL parsers. They state which characters are allowed in each portion of the URL and loosely suggest what to do in case an undefined character is present in the URL. The existence of two standards is the first concern, and the addition of server-side request forgery in the latest version of OWASP Top 10, suggests that neither of these standards is being followed accurately and concisely. Moreover, neither of these specifications describe an exact implementation standard, causing inconsistencies in the way the various parsers interpret the same URL. For example, malicious users can find ways to craft URLs to look like they are pointing to one resource but actually direct the user to different one. This problem is worsened when one application uses two separate parsers for validation and resource fetching.In this paper, we design a framework that unifies the testing suites of 8 URL parsers from popular web-related projects and highlights the inconsistencies between them. We examine and dive deep into the URL parser implementation across the most popular libraries, browsers, and command-line tools, and discover many open areas for exploitation. Our findings include identifying categories of inconsistencies, developing proof-of-concept exploits, and highlighting the need for a comprehensive implementation standard to be developed and enforced at the earliest.","PeriodicalId":334852,"journal":{"name":"2022 IEEE Security and Privacy Workshops (SPW)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE Security and Privacy Workshops (SPW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/spw54247.2022.9833883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

A variety of attacks, including phishing, remote-code execution, server-side request forgery, and hostname redirection, are delivered to users over the web. The beginning of most of the web exploits is an innocent-looking URL. Malformed or misinterpreted URLs can lead to remote code execution attacks as well. The IETF and WHATWG standards organizations define the components of a URL and act as an implementation guide for URL parsers. They state which characters are allowed in each portion of the URL and loosely suggest what to do in case an undefined character is present in the URL. The existence of two standards is the first concern, and the addition of server-side request forgery in the latest version of OWASP Top 10, suggests that neither of these standards is being followed accurately and concisely. Moreover, neither of these specifications describe an exact implementation standard, causing inconsistencies in the way the various parsers interpret the same URL. For example, malicious users can find ways to craft URLs to look like they are pointing to one resource but actually direct the user to different one. This problem is worsened when one application uses two separate parsers for validation and resource fetching.In this paper, we design a framework that unifies the testing suites of 8 URL parsers from popular web-related projects and highlights the inconsistencies between them. We examine and dive deep into the URL parser implementation across the most popular libraries, browsers, and command-line tools, and discover many open areas for exploitation. Our findings include identifying categories of inconsistencies, developing proof-of-concept exploits, and highlighting the need for a comprehensive implementation standard to be developed and enforced at the earliest.

查看原文本刊更多论文

你是一个骗子://交叉测试URL解析器的统一框架

网络钓鱼、远程代码执行、服务器端请求伪造、主机名重定向等各种攻击通过web传递给用户。大多数网络攻击的开始都是一个看似无辜的URL。格式错误或曲解的url也可能导致远程代码执行攻击。IETF和WHATWG标准组织定义了URL的组件，并作为URL解析器的实现指南。它们说明URL的每个部分允许使用哪些字符，并粗略地建议在URL中出现未定义字符时该如何处理。两个标准的存在是第一个问题，在最新版本的OWASP Top 10中增加了服务器端请求伪造，这表明这两个标准都没有得到准确和简明的遵循。此外，这两个规范都没有描述确切的实现标准，导致各种解析器解释相同URL的方式不一致。例如，恶意用户可以设法制作url，使其看起来指向一个资源，但实际上将用户引向不同的资源。当一个应用程序使用两个独立的解析器进行验证和获取资源时，这个问题会更加严重。在本文中，我们设计了一个框架，统一了来自流行的web相关项目的8个URL解析器的测试套件，并突出了它们之间的不一致性。我们深入研究了最流行的库、浏览器和命令行工具中的URL解析器实现，并发现了许多可供利用的开放领域。我们的发现包括识别不一致的类别，开发概念验证漏洞，并强调需要尽早开发和执行一个全面的实现标准。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE Security and Privacy Workshops (SPW)

自引率

0.00%

发文量