Web scraping: Jurisprudence and legal doctrines

Impact factor: 0.9 · JCR Q2 (Law)

Journal of World Intellectual Property · Published: 2024-11-09 · DOI: 10.1111/jwip.12331

Avv. Gino Fontana

Journal of World Intellectual Property, vol. 28, no. 1, pp. 197-212. https://onlinelibrary.wiley.com/doi/10.1111/jwip.12331


Abstract

Web scraping is a technique that allows the extraction of online information and data to train Generative Artificial Intelligence (GenAI) systems. Although the use of deep learning algorithms to produce user-requested outputs (texts, images, music and code) based on models learned from vast data sets dates back a few decades, its use has become fundamental with the recent development of GenAI and has been accompanied by the emergence of the first legal disputes. Doctrine and jurisprudence are called upon to consider the legal consequences arising from the combination of web scraping and GenAI, often encountering inadequate and fragmented legislation. Laws and regulations vary significantly across different countries and regions, reflecting diverse priorities and legal approaches. However, while doctrine, regardless of the latitudes, agrees in condemning the illicit acts and abuses due not so much to the extraction method but to the use of the extracted data (where protected by intellectual property rights), jurisprudence (particularly in Europe and North America) has already had the opportunity to express divergent opinions in some leading cases.


Source journal

Journal of World Intellectual Property (Law)

CiteScore

1.50

Self-citation rate

0.00%

Publication count