{"title":"Automatic Annotation of HTML Documents Using the Microdata Standard","authors":"T. F. Ibragimov, A. A. Ferenets","doi":"10.3103/S0005105525700359","DOIUrl":null,"url":null,"abstract":"<p>The development of an application that is based on machine learning methods for automatic annotation of web pages according to the Microdata standard is described, with the possibility of an extension to other standards and injecting data to JSX files. Datasets were collected and prepared for training machine learning (ML) models. The ML model metrics were collected and analyzed.</p>","PeriodicalId":42995,"journal":{"name":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","volume":"58 5 supplement","pages":"S283 - S288"},"PeriodicalIF":0.5000,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.3103/S0005105525700359","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The development of an application that is based on machine learning methods for automatic annotation of web pages according to the Microdata standard is described, with the possibility of an extension to other standards and injecting data to JSX files. Datasets were collected and prepared for training machine learning (ML) models. The ML model metrics were collected and analyzed.
期刊介绍:
Automatic Documentation and Mathematical Linguistics is an international peer reviewed journal that covers all aspects of automation of information processes and systems, as well as algorithms and methods for automatic language analysis. Emphasis is on the practical applications of new technologies and techniques for information analysis and processing.