Log Parser with One-to-One Markup

Zhang Chunyong, Xiaojing Meng
{"title":"Log Parser with One-to-One Markup","authors":"Zhang Chunyong, Xiaojing Meng","doi":"10.1109/ICICT50521.2020.00045","DOIUrl":null,"url":null,"abstract":"System logs are often used as the primary resource in data-driven methods to ensure system health and stability. The typical process of system log analysis is to first parse unstructured logs into structured data, and then apply data mining and machine learning techniques to analyze the data and build a workflow model. Existing log parsing methods focus on similar matching of log messages and log templates. We believe that the accuracy of log message parsing is the primary task of log parsing, so we propose One-to-One, a log parser that is marked one-to-one according to the rules duringthe matching process according to the token type and part of speech. Way to parse log messages online. We evaluated Oneto-One on different log sets and compared them with the three most advanced log parsing methods. The results show that our method is similar to the results of the other three methods in parsing simple logs. However, when parsing complex OpenStack logs, the accuracy can reach 98%, which is 20% higher than the best. It can parse tens of thousands of log messages per second. This method shows high efficiency and precision for all three types of test logs, and is applicable to modern system logs.","PeriodicalId":445000,"journal":{"name":"2020 3rd International Conference on Information and Computer Technologies (ICICT)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 3rd International Conference on Information and Computer Technologies (ICICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICT50521.2020.00045","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

System logs are often used as the primary resource in data-driven methods to ensure system health and stability. The typical process of system log analysis is to first parse unstructured logs into structured data, and then apply data mining and machine learning techniques to analyze the data and build a workflow model. Existing log parsing methods focus on similar matching of log messages and log templates. We believe that the accuracy of log message parsing is the primary task of log parsing, so we propose One-to-One, a log parser that is marked one-to-one according to the rules duringthe matching process according to the token type and part of speech. Way to parse log messages online. We evaluated Oneto-One on different log sets and compared them with the three most advanced log parsing methods. The results show that our method is similar to the results of the other three methods in parsing simple logs. However, when parsing complex OpenStack logs, the accuracy can reach 98%, which is 20% higher than the best. It can parse tens of thousands of log messages per second. This method shows high efficiency and precision for all three types of test logs, and is applicable to modern system logs.
具有一对一标记的日志解析器
在数据驱动的方法中,系统日志通常被用作主要资源,以确保系统的健康和稳定性。系统日志分析的典型流程是首先将非结构化日志解析为结构化数据,然后应用数据挖掘和机器学习技术对数据进行分析,并建立工作流模型。现有的日志解析方法侧重于日志消息和日志模板的相似匹配。我们认为日志消息解析的准确性是日志解析的首要任务,因此我们提出了一种一对一的日志解析器,它在匹配过程中根据标记类型和词性进行一对一的规则标记。在线解析日志消息的方法。我们在不同的日志集上评估了one - to- one,并将它们与三种最先进的日志解析方法进行了比较。结果表明,在解析简单日志时,我们的方法与其他三种方法的结果相似。但是,在解析复杂的OpenStack日志时,准确率可以达到98%,比最好的准确率高出20%。它每秒可以解析数万条日志消息。该方法对三种类型的测试日志均具有较高的效率和精度,适用于现代系统日志。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信