Research and Implementation of Algorithm Based on Data Fusion Technology

Yuzi Dou, Xiafei Feng, Ruifeng Zhu, Tianzhu Gao, Yanbing Wu, Lei Ma
{"title":"Research and Implementation of Algorithm Based on Data Fusion Technology","authors":"Yuzi Dou, Xiafei Feng, Ruifeng Zhu, Tianzhu Gao, Yanbing Wu, Lei Ma","doi":"10.2991/MASTA-19.2019.60","DOIUrl":null,"url":null,"abstract":"As the growing amount of data stored on the Internet, the work of searching for information becomes complicated. The traditional collection method cannot achieve a certain effect, it is cumbersome and time-consuming. Using natural language processing technology and web crawler technology to collect and analyze data about student evaluation, the purpose is to obtain the key factors affecting teachers' comprehensive evaluation results and propose the methods to solve the problems. For the traditional Web crawler technology, there is a lack of certain intelligence, initiative, etc. the design of the best priority crawler framework has improved and optimized its structure. And the improved PageRank value, user demand correlation degree, and NDC algorithm denoising are added, it can effectively solve a series of problems such as long retrieval time, overlapping information, incomplete information, and improve the accuracy of information collection. Introduction The proposal of student evaluation system is to find a solution to the current situation according to the specific needs of students and teachers' teaching requirements. As an integrated data processing technology, data fusion technology is applied to many traditional disciplines and emerging fields, which can improve the accuracy and reliability of target rule mining and prediction. In [1] combined with the crawler technology, the acquisition and analysis of multi-source spatial data is demonstrated, which is beneficial to better assist the urban planning work; In [2], the design of the Internet public opinion analysis system based on the principle of data fusion, and the data fusion analysis processing is realized by combining the crawler technology with the natural language processing technology; In [3], it is proposed a personal credit scoring system based on multi-source data fusion, which combines the logistic regression model to improve the model estimation accuracy; In [4], the data is collected by adaptive weighted fusion method based on data fusion principle, and the Grubbs criterion is used to eliminate invalid data, so as to comprehensively deal with the problem of measuring the parameters of the inlet section of the test piece in the afterburner of an aero-engine. Overall System Structure Design This paper uses multi-source data fusion technology to search for keyword group information about student evaluation in the webpages. The first chapter introduces the overall chapter arrangement of this article. The second chapter introduces the proposed system architecture and optimization scheme. In the third chapter, Applied to the comprehensive analysis of students' evaluation. Chapter four gives a summary and suggestions. As shown in Figure 1. Figure 1. Overall architecture flow chart Start Optimizati on scheme Web page classificat ion Data processin g Word frequency statistics Similar word substituti on Keyword list (NLP) END International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019) Copyright © 2019, the Authors. Published by Atlantis Press. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). Advances in Intelligent Systems Research, volume 168","PeriodicalId":103896,"journal":{"name":"Proceedings of the 2019 International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2991/MASTA-19.2019.60","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

As the growing amount of data stored on the Internet, the work of searching for information becomes complicated. The traditional collection method cannot achieve a certain effect, it is cumbersome and time-consuming. Using natural language processing technology and web crawler technology to collect and analyze data about student evaluation, the purpose is to obtain the key factors affecting teachers' comprehensive evaluation results and propose the methods to solve the problems. For the traditional Web crawler technology, there is a lack of certain intelligence, initiative, etc. the design of the best priority crawler framework has improved and optimized its structure. And the improved PageRank value, user demand correlation degree, and NDC algorithm denoising are added, it can effectively solve a series of problems such as long retrieval time, overlapping information, incomplete information, and improve the accuracy of information collection. Introduction The proposal of student evaluation system is to find a solution to the current situation according to the specific needs of students and teachers' teaching requirements. As an integrated data processing technology, data fusion technology is applied to many traditional disciplines and emerging fields, which can improve the accuracy and reliability of target rule mining and prediction. In [1] combined with the crawler technology, the acquisition and analysis of multi-source spatial data is demonstrated, which is beneficial to better assist the urban planning work; In [2], the design of the Internet public opinion analysis system based on the principle of data fusion, and the data fusion analysis processing is realized by combining the crawler technology with the natural language processing technology; In [3], it is proposed a personal credit scoring system based on multi-source data fusion, which combines the logistic regression model to improve the model estimation accuracy; In [4], the data is collected by adaptive weighted fusion method based on data fusion principle, and the Grubbs criterion is used to eliminate invalid data, so as to comprehensively deal with the problem of measuring the parameters of the inlet section of the test piece in the afterburner of an aero-engine. Overall System Structure Design This paper uses multi-source data fusion technology to search for keyword group information about student evaluation in the webpages. The first chapter introduces the overall chapter arrangement of this article. The second chapter introduces the proposed system architecture and optimization scheme. In the third chapter, Applied to the comprehensive analysis of students' evaluation. Chapter four gives a summary and suggestions. As shown in Figure 1. Figure 1. Overall architecture flow chart Start Optimizati on scheme Web page classificat ion Data processin g Word frequency statistics Similar word substituti on Keyword list (NLP) END International Conference on Modeling, Analysis, Simulation Technologies and Applications (MASTA 2019) Copyright © 2019, the Authors. Published by Atlantis Press. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). Advances in Intelligent Systems Research, volume 168
基于数据融合技术的算法研究与实现
随着互联网上存储的数据量的增长,搜索信息的工作变得复杂起来。传统的收集方法不能达到一定的效果,它是繁琐和耗时的。利用自然语言处理技术和网络爬虫技术对学生评价数据进行收集和分析,得出影响教师综合评价结果的关键因素,并提出解决问题的方法。针对传统Web爬虫技术存在的缺乏一定的智能性、主动性等缺点,设计了最佳优先级爬虫框架,对其结构进行了改进和优化。并加入改进的PageRank值、用户需求关联度、NDC算法去噪等,有效解决了检索时间长、信息重叠、信息不完整等一系列问题,提高了信息采集的准确性。学生评价系统的提出是为了根据学生的具体需求和教师的教学要求,找到一种解决现状的方法。数据融合技术作为一种综合数据处理技术,被应用于许多传统学科和新兴领域,可以提高目标规则挖掘和预测的准确性和可靠性。[1]结合履带技术演示了多源空间数据的采集与分析,有利于更好地辅助城市规划工作;[2]设计了基于数据融合原理的互联网舆情分析系统,通过将爬虫技术与自然语言处理技术相结合来实现数据融合分析处理;[3]提出了一种基于多源数据融合的个人信用评分系统,结合logistic回归模型提高了模型估计精度;[4]采用基于数据融合原理的自适应加权融合方法采集数据,并采用Grubbs准则剔除无效数据,综合处理航空发动机加力燃烧室试件进气道截面参数测量问题。系统总体结构设计本文采用多源数据融合技术在网页中搜索学生评价关键字组信息。第一章介绍了本文的总体章节安排。第二章介绍了提出的系统架构和优化方案。第三章,应用于学生评价的综合分析。第四章对全文进行了总结和建议。如图1所示。图1所示。总体架构流程图启动优化方案网页分类数据处理词频统计相似词替换关键词列表(NLP) END建模、分析、仿真技术与应用国际会议(MASTA 2019)版权所有©2019,作者。亚特兰蒂斯出版社出版。这是一篇基于CC BY-NC许可(http://creativecommons.org/licenses/by-nc/4.0/)的开放获取文章。智能系统研究进展,第168卷
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信