The effects of data mining techniques on software cost estimation

Karen T. Lum, Daniel R. Baker, J. Hihn
{"title":"The effects of data mining techniques on software cost estimation","authors":"Karen T. Lum, Daniel R. Baker, J. Hihn","doi":"10.1109/IEMCE.2008.4617949","DOIUrl":null,"url":null,"abstract":"Current research at JPL incorporates data mining and machine learning techniques to see whether a better software cost model can be developed. 2CEE is a tool developed for developing new software cost estimation models using data mining techniques. The accuracy of these models has been validated internally through leave-one out cross validation. However, the newly generated models have not been validated to see how well they predict in the real world. Our study seeks to find out how well these machine learning based models perform against standard models for eighteen new flight and ground software projects. The accurate performance of the models against current real world projects is extremely important for practitioners to adapt new techniques.","PeriodicalId":408691,"journal":{"name":"2008 IEEE International Engineering Management Conference","volume":"202 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 IEEE International Engineering Management Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCE.2008.4617949","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Current research at JPL incorporates data mining and machine learning techniques to see whether a better software cost model can be developed. 2CEE is a tool developed for developing new software cost estimation models using data mining techniques. The accuracy of these models has been validated internally through leave-one out cross validation. However, the newly generated models have not been validated to see how well they predict in the real world. Our study seeks to find out how well these machine learning based models perform against standard models for eighteen new flight and ground software projects. The accurate performance of the models against current real world projects is extremely important for practitioners to adapt new techniques.
数据挖掘技术对软件成本估算的影响
喷气推进实验室目前的研究结合了数据挖掘和机器学习技术,看看是否可以开发出更好的软件成本模型。2CEE是一个使用数据挖掘技术开发新的软件成本估算模型的工具。这些模型的准确性已经通过留一交叉验证内部验证。然而,新生成的模型还没有经过验证,看它们在现实世界中的预测效果如何。我们的研究旨在找出这些基于机器学习的模型与18个新的飞行和地面软件项目的标准模型相比表现如何。模型对当前现实世界项目的准确性能对于从业者适应新技术是极其重要的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信