利用机器学习改进美国政府合同的公开报告

INFORMS J. Appl. Anal. Pub Date : 2021-08-17 DOI:10.1287/lytx.2021.04.23n

William A. Muir, Daniel Reich

{"title":"利用机器学习改进美国政府合同的公开报告","authors":"William A. Muir, Daniel Reich","doi":"10.1287/lytx.2021.04.23n","DOIUrl":null,"url":null,"abstract":"The U.S. government procures more than $500 billion annually in goods and services on public contracts, which it classifies using a hierarchical product and service taxonomy. Classification serves several purposes, including transparency in the use of taxpayer funding; reporting, tracing, and segmenting government expenditures; budgeting; and forecasting. Government acquisition personnel have historically performed these classifications manually, resulting in a process that is time-consuming and error-prone and offers limited visibility into government purchases. The problem faced is not unique to the public sector and is common across retail, manufacturing, and healthcare, among other settings. Using almost 4 million historical data records on governmental purchases, we fit a series of classifiers and demonstrate (a) superior performance when explicitly modeling the hierarchical structure of information domains through the use of top-down strategies and (b) the effectiveness of character-level convolutional neural networks when textual inputs are terse and contain irregularities such as abnormal character combinations and misspellings, which are common in government contracts. Our machine learning models are embedded in multiple software applications, including a web application that we developed, used by federal government personnel and other contracting professionals.","PeriodicalId":430990,"journal":{"name":"INFORMS J. Appl. Anal.","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Using Machine Learning to Improve Public Reporting on U.S. Government Contracts\",\"authors\":\"William A. Muir, Daniel Reich\",\"doi\":\"10.1287/lytx.2021.04.23n\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The U.S. government procures more than $500 billion annually in goods and services on public contracts, which it classifies using a hierarchical product and service taxonomy. Classification serves several purposes, including transparency in the use of taxpayer funding; reporting, tracing, and segmenting government expenditures; budgeting; and forecasting. Government acquisition personnel have historically performed these classifications manually, resulting in a process that is time-consuming and error-prone and offers limited visibility into government purchases. The problem faced is not unique to the public sector and is common across retail, manufacturing, and healthcare, among other settings. Using almost 4 million historical data records on governmental purchases, we fit a series of classifiers and demonstrate (a) superior performance when explicitly modeling the hierarchical structure of information domains through the use of top-down strategies and (b) the effectiveness of character-level convolutional neural networks when textual inputs are terse and contain irregularities such as abnormal character combinations and misspellings, which are common in government contracts. Our machine learning models are embedded in multiple software applications, including a web application that we developed, used by federal government personnel and other contracting professionals.\",\"PeriodicalId\":430990,\"journal\":{\"name\":\"INFORMS J. Appl. Anal.\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-08-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"INFORMS J. Appl. Anal.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1287/lytx.2021.04.23n\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"INFORMS J. Appl. Anal.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/lytx.2021.04.23n","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

美国政府每年通过公共合同采购超过5000亿美元的商品和服务，并使用分层产品和服务分类法对这些商品和服务进行分类。分类有几个目的，包括提高纳税人资金使用的透明度;报告、追踪和分割政府支出;预算;和预测。政府采购人员历来都是手动进行这些分类，这一过程既耗时又容易出错，而且对政府采购的可见性也很有限。所面临的问题并非公共部门所独有，在零售、制造业和医疗保健等领域都很常见。使用近400万政府采购的历史数据记录，我们拟合了一系列分类器，并证明了(a)通过使用自上而下的策略明确建模信息域的层次结构时的卓越性能;(b)当文本输入简洁且包含异常字符组合和拼写错误等不规则性时，字符级卷积神经网络的有效性，这些在政府合同中很常见。我们的机器学习模型嵌入到多个软件应用程序中，包括我们开发的web应用程序，供联邦政府人员和其他合同专业人员使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Using Machine Learning to Improve Public Reporting on U.S. Government Contracts

The U.S. government procures more than $500 billion annually in goods and services on public contracts, which it classifies using a hierarchical product and service taxonomy. Classification serves several purposes, including transparency in the use of taxpayer funding; reporting, tracing, and segmenting government expenditures; budgeting; and forecasting. Government acquisition personnel have historically performed these classifications manually, resulting in a process that is time-consuming and error-prone and offers limited visibility into government purchases. The problem faced is not unique to the public sector and is common across retail, manufacturing, and healthcare, among other settings. Using almost 4 million historical data records on governmental purchases, we fit a series of classifiers and demonstrate (a) superior performance when explicitly modeling the hierarchical structure of information domains through the use of top-down strategies and (b) the effectiveness of character-level convolutional neural networks when textual inputs are terse and contain irregularities such as abnormal character combinations and misspellings, which are common in government contracts. Our machine learning models are embedded in multiple software applications, including a web application that we developed, used by federal government personnel and other contracting professionals.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

INFORMS J. Appl. Anal.

自引率

0.00%

发文量