Neural Network Based Transaction Classification System for Chinese Transaction Behavior Analysis

Jianyang Yu, Yuanyuan Qiao, Nanfei Shu, Kewu Sun, Shenshen Zhou, Jie Yang
{"title":"Neural Network Based Transaction Classification System for Chinese Transaction Behavior Analysis","authors":"Jianyang Yu, Yuanyuan Qiao, Nanfei Shu, Kewu Sun, Shenshen Zhou, Jie Yang","doi":"10.1109/BigDataCongress.2019.00021","DOIUrl":null,"url":null,"abstract":"With the rapid development of Chinese economy, it is significant to examine the economic activities in China. Each transaction behavior is recorded by the invoice. The invoice contains the transaction content, the classification of the transaction behavior (in accordance with the Tax Classification and Coding for Commodities and Services issued by the state) and transaction price, etc. Our work uses real mass invoice data collected from Zhejiang Province and conducts a multi-dimensional analysis of Chinese transaction behavior based on transaction behavior classification model. Firstly, we propose a compositional CNN-RNN model with attention mechanism to recommend the corresponding categories of transaction behavior collected from tax invoices. It maps the transaction behavior recorded in the invoice to transaction code in the Tax Classification and Coding for Commodities and Services issued by the state. Preliminary experiments show that the top-one accuracy of classifying transaction behavior achieves 75%. Then, we focus on the quantity distribution of invoice data and draw a conclusion that the major category with larger number of invoice records is more diversified in subdivided categories. After that, we studied the price distribution of various transaction behaviors to discover the difference in price distribution between different industries. Prices in the major categories of goods are more concentrated in the middle or lower prices. We can analyze the regional industrial structure through the price distribution of the industry which makes sense to study the economy of the region from the perspective of industry.","PeriodicalId":335850,"journal":{"name":"2019 IEEE International Congress on Big Data (BigDataCongress)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Congress on Big Data (BigDataCongress)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BigDataCongress.2019.00021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

With the rapid development of Chinese economy, it is significant to examine the economic activities in China. Each transaction behavior is recorded by the invoice. The invoice contains the transaction content, the classification of the transaction behavior (in accordance with the Tax Classification and Coding for Commodities and Services issued by the state) and transaction price, etc. Our work uses real mass invoice data collected from Zhejiang Province and conducts a multi-dimensional analysis of Chinese transaction behavior based on transaction behavior classification model. Firstly, we propose a compositional CNN-RNN model with attention mechanism to recommend the corresponding categories of transaction behavior collected from tax invoices. It maps the transaction behavior recorded in the invoice to transaction code in the Tax Classification and Coding for Commodities and Services issued by the state. Preliminary experiments show that the top-one accuracy of classifying transaction behavior achieves 75%. Then, we focus on the quantity distribution of invoice data and draw a conclusion that the major category with larger number of invoice records is more diversified in subdivided categories. After that, we studied the price distribution of various transaction behaviors to discover the difference in price distribution between different industries. Prices in the major categories of goods are more concentrated in the middle or lower prices. We can analyze the regional industrial structure through the price distribution of the industry which makes sense to study the economy of the region from the perspective of industry.
基于神经网络的中国交易行为分析交易分类系统
随着中国经济的快速发展,研究中国的经济活动意义重大。每一笔交易行为都由发票记录。发票包含交易内容、交易行为分类(根据国家颁布的《商品和服务税收分类与编码》)和交易价格等。我们的研究利用从浙江省采集的真实海量发票数据,基于交易行为分类模型对中国人的交易行为进行了多维度分析。首先,我们提出了一个具有关注机制的 CNN-RNN 组成模型,以推荐从税务发票中收集到的交易行为的相应类别。它将发票中记录的交易行为与国家颁布的《商品和服务税收分类与编码》中的交易代码进行映射。初步实验表明,交易行为分类的最高准确率达到 75%。然后,我们重点研究了发票数据的数量分布,得出了发票记录数量较多的大类在细分类别中更加多样化的结论。之后,我们研究了各种交易行为的价格分布,发现了不同行业之间价格分布的差异。大类商品的价格更多集中在中低价位。我们可以通过产业的价格分布来分析区域产业结构,这对于从产业角度研究区域经济是很有意义的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信