Chinese legal texts – Quantitative Description

Q3 Arts and Humanities

Acta Linguistica Asiatica Pub Date : 2017-06-28 DOI:10.4312/ALA.7.1.77-87

Luboš Gajdoš

引用次数: 2

Abstract

The aim of the paper is to provide a quantitative description of legal Chinese. This study adopts the approach of corpus-based analyses and it shows basic statistical parameters of legal texts in Chinese, namely the length of a sentence, the proportion of part of speech etc. The research is conducted on the Chinese monolingual corpus Hanku . The paper also discusses the issues of statistical data processing from various corpora, e.g. the tokenisation and part of speech tagging and their relevance to study of registers variation.

查看原文本刊更多论文

中国法律文本-数量描述

本文的目的是对法律汉语进行定量描述。本研究采用基于语料库的分析方法，展示了汉语法律文本的基本统计参数，即句长、词性比例等。本研究以汉语单语语料库汉库为研究对象。本文还讨论了各种语料库的统计数据处理问题，如标记化和词性标注及其与语域变化研究的相关性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊