{"title":"A sequence approach to case outcome detection","authors":"Tom Vacek, Frank Schilder","doi":"10.1145/3086512.3086534","DOIUrl":null,"url":null,"abstract":"We describe a system to detect the outcome of U.S. Federal District Court cases based on PACER electronic dockets. We study the text processing components of the system and develop two model architectures in order to detect the outcome of a case per party (e.g., dismissed by Court or Verdict for Plaintiff). We conclude that modeling cases as a linear-chain graphical model (i.e., Conditional Random Field (CRF)) offers significantly better performance than modeling the case entry-by-entry (i.e., Logistic Regression (LR)). We in particular show that a first-order modeling of the CRF significantly outperforms the factorized model for the CRF architecture.","PeriodicalId":425187,"journal":{"name":"Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3086512.3086534","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
We describe a system to detect the outcome of U.S. Federal District Court cases based on PACER electronic dockets. We study the text processing components of the system and develop two model architectures in order to detect the outcome of a case per party (e.g., dismissed by Court or Verdict for Plaintiff). We conclude that modeling cases as a linear-chain graphical model (i.e., Conditional Random Field (CRF)) offers significantly better performance than modeling the case entry-by-entry (i.e., Logistic Regression (LR)). We in particular show that a first-order modeling of the CRF significantly outperforms the factorized model for the CRF architecture.