{"title":"目标定义","authors":"Raymond A. Anderson","doi":"10.1093/oso/9780192844194.003.0018","DOIUrl":null,"url":null,"abstract":"Predictive models need something to predict. Extracted data is used to set (or confirm) the target definition, if not cast in stone. This is one of the most crucial steps of the process, which may not be straightforward. (1) Overview—i) continuous versus binary targets, and distinctions between probability of default (PD), exposure-at-default (EAD) and loss given default (LGD); ii) definition requirements—relevance, focus, transparency, adequacy and data quality; iii) performance components—automated counters/statuses and manual statuses; iv) code cross-checks—to determine whether statuses are properly understood. (2) Definition strictness—i) status nodes—define treatment {out-of-scope, exclusion, trivial balance, Bad/Indeterminate/Good}; ii) roll-rates—used to define delinquency levels; iii) trivial balances—avoid penalization of minor infractions; iv) closed accounts—possible treatments. (3) Integrity checks—i) consistency—period-on-period changes in distribution; ii) characteristics—ensuring they have the intended influence within the definition; iii) swap-set—assessing new versus old OR alternative definitions.","PeriodicalId":286194,"journal":{"name":"Credit Intelligence & Modelling","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Target Definition\",\"authors\":\"Raymond A. Anderson\",\"doi\":\"10.1093/oso/9780192844194.003.0018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predictive models need something to predict. Extracted data is used to set (or confirm) the target definition, if not cast in stone. This is one of the most crucial steps of the process, which may not be straightforward. (1) Overview—i) continuous versus binary targets, and distinctions between probability of default (PD), exposure-at-default (EAD) and loss given default (LGD); ii) definition requirements—relevance, focus, transparency, adequacy and data quality; iii) performance components—automated counters/statuses and manual statuses; iv) code cross-checks—to determine whether statuses are properly understood. (2) Definition strictness—i) status nodes—define treatment {out-of-scope, exclusion, trivial balance, Bad/Indeterminate/Good}; ii) roll-rates—used to define delinquency levels; iii) trivial balances—avoid penalization of minor infractions; iv) closed accounts—possible treatments. (3) Integrity checks—i) consistency—period-on-period changes in distribution; ii) characteristics—ensuring they have the intended influence within the definition; iii) swap-set—assessing new versus old OR alternative definitions.\",\"PeriodicalId\":286194,\"journal\":{\"name\":\"Credit Intelligence & Modelling\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Credit Intelligence & Modelling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/oso/9780192844194.003.0018\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Credit Intelligence & Modelling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/oso/9780192844194.003.0018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predictive models need something to predict. Extracted data is used to set (or confirm) the target definition, if not cast in stone. This is one of the most crucial steps of the process, which may not be straightforward. (1) Overview—i) continuous versus binary targets, and distinctions between probability of default (PD), exposure-at-default (EAD) and loss given default (LGD); ii) definition requirements—relevance, focus, transparency, adequacy and data quality; iii) performance components—automated counters/statuses and manual statuses; iv) code cross-checks—to determine whether statuses are properly understood. (2) Definition strictness—i) status nodes—define treatment {out-of-scope, exclusion, trivial balance, Bad/Indeterminate/Good}; ii) roll-rates—used to define delinquency levels; iii) trivial balances—avoid penalization of minor infractions; iv) closed accounts—possible treatments. (3) Integrity checks—i) consistency—period-on-period changes in distribution; ii) characteristics—ensuring they have the intended influence within the definition; iii) swap-set—assessing new versus old OR alternative definitions.