Identifying and Qualifying Deviant Cases in Clusters of Sequences: The Why and The How.

IF 1.9 2区 社会学 Q2 DEMOGRAPHY
Raffaella Piccarreta, Emanuela Struffolino
{"title":"Identifying and Qualifying Deviant Cases in Clusters of Sequences: The Why and The How.","authors":"Raffaella Piccarreta, Emanuela Struffolino","doi":"10.1007/s10680-023-09682-3","DOIUrl":null,"url":null,"abstract":"<p><p>Sequence analysis is employed in different fields-e.g., demography, sociology, and political sciences-to describe longitudinal processes represented as sequences of categorical states. In many applications, sequences are clustered to identify relevant types, which reflect the different empirical realisations of the temporal process under study. We explore criteria to inspect internal cluster composition and to detect deviant sequences, that is, cases characterised by rare patterns or outliers that might compromise cluster homogeneity. We also introduce tools to visualise and distinguish the features of regular and deviant cases. Our proposals offer a more accurate and granular description of the data structure, by identifying-besides the most typical types-peculiar sequences that might be interesting from a substantive and theoretical point of view. This analysis could be very useful in applications where-under the assumption of within homogeneity-clusters are used as outcome or explanatory variables in regressions. We demonstrate the added value of our proposal in a motivating application from life-course socio-demography, focusing on Italian women's employment trajectories and on their link with their mothers' participation in the labour market across geographical areas.</p>","PeriodicalId":51496,"journal":{"name":"European Journal of Population-Revue Europeenne De Demographie","volume":"40 1","pages":"1"},"PeriodicalIF":1.9000,"publicationDate":"2023-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10730788/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Population-Revue Europeenne De Demographie","FirstCategoryId":"90","ListUrlMain":"https://doi.org/10.1007/s10680-023-09682-3","RegionNum":2,"RegionCategory":"社会学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DEMOGRAPHY","Score":null,"Total":0}
引用次数: 0

Abstract

Sequence analysis is employed in different fields-e.g., demography, sociology, and political sciences-to describe longitudinal processes represented as sequences of categorical states. In many applications, sequences are clustered to identify relevant types, which reflect the different empirical realisations of the temporal process under study. We explore criteria to inspect internal cluster composition and to detect deviant sequences, that is, cases characterised by rare patterns or outliers that might compromise cluster homogeneity. We also introduce tools to visualise and distinguish the features of regular and deviant cases. Our proposals offer a more accurate and granular description of the data structure, by identifying-besides the most typical types-peculiar sequences that might be interesting from a substantive and theoretical point of view. This analysis could be very useful in applications where-under the assumption of within homogeneity-clusters are used as outcome or explanatory variables in regressions. We demonstrate the added value of our proposal in a motivating application from life-course socio-demography, focusing on Italian women's employment trajectories and on their link with their mothers' participation in the labour market across geographical areas.

识别和鉴定序列群中的偏差案例:原因与方法
序列分析应用于不同领域,如人口学、社会学和政治学,以描述以分类状态序列表示的纵向过程。在许多应用中,序列被聚类以识别相关类型,这些类型反映了所研究的时间过程的不同经验现实。我们探讨了检查内部聚类组成和检测偏差序列的标准,即具有罕见模式或异常值的情况,这些情况可能会影响聚类的一致性。我们还介绍了可视化和区分正常与异常情况特征的工具。除了最典型的类型外,我们的建议还能识别从实质和理论角度来看可能很有趣的特殊序列,从而对数据结构进行更准确、更细化的描述。在应用中,根据内部同质性假设,聚类被用作回归中的结果或解释变量,这种分析可能非常有用。我们将在生命历程社会人口学的一个激励性应用中展示我们建议的附加价值,重点关注意大利妇女的就业轨迹及其与母亲跨地域参与劳动力市场的联系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
8.00%
发文量
44
期刊介绍: European Journal of Population addresses a broad public of researchers, policy makers and others concerned with population processes and their consequences. Its aim is to improve understanding of population phenomena by giving priority to work that contributes to the development of theory and method, and that spans the boundaries between demography and such disciplines as sociology, anthropology, economics, geography, history, political science, epidemiology and other sciences contributing to public health. The Journal is open to authors from all over the world, and its articles cover European and non-European countries (specifically including developing countries) alike.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信