Hirokazu Tanaka, Kota Katanoda, Tomoki Nakaya, Kayo Togawa, Yasuki Kobayashi
{"title":"Development of a comprehensive mortality database in Japan through data linkage of population census and vital statistics.","authors":"Hirokazu Tanaka, Kota Katanoda, Tomoki Nakaya, Kayo Togawa, Yasuki Kobayashi","doi":"10.1186/s12963-026-00479-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>No official framework exists for linking the Population Census with Vital Statistics in Japan, limiting the ability to monitor health-related inequalities. We aimed to develop a new methodology for data linkage to describe sociodemographic patterns of mortality in the Japanese population.</p><p><strong>Methods: </strong>The 2020 Population Census (n = 120,721,239) and Vital Statistics (death records: n = 2,949,946) between October 2020 and September 2022 were linked using a multi-stage deterministic linkage algorithm. Linkage variables included sex, birth year/month, residential address code (basic unit block, census block, and municipality), marital status, and age of spouse. Residential address codes (basic unit blocks: the smallest geographic unit at the Census) of death records were identified through geocoding of the exact residential address. We compared age-standardized mortality rates (ASMRs: per 100,000 person-years) before and after linkage to assess the validity of mortality. We also estimated ASMRs across multiple sociodemographic indicators, including marital status, household size, educational level, occupation, and area deprivation index (ADI).</p><p><strong>Results: </strong>Of the total death records, 2,253,228 (76.4%) were linked to the 2020 Census. The 2020 Japanese census-linked mortality database ultimately comprised 109,119,620 non-institutionalized Japanese individuals (90.4% of the total population) and 2,047,152 non-institutionalized death records (69.4% of total deaths). Crude all-cause mortality rates in most 5-year age categories from the database were 10-15% lower than those in the complete mortality registry without linkage; however, the difference was markedly greater in older age groups (aged ≥ 85 years), indicating less successful linkage in the age groups that had the highest mortality rates. ASMRs differed according to educational level, marital status, household size, occupation, and ADI (e.g., higher ASMRs among individuals with lower educational levels).</p><p><strong>Conclusions: </strong>The new linkage framework, which incorporates detailed address information as a linkage variable, has greatly increased the included population and the completeness of linkage. This approach to developing the census-linked mortality data provides a platform for comparing population health across socioeconomic groups within Japan and Japan's health-related inequalities with those in other countries.</p>","PeriodicalId":51476,"journal":{"name":"Population Health Metrics","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2026-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Population Health Metrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12963-026-00479-0","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: No official framework exists for linking the Population Census with Vital Statistics in Japan, limiting the ability to monitor health-related inequalities. We aimed to develop a new methodology for data linkage to describe sociodemographic patterns of mortality in the Japanese population.
Methods: The 2020 Population Census (n = 120,721,239) and Vital Statistics (death records: n = 2,949,946) between October 2020 and September 2022 were linked using a multi-stage deterministic linkage algorithm. Linkage variables included sex, birth year/month, residential address code (basic unit block, census block, and municipality), marital status, and age of spouse. Residential address codes (basic unit blocks: the smallest geographic unit at the Census) of death records were identified through geocoding of the exact residential address. We compared age-standardized mortality rates (ASMRs: per 100,000 person-years) before and after linkage to assess the validity of mortality. We also estimated ASMRs across multiple sociodemographic indicators, including marital status, household size, educational level, occupation, and area deprivation index (ADI).
Results: Of the total death records, 2,253,228 (76.4%) were linked to the 2020 Census. The 2020 Japanese census-linked mortality database ultimately comprised 109,119,620 non-institutionalized Japanese individuals (90.4% of the total population) and 2,047,152 non-institutionalized death records (69.4% of total deaths). Crude all-cause mortality rates in most 5-year age categories from the database were 10-15% lower than those in the complete mortality registry without linkage; however, the difference was markedly greater in older age groups (aged ≥ 85 years), indicating less successful linkage in the age groups that had the highest mortality rates. ASMRs differed according to educational level, marital status, household size, occupation, and ADI (e.g., higher ASMRs among individuals with lower educational levels).
Conclusions: The new linkage framework, which incorporates detailed address information as a linkage variable, has greatly increased the included population and the completeness of linkage. This approach to developing the census-linked mortality data provides a platform for comparing population health across socioeconomic groups within Japan and Japan's health-related inequalities with those in other countries.
期刊介绍:
Population Health Metrics aims to advance the science of population health assessment, and welcomes papers relating to concepts, methods, ethics, applications, and summary measures of population health. The journal provides a unique platform for population health researchers to share their findings with the global community. We seek research that addresses the communication of population health measures and policy implications to stakeholders; this includes papers related to burden estimation and risk assessment, and research addressing population health across the full range of development. Population Health Metrics covers a broad range of topics encompassing health state measurement and valuation, summary measures of population health, descriptive epidemiology at the population level, burden of disease and injury analysis, disease and risk factor modeling for populations, and comparative assessment of risks to health at the population level. The journal is also interested in how to use and communicate indicators of population health to reduce disease burden, and the approaches for translating from indicators of population health to health-advancing actions. As a cross-cutting topic of importance, we are particularly interested in inequalities in population health and their measurement.