Quasi-Empirical Bayes methods for parameter estimation involving many small samples.

IF 1.2 4区医学 Q4 PHARMACOLOGY & PHARMACY

Journal of Biopharmaceutical Statistics Pub Date : 2025-04-11 DOI:10.1080/10543406.2025.2489357

Kanaka Tatikola, Javier Cabrera, Chun Pang Lin, Helena Geys, Fetene Tekle, Jocelyn Sendecki, Stan Altan, Dhammika Amaratunga, Mariusz Lubomirski

{"title":"Quasi-Empirical Bayes methods for parameter estimation involving many small samples.","authors":"Kanaka Tatikola, Javier Cabrera, Chun Pang Lin, Helena Geys, Fetene Tekle, Jocelyn Sendecki, Stan Altan, Dhammika Amaratunga, Mariusz Lubomirski","doi":"10.1080/10543406.2025.2489357","DOIUrl":null,"url":null,"abstract":"<p><p>Animal studies in pharmaceutical discovery and toxicology are not always statistically powered for estimation or hypothesis testing. Typically, only 3 to 5 animals are allocated per group, based on historical conventions or industry practice, particularly in early toxicology studies with several different types of controls and compounds at various concentrations. When we estimate means, variances, or other parameters under these conditions, often the confidence intervals generated will be of little practical use due to the small sample size. If, however, historical or even concurrent data with similar characteristics is available from comparable experiments, all data could be incorporated into the estimation by using an Empirical Bayesian approach. To implement this method, the existing data is used to determine prior distributions for the parameters of interest, which are then combined with the sample data of interest to produce posterior distributions. In our case study, we combined data from 30 different experiments to use as a basis for defining the prior distributions on the mean and standard deviation (SD). For practical reasons related to our application, we prefer to use the standard deviation instead of the variance or precision that are more commonly used in the Bayesian methodology. For the mean parameter, the prior distribution is approximated by a Normal distribution, covering the range of all samples. For SD, the prior distribution is approximated with a half-Normal, half-Cauchy, or Uniform with carefully chosen boundaries. An Empirical Bayes method is then applied, combining the selected prior distributions with observed data in each small experiment to obtain the posterior distribution for the mean and for the variance of that particular experiment. The strategy of using the combined data from multiple samples to develop a common prior distribution that borrows strength across all the available data reduces the variability of the estimates and improves the estimation of individual parameters. In effect, this method combines \"borrowing strength\" with \"Empirical Bayes\" in a way that suggests \"Tukey meets Robbins\"!</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"1-10"},"PeriodicalIF":1.2000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biopharmaceutical Statistics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10543406.2025.2489357","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

Abstract

Animal studies in pharmaceutical discovery and toxicology are not always statistically powered for estimation or hypothesis testing. Typically, only 3 to 5 animals are allocated per group, based on historical conventions or industry practice, particularly in early toxicology studies with several different types of controls and compounds at various concentrations. When we estimate means, variances, or other parameters under these conditions, often the confidence intervals generated will be of little practical use due to the small sample size. If, however, historical or even concurrent data with similar characteristics is available from comparable experiments, all data could be incorporated into the estimation by using an Empirical Bayesian approach. To implement this method, the existing data is used to determine prior distributions for the parameters of interest, which are then combined with the sample data of interest to produce posterior distributions. In our case study, we combined data from 30 different experiments to use as a basis for defining the prior distributions on the mean and standard deviation (SD). For practical reasons related to our application, we prefer to use the standard deviation instead of the variance or precision that are more commonly used in the Bayesian methodology. For the mean parameter, the prior distribution is approximated by a Normal distribution, covering the range of all samples. For SD, the prior distribution is approximated with a half-Normal, half-Cauchy, or Uniform with carefully chosen boundaries. An Empirical Bayes method is then applied, combining the selected prior distributions with observed data in each small experiment to obtain the posterior distribution for the mean and for the variance of that particular experiment. The strategy of using the combined data from multiple samples to develop a common prior distribution that borrows strength across all the available data reduces the variability of the estimates and improves the estimation of individual parameters. In effect, this method combines "borrowing strength" with "Empirical Bayes" in a way that suggests "Tukey meets Robbins"!

查看原文本刊更多论文

多小样本参数估计的准经验贝叶斯方法。

药物发现和毒理学的动物研究并不总是统计上的估计或假设检验。通常，根据历史惯例或行业惯例，每组只分配3至5只动物，特别是在早期的毒理学研究中，有几种不同类型的对照和不同浓度的化合物。当我们在这些条件下估计均值、方差或其他参数时，由于样本量小，通常生成的置信区间几乎没有实际用途。但是，如果从可比实验中获得具有相似特征的历史甚至并发数据，则可以使用经验贝叶斯方法将所有数据纳入估计。为了实现该方法，使用现有数据确定感兴趣参数的先验分布，然后将其与感兴趣的样本数据相结合以产生后验分布。在我们的案例研究中，我们结合了来自30个不同实验的数据，作为定义均值和标准差（SD）先验分布的基础。由于与我们的应用相关的实际原因，我们更喜欢使用标准偏差，而不是贝叶斯方法中更常用的方差或精度。对于均值参数，先验分布近似为正态分布，覆盖所有样本的范围。对于SD，先验分布近似为半正态分布、半柯西分布或均匀分布，边界仔细选择。然后应用经验贝叶斯方法，将选择的先验分布与每个小实验中的观测数据相结合，得到该特定实验的均值和方差的后验分布。使用来自多个样本的组合数据来开发一个公共先验分布的策略，该策略借用了所有可用数据的强度，减少了估计的可变性，并改善了对单个参数的估计。实际上，这种方法将“借用力量”与“经验贝叶斯”结合在一起，就像“杜克遇到罗宾斯”一样！

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Biopharmaceutical Statistics 医学-统计学与概率论

CiteScore

2.50

自引率

18.20%

发文量

审稿时长

6-12 weeks

期刊介绍： The Journal of Biopharmaceutical Statistics, a rapid publication journal, discusses quality applications of statistics in biopharmaceutical research and development. Now publishing six times per year, it includes expositions of statistical methodology with immediate applicability to biopharmaceutical research in the form of full-length and short manuscripts, review articles, selected/invited conference papers, short articles, and letters to the editor. Addressing timely and provocative topics important to the biostatistical profession, the journal covers: Drug, device, and biological research and development; Drug screening and drug design; Assessment of pharmacological activity; Pharmaceutical formulation and scale-up; Preclinical safety assessment; Bioavailability, bioequivalence, and pharmacokinetics; Phase, I, II, and III clinical development including complex innovative designs; Premarket approval assessment of clinical safety; Postmarketing surveillance; Big data and artificial intelligence and applications.