Advances in neural information processing systems最新文献

Newton Informed Neural Operator for Solving Nonlinear Partial Differential Equations. 求解非线性偏微分方程的牛顿通知神经算子。

Advances in neural information processing systems Pub Date : 2024-12-01

Wenrui Hao, Xinliang Liu, Yahong Yang

引用次数: 0

Biomedical Visual Instruction Tuning with Clinician Preference Alignment. 生物医学视觉教学调整与临床医师偏好对齐。

Advances in neural information processing systems Pub Date : 2024-12-01

Hejie Cui, Lingjun Mao, Xin Liang, Jieyu Zhang, Hui Ren, Quanzheng Li, Xiang Li, Carl Yang

{"title":"Biomedical Visual Instruction Tuning with Clinician Preference Alignment.","authors":"Hejie Cui, Lingjun Mao, Xin Liang, Jieyu Zhang, Hui Ren, Quanzheng Li, Xiang Li, Carl Yang","doi":"","DOIUrl":"","url":null,"abstract":"Recent advancements in multimodal foundation models have showcased impressive capabilities in understanding and reasoning with visual and textual information. Adapting these foundation models trained for general usage to specialized domains like biomedicine requires large-scale domain-specific instruction datasets. While existing works have explored curating such datasets automatically, the resultant datasets are not explicitly aligned with domain expertise. In this work, we propose a data-centric framework, Biomedical Visual Instruction Tuning with Clinician Preference Alignment (BioMed-VITAL), that incorporates clinician preferences into both stages of generating and selecting instruction data for tuning biomedical multimodal foundation models. First, during the generation stage, we prompt the GPT-4V generator with a diverse set of clinician-selected demonstrations for preference-aligned data candidate generation. Then, during the selection phase, we train a separate selection model, which explicitly distills clinician and policy-guided model preferences into a rating function to select high-quality data for medical instruction tuning. Results show that the model tuned with the instruction-following data from our method demonstrates a significant improvement in open visual chat (18.5% relatively) and medical VQA (win rate up to 81.73%). Our instruction-following data and models are available at https://BioMed-VITAL.github.io.","PeriodicalId":72099,"journal":{"name":"Advances in neural information processing systems","volume":"37 ","pages":"96449-96467"},"PeriodicalIF":0.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11867732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143525294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias. 交叉护理：评估语言模型偏差的预训练数据对医疗保健的影响。