{"title":"Data science through natural language with ChatGPT's Code Interpreter.","authors":"Sangzin Ahn","doi":"10.12793/tcp.2024.32.e8","DOIUrl":null,"url":null,"abstract":"<p><p>Large language models (LLMs) have emerged as a powerful tool for biomedical researchers, demonstrating remarkable capabilities in understanding and generating human-like text. ChatGPT with its Code Interpreter functionality, an LLM connected with the ability to write and execute code, streamlines data analysis workflows by enabling natural language interactions. Using materials from a previously published tutorial, similar analyses can be performed through conversational interactions with the chatbot, covering data loading and exploration, model development and comparison, permutation feature importance, partial dependence plots, and additional analyses and recommendations. The findings highlight the significant potential of LLMs in assisting researchers with data analysis tasks, allowing them to focus on higher-level aspects of their work. However, there are limitations and potential concerns associated with the use of LLMs, such as the importance of critical thinking, privacy, security, and equitable access to these tools. As LLMs continue to improve and integrate with available tools, data science may experience a transformation similar to the shift from manual to automatic transmission in driving. The advancements in LLMs call for considering the future directions of data science and its education, ensuring that the benefits of these powerful tools are utilized with proper human supervision and responsibility.</p>","PeriodicalId":23288,"journal":{"name":"Translational and Clinical Pharmacology","volume":"32 2","pages":"73-82"},"PeriodicalIF":1.1000,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11224898/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational and Clinical Pharmacology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12793/tcp.2024.32.e8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/5/29 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
Large language models (LLMs) have emerged as a powerful tool for biomedical researchers, demonstrating remarkable capabilities in understanding and generating human-like text. ChatGPT with its Code Interpreter functionality, an LLM connected with the ability to write and execute code, streamlines data analysis workflows by enabling natural language interactions. Using materials from a previously published tutorial, similar analyses can be performed through conversational interactions with the chatbot, covering data loading and exploration, model development and comparison, permutation feature importance, partial dependence plots, and additional analyses and recommendations. The findings highlight the significant potential of LLMs in assisting researchers with data analysis tasks, allowing them to focus on higher-level aspects of their work. However, there are limitations and potential concerns associated with the use of LLMs, such as the importance of critical thinking, privacy, security, and equitable access to these tools. As LLMs continue to improve and integrate with available tools, data science may experience a transformation similar to the shift from manual to automatic transmission in driving. The advancements in LLMs call for considering the future directions of data science and its education, ensuring that the benefits of these powerful tools are utilized with proper human supervision and responsibility.
期刊介绍:
Translational and Clinical Pharmacology (Transl Clin Pharmacol, TCP) is the official journal of the Korean Society for Clinical Pharmacology and Therapeutics (KSCPT). TCP is an interdisciplinary journal devoted to the dissemination of knowledge relating to all aspects of translational and clinical pharmacology. The categories for publication include pharmacokinetics (PK) and drug disposition, drug metabolism, pharmacodynamics (PD), clinical trials and design issues, pharmacogenomics and pharmacogenetics, pharmacometrics, pharmacoepidemiology, pharmacovigilence, and human pharmacology. Studies involving animal models, pharmacological characterization, and clinical trials are appropriate for consideration.