{"title":"Demystifying dimensionality reduction techniques in the ‘omics’ era: A practical approach for biological science students","authors":"Leonardo D. Garma, Nuno S. Osório","doi":"10.1002/bmb.21800","DOIUrl":null,"url":null,"abstract":"<p>Dimensionality reduction techniques are essential in analyzing large ‘omics’ datasets in biochemistry and molecular biology. Principal component analysis, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are commonly used for data visualization. However, these methods can be challenging for students without a strong mathematical background. In this study, intuitive examples were created using COVID-19 data to help students understand the core concepts behind these techniques. In a 4-h practical session, we used these examples to demonstrate dimensionality reduction techniques to 15 postgraduate students from biomedical backgrounds. Using Python and Jupyter notebooks, our goal was to demystify these methods, typically treated as “black boxes”, and empower students to generate and interpret their own results. To assess the impact of our approach, we conducted an anonymous survey. The majority of the students agreed that using computers enriched their learning experience (67%) and that Jupyter notebooks were a valuable part of the class (66%). Additionally, 60% of the students reported increased interest in Python, and 40% gained both interest and a better understanding of dimensionality reduction methods. Despite the short duration of the course, 40% of the students reported acquiring research skills necessary in the field. While further analysis of the learning impacts of this approach is needed, we believe that sharing the examples we generated can provide valuable resources for others to use in interactive teaching environments. These examples highlight advantages and limitations of the major dimensionality reduction methods used in modern bioinformatics analysis in an easy-to-understand way.</p>","PeriodicalId":8830,"journal":{"name":"Biochemistry and Molecular Biology Education","volume":"52 2","pages":"165-178"},"PeriodicalIF":1.2000,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochemistry and Molecular Biology Education","FirstCategoryId":"95","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/bmb.21800","RegionNum":4,"RegionCategory":"教育学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Dimensionality reduction techniques are essential in analyzing large ‘omics’ datasets in biochemistry and molecular biology. Principal component analysis, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are commonly used for data visualization. However, these methods can be challenging for students without a strong mathematical background. In this study, intuitive examples were created using COVID-19 data to help students understand the core concepts behind these techniques. In a 4-h practical session, we used these examples to demonstrate dimensionality reduction techniques to 15 postgraduate students from biomedical backgrounds. Using Python and Jupyter notebooks, our goal was to demystify these methods, typically treated as “black boxes”, and empower students to generate and interpret their own results. To assess the impact of our approach, we conducted an anonymous survey. The majority of the students agreed that using computers enriched their learning experience (67%) and that Jupyter notebooks were a valuable part of the class (66%). Additionally, 60% of the students reported increased interest in Python, and 40% gained both interest and a better understanding of dimensionality reduction methods. Despite the short duration of the course, 40% of the students reported acquiring research skills necessary in the field. While further analysis of the learning impacts of this approach is needed, we believe that sharing the examples we generated can provide valuable resources for others to use in interactive teaching environments. These examples highlight advantages and limitations of the major dimensionality reduction methods used in modern bioinformatics analysis in an easy-to-understand way.
期刊介绍:
The aim of BAMBED is to enhance teacher preparation and student learning in Biochemistry, Molecular Biology, and related sciences such as Biophysics and Cell Biology, by promoting the world-wide dissemination of educational materials. BAMBED seeks and communicates articles on many topics, including:
Innovative techniques in teaching and learning.
New pedagogical approaches.
Research in biochemistry and molecular biology education.
Reviews on emerging areas of Biochemistry and Molecular Biology to provide background for the preparation of lectures, seminars, student presentations, dissertations, etc.
Historical Reviews describing "Paths to Discovery".
Novel and proven laboratory experiments that have both skill-building and discovery-based characteristics.
Reviews of relevant textbooks, software, and websites.
Descriptions of software for educational use.
Descriptions of multimedia materials such as tutorials on various aspects of biochemistry and molecular biology.