Application of lasso for identification of functional groups with significant contributions to antioxidant activities of Centella asiatica

IF 0.5 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS
C. Wirdiastuti, U. Syafitri, M. Sumertajaya, E. Rohaeti, M. Rafi
{"title":"Application of lasso for identification of functional groups with significant contributions to antioxidant activities of Centella asiatica","authors":"C. Wirdiastuti, U. Syafitri, M. Sumertajaya, E. Rohaeti, M. Rafi","doi":"10.28919/cmbn/7843","DOIUrl":null,"url":null,"abstract":": High-dimensional data has more variables than observations (p>>n). In this case, modeling with regression analysis becomes ineffective because it will violate the multicollinearity assumption. The least absolute shrinkage and selection operator (LASSO) can handle high-dimensional data and multicollinearity because LASSO works by reducing the parameters of variables with significant effects and selecting variables with minor effects. In its application, several variables have the same characteristics. Reducing and selecting variables in the form of groups can solve the problem so that the group LASSO can be used as a solution. This study used data on antioxidant activity in C. asiatica. It is a plant that contains antioxidants. The spectroscopic technique can find important information about antioxidants, namely the Fourier transformed infrared spectrophotometer (FTIR). FTIR is a spectroscopic technique based on molecular vibrations subjected to infrared so that it can characterize molecules with functional groups. FTIR data has large dimensions and multicollinearity. This study has 1866 explanatory","PeriodicalId":44079,"journal":{"name":"Communications in Mathematical Biology and Neuroscience","volume":null,"pages":null},"PeriodicalIF":0.5000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Mathematical Biology and Neuroscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28919/cmbn/7843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

: High-dimensional data has more variables than observations (p>>n). In this case, modeling with regression analysis becomes ineffective because it will violate the multicollinearity assumption. The least absolute shrinkage and selection operator (LASSO) can handle high-dimensional data and multicollinearity because LASSO works by reducing the parameters of variables with significant effects and selecting variables with minor effects. In its application, several variables have the same characteristics. Reducing and selecting variables in the form of groups can solve the problem so that the group LASSO can be used as a solution. This study used data on antioxidant activity in C. asiatica. It is a plant that contains antioxidants. The spectroscopic technique can find important information about antioxidants, namely the Fourier transformed infrared spectrophotometer (FTIR). FTIR is a spectroscopic technique based on molecular vibrations subjected to infrared so that it can characterize molecules with functional groups. FTIR data has large dimensions and multicollinearity. This study has 1866 explanatory
利用套索对积雪草抗氧化活性有重要贡献的官能团进行鉴定
:高维数据比观测值具有更多的变量(p>>n)。在这种情况下,用回归分析建模是无效的,因为它会违反多重共线性假设。最小绝对收缩和选择算子(LASSO)可以处理高维数据和多重共线性,因为LASSO的工作原理是减少影响显著的变量的参数,选择影响较小的变量。在其应用中,几个变量具有相同的特征。以组的形式对变量进行化简和选择,可以解决问题,使组LASSO可以作为一种解决方案。本研究利用了积雪草的抗氧化活性数据。它是一种含有抗氧化剂的植物。光谱技术可以发现抗氧化剂的重要信息,即傅里叶变换红外分光光度计(FTIR)。FTIR是一种基于分子振动的红外光谱技术,它可以表征具有官能团的分子。FTIR数据具有维数大、多重共线性的特点。这项研究有1866年的解释性
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Communications in Mathematical Biology and Neuroscience
Communications in Mathematical Biology and Neuroscience COMPUTER SCIENCE, INFORMATION SYSTEMS-
CiteScore
2.10
自引率
15.40%
发文量
80
期刊介绍: Communications in Mathematical Biology and Neuroscience (CMBN) is a peer-reviewed open access international journal, which is aimed to provide a publication forum for important research in all aspects of mathematical biology and neuroscience. This journal will accept high quality articles containing original research results and survey articles of exceptional merit.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信