{"title":"A model for tobacco growing area classification based on time series features of thermogravimetric analysis","authors":"Jiaxu Xia, Yunong Tian, Xianwei Hao, Yuhan Peng, Guanqun Luo, Zhihua Gan","doi":"10.1186/s13068-025-02682-x","DOIUrl":null,"url":null,"abstract":"<div><p>Biomass is greatly influenced by geographic location, soil composition, environment, and climate, making the efficient and accurate identification of growing areas highly significant. This study proposes a classification model for tobacco growing areas based on time series features from thermogravimetric analysis (TGA). This study combines Convolutional Neural Networks (CNN) with Long Short-Term Memory (LSTM) model to process the derivative thermogravimetric (DTG) data, aiming to uncover the inherent time series properties and the continuous and dynamic relationship between temperatures for classifying tobacco growing areas. By analyzing 375 tobacco samples from ten different provinces, CNN is employed to extract local features, while LSTM captures long-term dependencies in the DTG data. The dataset used in this study has a limited sample size, a wide variety of classes, and an imbalance in the number of samples across these classes. Despite these challenges, the model achieves 86.4% accuracy on the test set, significantly surpassing the performance of the traditional Support Vector Machine model, which only achieves 68.2% accuracy. Additionally, the model reveals key temperature ranges crucial for growing area classification associated with the pyrolysis temperature ranges of volatile components, hemicellulose, cellulose, lignin, and CaCO<sub>3</sub> in the tobacco. This model lays the groundwork for the future use of geographical labels to accurately represent tobacco’s style and quality, enabling more precise differentiation and improved quality control.</p></div>","PeriodicalId":494,"journal":{"name":"Biotechnology for Biofuels","volume":"18 1","pages":""},"PeriodicalIF":6.1000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://biotechnologyforbiofuels.biomedcentral.com/counter/pdf/10.1186/s13068-025-02682-x","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biotechnology for Biofuels","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1186/s13068-025-02682-x","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Biomass is greatly influenced by geographic location, soil composition, environment, and climate, making the efficient and accurate identification of growing areas highly significant. This study proposes a classification model for tobacco growing areas based on time series features from thermogravimetric analysis (TGA). This study combines Convolutional Neural Networks (CNN) with Long Short-Term Memory (LSTM) model to process the derivative thermogravimetric (DTG) data, aiming to uncover the inherent time series properties and the continuous and dynamic relationship between temperatures for classifying tobacco growing areas. By analyzing 375 tobacco samples from ten different provinces, CNN is employed to extract local features, while LSTM captures long-term dependencies in the DTG data. The dataset used in this study has a limited sample size, a wide variety of classes, and an imbalance in the number of samples across these classes. Despite these challenges, the model achieves 86.4% accuracy on the test set, significantly surpassing the performance of the traditional Support Vector Machine model, which only achieves 68.2% accuracy. Additionally, the model reveals key temperature ranges crucial for growing area classification associated with the pyrolysis temperature ranges of volatile components, hemicellulose, cellulose, lignin, and CaCO3 in the tobacco. This model lays the groundwork for the future use of geographical labels to accurately represent tobacco’s style and quality, enabling more precise differentiation and improved quality control.
期刊介绍:
Biotechnology for Biofuels is an open access peer-reviewed journal featuring high-quality studies describing technological and operational advances in the production of biofuels, chemicals and other bioproducts. The journal emphasizes understanding and advancing the application of biotechnology and synergistic operations to improve plants and biological conversion systems for the biological production of these products from biomass, intermediates derived from biomass, or CO2, as well as upstream or downstream operations that are integral to biological conversion of biomass.
Biotechnology for Biofuels focuses on the following areas:
• Development of terrestrial plant feedstocks
• Development of algal feedstocks
• Biomass pretreatment, fractionation and extraction for biological conversion
• Enzyme engineering, production and analysis
• Bacterial genetics, physiology and metabolic engineering
• Fungal/yeast genetics, physiology and metabolic engineering
• Fermentation, biocatalytic conversion and reaction dynamics
• Biological production of chemicals and bioproducts from biomass
• Anaerobic digestion, biohydrogen and bioelectricity
• Bioprocess integration, techno-economic analysis, modelling and policy
• Life cycle assessment and environmental impact analysis