{"title":"先减少再预测还是同时减少再预测?提高研发效率的数据驱动稀疏建模","authors":"Pa-Chieh Hsiao;Yen-Chun Chou;Howard Hao-Chun Chuang","doi":"10.1109/TEM.2025.3577580","DOIUrl":null,"url":null,"abstract":"Efficient research and development (R&D) workflows are critical in industries where early-stage results influence downstream outcomes. This study develops a predictive model to enhance R&D efficiency for a leading integrated device manufacturer specializing in printed circuit board design. To address challenges of limited data, noise and collinearity, we apply sparse principal component analysis (SPCA) to simplify simulation data, followed by least absolute shrinkage and selection operator (LASSO) regression to predict later-stage physical testing performance. Our SPCA-LASSO model reduces prediction errors by 22%–41% compared to direct LASSO regression while offering interpretable insights for engineers. In contrast, sparse principal component regression, which integrates dimension reduction and prediction, yields higher errors and unstable factor loadings. This empirical comparison between reduce-then-predict and simultaneous reduce-and-predict approaches contributes to sparse modeling and engineering analytics, offering actionable insights for improving sequential R&D processes across high-tech industries, software engineering, construction, and other sectors where early performance predictions are critical.","PeriodicalId":55009,"journal":{"name":"IEEE Transactions on Engineering Management","volume":"72 ","pages":"2646-2660"},"PeriodicalIF":5.2000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reduce-Then-Predict or Simultaneous Reduce-and-Predict? Data-Driven Sparse Modeling for Improving R&D Efficiency\",\"authors\":\"Pa-Chieh Hsiao;Yen-Chun Chou;Howard Hao-Chun Chuang\",\"doi\":\"10.1109/TEM.2025.3577580\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Efficient research and development (R&D) workflows are critical in industries where early-stage results influence downstream outcomes. This study develops a predictive model to enhance R&D efficiency for a leading integrated device manufacturer specializing in printed circuit board design. To address challenges of limited data, noise and collinearity, we apply sparse principal component analysis (SPCA) to simplify simulation data, followed by least absolute shrinkage and selection operator (LASSO) regression to predict later-stage physical testing performance. Our SPCA-LASSO model reduces prediction errors by 22%–41% compared to direct LASSO regression while offering interpretable insights for engineers. In contrast, sparse principal component regression, which integrates dimension reduction and prediction, yields higher errors and unstable factor loadings. This empirical comparison between reduce-then-predict and simultaneous reduce-and-predict approaches contributes to sparse modeling and engineering analytics, offering actionable insights for improving sequential R&D processes across high-tech industries, software engineering, construction, and other sectors where early performance predictions are critical.\",\"PeriodicalId\":55009,\"journal\":{\"name\":\"IEEE Transactions on Engineering Management\",\"volume\":\"72 \",\"pages\":\"2646-2660\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Engineering Management\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11027659/\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Engineering Management","FirstCategoryId":"91","ListUrlMain":"https://ieeexplore.ieee.org/document/11027659/","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS","Score":null,"Total":0}
Reduce-Then-Predict or Simultaneous Reduce-and-Predict? Data-Driven Sparse Modeling for Improving R&D Efficiency
Efficient research and development (R&D) workflows are critical in industries where early-stage results influence downstream outcomes. This study develops a predictive model to enhance R&D efficiency for a leading integrated device manufacturer specializing in printed circuit board design. To address challenges of limited data, noise and collinearity, we apply sparse principal component analysis (SPCA) to simplify simulation data, followed by least absolute shrinkage and selection operator (LASSO) regression to predict later-stage physical testing performance. Our SPCA-LASSO model reduces prediction errors by 22%–41% compared to direct LASSO regression while offering interpretable insights for engineers. In contrast, sparse principal component regression, which integrates dimension reduction and prediction, yields higher errors and unstable factor loadings. This empirical comparison between reduce-then-predict and simultaneous reduce-and-predict approaches contributes to sparse modeling and engineering analytics, offering actionable insights for improving sequential R&D processes across high-tech industries, software engineering, construction, and other sectors where early performance predictions are critical.
期刊介绍:
Management of technical functions such as research, development, and engineering in industry, government, university, and other settings. Emphasis is on studies carried on within an organization to help in decision making or policy formation for RD&E.