TY - JOUR
T1 - Screening Environmentally Benign Ionic Liquids for CO2 Absorption Using Representation Uncertainty-Based Machine Learning
AU - Zhong, Shifa
AU - Chen, Yushan
AU - Li, Jibai
AU - Igou, Thomas
AU - Xiong, Anyue
AU - Guan, Jian
AU - Dai, Zhenhua
AU - Cai, Xuanying
AU - Qu, Xintong
AU - Chen, Yongsheng
N1 - Publisher Copyright:
© 2024 The Authors. Published by American Chemical Society.
PY - 2024/11/12
Y1 - 2024/11/12
N2 - Screening ionic liquids (ILs) with low viscosity, low toxicity, and high CO2 absorption using machine learning (ML) models is crucial for mitigating global warming. However, when candidate ILs fall into the extrapolation zone of ML models, predictions may become unreliable, leading to poor decision-making. In this study, we introduce a “representation uncertainty” (RU) approach to quantify prediction uncertainty by employing four IL representations: molecular fingerprint, molecular descriptor, molecular image, and molecular graph. We develop four types of ML models based on these representations and calculate RU as the standard deviation of predictions across these models. Compared to traditional model uncertainty (MU), which is based on hyperparameter variations within a single representation, RU outperforms MU in identifying unreliable predictions across four IL property data sets: viscosity, toxicity, refractive index, and CO2 absorption capacity. Furthermore, we develop ensemble models from the four types of models, which show superior predictive performance compared with that of individual models. Using the RU approach, we screened 1420 ILs and identified 37 promising candidates with low viscosity, low toxicity, and high CO2 absorption capacity. The predictive performance of our ensemble model, along with the effectiveness of the RU-based approach, was experimentally validated by testing the CO2 absorption capacity of 14 ILs. This study not only offers a more reliable method for screening and designing ILs, accelerating the discovery process, but also introduces a new perspective on developing ensemble models with enhanced predictive performance.
AB - Screening ionic liquids (ILs) with low viscosity, low toxicity, and high CO2 absorption using machine learning (ML) models is crucial for mitigating global warming. However, when candidate ILs fall into the extrapolation zone of ML models, predictions may become unreliable, leading to poor decision-making. In this study, we introduce a “representation uncertainty” (RU) approach to quantify prediction uncertainty by employing four IL representations: molecular fingerprint, molecular descriptor, molecular image, and molecular graph. We develop four types of ML models based on these representations and calculate RU as the standard deviation of predictions across these models. Compared to traditional model uncertainty (MU), which is based on hyperparameter variations within a single representation, RU outperforms MU in identifying unreliable predictions across four IL property data sets: viscosity, toxicity, refractive index, and CO2 absorption capacity. Furthermore, we develop ensemble models from the four types of models, which show superior predictive performance compared with that of individual models. Using the RU approach, we screened 1420 ILs and identified 37 promising candidates with low viscosity, low toxicity, and high CO2 absorption capacity. The predictive performance of our ensemble model, along with the effectiveness of the RU-based approach, was experimentally validated by testing the CO2 absorption capacity of 14 ILs. This study not only offers a more reliable method for screening and designing ILs, accelerating the discovery process, but also introduces a new perspective on developing ensemble models with enhanced predictive performance.
KW - CO absorption
KW - QSPR
KW - ensemble model
KW - ionic liquids
KW - uncertainty quantification
UR - https://www.scopus.com/pages/publications/85205052295
U2 - 10.1021/acs.estlett.4c00524
DO - 10.1021/acs.estlett.4c00524
M3 - 文章
AN - SCOPUS:85205052295
SN - 2328-8930
VL - 11
SP - 1193
EP - 1199
JO - Environmental Science and Technology Letters
JF - Environmental Science and Technology Letters
IS - 11
ER -