TY - JOUR
T1 - Exploring the macrocyclic chemical space for heuristic drug design with deep learning models
AU - Hu, Feng
AU - Jia, Xiaotong
AU - Liao, Wenjie
AU - Chen, Ziqi
AU - Bi, Hongjie
AU - Ge, Huan
AU - Liu, Dandan
AU - Zhang, Rongrong
AU - Hu, Yuting
AU - Mei, Wenyi
AU - Zhao, Zhenjiang
AU - Zhang, Kai
AU - Zhu, Lili
AU - Diao, Yanyan
AU - Li, Honglin
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Macrocyclic compounds hold great promise as therapeutic agents. However, their structural optimization remains constrained by the limited availability of bioactive candidates, which in turn hampers the systematic exploration of structure-activity relationships. Here we introduce CycleGPT, a generative chemical language model designed specifically to address these challenges. CycleGPT is characterized by a progressive transfer learning paradigm that incrementally transfers knowledge from pre-trained chemical language models to specialized macrocycle generation, thereby overcoming the data shortage issue. Meanwhile, it adopts an innovative probabilistic sampling strategy that effectively improves the structural novelty of generated macrocycles while ensuring domain-specific adaptability. In a prospective drug design based on CycleGPT and a JAK2 activity prediction model, we successfully developed a new JAK2 drug candidate with a good selectivity profile (inhibiting 17 wild-type kinases) and promising potential for treating polycythemia in vivo, demonstrating the practicality of deep learning methods in macrocyclic drug design.
AB - Macrocyclic compounds hold great promise as therapeutic agents. However, their structural optimization remains constrained by the limited availability of bioactive candidates, which in turn hampers the systematic exploration of structure-activity relationships. Here we introduce CycleGPT, a generative chemical language model designed specifically to address these challenges. CycleGPT is characterized by a progressive transfer learning paradigm that incrementally transfers knowledge from pre-trained chemical language models to specialized macrocycle generation, thereby overcoming the data shortage issue. Meanwhile, it adopts an innovative probabilistic sampling strategy that effectively improves the structural novelty of generated macrocycles while ensuring domain-specific adaptability. In a prospective drug design based on CycleGPT and a JAK2 activity prediction model, we successfully developed a new JAK2 drug candidate with a good selectivity profile (inhibiting 17 wild-type kinases) and promising potential for treating polycythemia in vivo, demonstrating the practicality of deep learning methods in macrocyclic drug design.
UR - https://www.scopus.com/pages/publications/105018675228
U2 - 10.1038/s42004-025-01686-w
DO - 10.1038/s42004-025-01686-w
M3 - 文章
AN - SCOPUS:105018675228
SN - 2399-3669
VL - 8
JO - Communications Chemistry
JF - Communications Chemistry
IS - 1
M1 - 299
ER -