TY - JOUR
T1 - Normalization Method Utilizing Endogenous Proteins for Quantitative Proteomics
AU - Yan, Kai
AU - Yang, Yueying
AU - Zhang, Yunpeng
AU - Zhao, Wanbing
AU - Liao, Lujian
N1 - Publisher Copyright:
© 2020 Springer New York LLC. All rights reserved.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - We developed a normalization method utilizing the expression levels of a panel of endogenous proteins as normalization standards (EPNS herein). We tested the validity of the method using two sets of tandem mass tag (TMT)-labeled data and found that this normalization method effectively reduced global intensity bias at the protein level. The coefficient of variation (CV) of the overall median was reduced by 55% and 82% on average, compared to the reduction by 72% and 86% after normalization using the upper quartile. Furthermore, we used differential protein expression analysis and statistical learning to identify biomarkers for colorectal cancer from a CPTAC data set. The expression changes of a panel of proteins, including NUP205, GTPBP4, CNN2, GNL3, and S100A11, all of which highly correlate with colorectal cancer. Applying these five proteins as model features, random forest modeling obtained prediction results with the maximum AUC of 0.9998 using EPNS-normalized data, comparing favorably to the AUC of 0.9739 using the raw data. Thus, the normalization method based on EPNS reduced the global intensity bias and is applicable for quantitative proteomic analysis.
AB - We developed a normalization method utilizing the expression levels of a panel of endogenous proteins as normalization standards (EPNS herein). We tested the validity of the method using two sets of tandem mass tag (TMT)-labeled data and found that this normalization method effectively reduced global intensity bias at the protein level. The coefficient of variation (CV) of the overall median was reduced by 55% and 82% on average, compared to the reduction by 72% and 86% after normalization using the upper quartile. Furthermore, we used differential protein expression analysis and statistical learning to identify biomarkers for colorectal cancer from a CPTAC data set. The expression changes of a panel of proteins, including NUP205, GTPBP4, CNN2, GNL3, and S100A11, all of which highly correlate with colorectal cancer. Applying these five proteins as model features, random forest modeling obtained prediction results with the maximum AUC of 0.9998 using EPNS-normalized data, comparing favorably to the AUC of 0.9739 using the raw data. Thus, the normalization method based on EPNS reduced the global intensity bias and is applicable for quantitative proteomic analysis.
KW - biomarker
KW - endogenous protein normalization
KW - global bias
KW - quantitative proteomics
UR - https://www.scopus.com/pages/publications/85087466662
U2 - 10.1021/jasms.0c00012
DO - 10.1021/jasms.0c00012
M3 - 文章
C2 - 32268065
AN - SCOPUS:85087466662
SN - 1044-0305
VL - 31
SP - 1380
EP - 1388
JO - Journal of the American Society for Mass Spectrometry
JF - Journal of the American Society for Mass Spectrometry
IS - 7
ER -