TY - JOUR
T1 - Data-intensive science and engineering
T2 - Requirements and challenges
AU - Gong, Xue Qing
AU - Jin, Che Qing
AU - Wang, Xiao Ling
AU - Zhang, Rong
AU - Zhou, Ao Ying
PY - 2012/8
Y1 - 2012/8
N2 - Scientific exploration after experimental science, theoretical science and computational science phases, into data-intensive science phase, are accompanied by the arrival of the big data era. Generally, big data refers to a data set with a size of hundreds of TB, or several PB or even above, and it is often distributed, heterogeneous and in low-quality. It is critical to devise novel methods to manage big data since traditional database management techniques are unfeasible to manage big data efficiently and effectively, though such techniques, especially the commercial relational DBMSs, have achieved great success in the past decades. This paper discusses concrete requirements and realistic challenges of Data-Intensive Science and Engineering (DISE), ranging from data storage and organization, computational method, data analysis, to user interfaces. Meanwhile, data quality, data security and data curation should be paid more attentions. In this paper, we attempt to describe the architecture of DISE, review the recent progress, and discuss the challenges and future work briefly.
AB - Scientific exploration after experimental science, theoretical science and computational science phases, into data-intensive science phase, are accompanied by the arrival of the big data era. Generally, big data refers to a data set with a size of hundreds of TB, or several PB or even above, and it is often distributed, heterogeneous and in low-quality. It is critical to devise novel methods to manage big data since traditional database management techniques are unfeasible to manage big data efficiently and effectively, though such techniques, especially the commercial relational DBMSs, have achieved great success in the past decades. This paper discusses concrete requirements and realistic challenges of Data-Intensive Science and Engineering (DISE), ranging from data storage and organization, computational method, data analysis, to user interfaces. Meanwhile, data quality, data security and data curation should be paid more attentions. In this paper, we attempt to describe the architecture of DISE, review the recent progress, and discuss the challenges and future work briefly.
KW - Big data
KW - Challenge
KW - Data Intensive Science and Engineering (DISE)
KW - Requirement
UR - https://www.scopus.com/pages/publications/84867890388
U2 - 10.3724/SP.J.1016.2012.01563
DO - 10.3724/SP.J.1016.2012.01563
M3 - 文章
AN - SCOPUS:84867890388
SN - 0254-4164
VL - 35
SP - 1563
EP - 1578
JO - Jisuanji Xuebao/Chinese Journal of Computers
JF - Jisuanji Xuebao/Chinese Journal of Computers
IS - 8
ER -