Data-intensive science and engineering: Requirements and challenges

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

Scientific exploration after experimental science, theoretical science and computational science phases, into data-intensive science phase, are accompanied by the arrival of the big data era. Generally, big data refers to a data set with a size of hundreds of TB, or several PB or even above, and it is often distributed, heterogeneous and in low-quality. It is critical to devise novel methods to manage big data since traditional database management techniques are unfeasible to manage big data efficiently and effectively, though such techniques, especially the commercial relational DBMSs, have achieved great success in the past decades. This paper discusses concrete requirements and realistic challenges of Data-Intensive Science and Engineering (DISE), ranging from data storage and organization, computational method, data analysis, to user interfaces. Meanwhile, data quality, data security and data curation should be paid more attentions. In this paper, we attempt to describe the architecture of DISE, review the recent progress, and discuss the challenges and future work briefly.

Original languageEnglish
Pages (from-to)1563-1578
Number of pages16
JournalJisuanji Xuebao/Chinese Journal of Computers
Volume35
Issue number8
DOIs
StatePublished - Aug 2012

Keywords

  • Big data
  • Challenge
  • Data Intensive Science and Engineering (DISE)
  • Requirement

Fingerprint

Dive into the research topics of 'Data-intensive science and engineering: Requirements and challenges'. Together they form a unique fingerprint.

Cite this