跳到主要导航 跳到搜索 跳到主要内容

Using wide table to manage web data: A survey

科研成果: 期刊稿件文献综述同行评审

摘要

With the development of World Wide Web (www), storage and utilization of web data has become a big challenge for data management research community. Web data are essentially heterogeneous data, and may change schema frequently, traditional relational data model is inappropriate for web data management. A new data model, called Wide Table (or WT for simplicity), was introduced for this task. There are several characteristics of the WT model. First, WT is usually highly sparsely populated so that most data can be fit into a line or record. Second, queries are composed on only a small subset of the attributes. Thus, existing query processing and optimization techniques for relational database with normalized tables will not work efficiently anymore. Furthermore, WT is usually of extremely large volume. It is thought that only large-scale distributed storage can accommodate themassive data set. In this paper, requirements and challenges to web data management are discussed. Existing techniques for WT, including logical presentation, physical storage, and query processing, are introduced and analyzed in detail.

源语言英语
页(从-至)211-223
页数13
期刊Frontiers of Computer Science in China
2
3
DOI
出版状态已出版 - 9月 2008

指纹

探究 'Using wide table to manage web data: A survey' 的科研主题。它们共同构成独一无二的指纹。

引用此