TY - GEN
T1 - An application of spatial decision tree for classification of air pollution index
AU - Zhao, Minyue
AU - Li, Xiang
PY - 2011
Y1 - 2011
N2 - A decision tree is an analysis skill and a classification algorithm, whose basic principle is the combination of probability theory and an analysis tool of tree shapes. It derives a hierarchy of partition rules with respect to a target attribute of a large dataset. Nowadays, concrete coordinates exist in lots of datasets, which leads to the spatial distribution of datasets. However, conventional decision tree does not take the spatial distribution of records in the dataset into account, which makes it inadequate to deal with the geographical datasets. A number of new approaches to the analysis of geographical data have been proposed in recent years. In the purpose of evaluating the application of a spatial entropy-based decision tree, a spatial entropy-based decision tree that employed to classify the air pollution index (API) is presented in this paper. A spatial decision tree differs from a conventional tree in the way that it considers the spatial autocorrelation phenomena in the classification process. At each level of a spatial decision tree, the supporting attribute that gives the maximum spatial information gain is selected as a node. A case study oriented to the classification of API, whose study area is main cities in China, deals with the norms of the API, including density of total suspended particulate, density of SO2, density of NO2, and etc. After the process of data processing, and graphical analysis, it demonstrates a tree shape of the classification of the API and a map of the spatial distribution of the target attribute's categories, which illustrate the practicability of spatial decision tree.
AB - A decision tree is an analysis skill and a classification algorithm, whose basic principle is the combination of probability theory and an analysis tool of tree shapes. It derives a hierarchy of partition rules with respect to a target attribute of a large dataset. Nowadays, concrete coordinates exist in lots of datasets, which leads to the spatial distribution of datasets. However, conventional decision tree does not take the spatial distribution of records in the dataset into account, which makes it inadequate to deal with the geographical datasets. A number of new approaches to the analysis of geographical data have been proposed in recent years. In the purpose of evaluating the application of a spatial entropy-based decision tree, a spatial entropy-based decision tree that employed to classify the air pollution index (API) is presented in this paper. A spatial decision tree differs from a conventional tree in the way that it considers the spatial autocorrelation phenomena in the classification process. At each level of a spatial decision tree, the supporting attribute that gives the maximum spatial information gain is selected as a node. A case study oriented to the classification of API, whose study area is main cities in China, deals with the norms of the API, including density of total suspended particulate, density of SO2, density of NO2, and etc. After the process of data processing, and graphical analysis, it demonstrates a tree shape of the classification of the API and a map of the spatial distribution of the target attribute's categories, which illustrate the practicability of spatial decision tree.
KW - API
KW - spatial decision tree
KW - spatial entropy
UR - https://www.scopus.com/pages/publications/80052358858
U2 - 10.1109/GeoInformatics.2011.5981071
DO - 10.1109/GeoInformatics.2011.5981071
M3 - 会议稿件
AN - SCOPUS:80052358858
SN - 9781612848488
T3 - Proceedings - 2011 19th International Conference on Geoinformatics, Geoinformatics 2011
BT - Proceedings - 2011 19th International Conference on Geoinformatics, Geoinformatics 2011
T2 - 2011 19th International Conference on Geoinformatics, Geoinformatics 2011
Y2 - 24 June 2011 through 26 June 2011
ER -