Fuzzy SLIQ Decision Tree Based on Classification Sensitivity

Full Text (PDF, 247KB), PP.18-25

Views: 0 Downloads: 0

Author(s)

Hongze Qiu 1,* Haitang Zhang 1

1. Shandong University/School of Computer Science and Technology, Jinan, China

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2011.05.03

Received: 13 Jul. 2011 / Revised: 14 Aug. 2011 / Accepted: 12 Sep. 2011 / Published: 8 Oct. 2011

Index Terms

Decision trees, SLIQ, gini index, fuzzy set theory, sensitivity degree, membership function, G-FDT

Abstract

The determination of membership function is fairly critical to fuzzy decision tree induction. Unfortunately, generally used heuristics, such as SLIQ, show the pathological behavior of the attribute tests at split nodes inclining to select a crisp partition. Hence, for induction of binary fuzzy tree, this paper proposes a method depending on the sensitivity degree of attributes to all classes of training examples to determine the transition region of membership function. The method, properly using the pathological characteristic of common heuristics, overcomes drawbacks of G-FDT algorithm proposed by B. Chandra, and it well remedies defects brought on by the pathological behavior. Moreover, the sensitivity degree based algorithm outperforms G-FDT algorithm in respect to classification accuracy.

Cite This Paper

Hongze Qiu, Haitang Zhang, "Fuzzy SLIQ Decision Tree Based on Classification Sensitivity", International Journal of Modern Education and Computer Science(IJMECS), vol.3, no.5, pp.18-25, 2011. DOI:10.5815/ijmecs.2011.05.03

Reference

[1]B. Chandra, P. Paul, Fuzzifying Gini Index based decision trees, Expert Systems with Applications 36 (2009) 8549-8559.
[2]Cristina Olaru, Louis Wehenkel, A complete fuzzy decision tree technique, Fuzzy Sets and Systems 138 (2003) 221-254.
[3]Xavier Boyen, Louis Wehenkel, Automatic induction of fuzzy decision trees and its application to power system security assessment, Fuzzy Sets and Systems 102 (1999) 3-19.
[4]Yufei Yuan, Michael J. Shaw, Induction of fuzzy decision trees, Fuzzy Sets and Systems 69 (1995) 125-139.
[5]A. Suarez, F. Lutsko, Globally optimal fuzzy decision trees for classification and regression, IEEE Transactions on Pattern and Machine Intelligence 21 (12) (1999) 1297-1311.
[6]Cezary Z. Janikow, Fuzzy Decision Trees: Issues and Meth-ods, IEEE Transaction on System, Man, and Cybernetics-PART B: Cybernetics, 28 (1) (1998).
[7]J. Dombi, Membership function as an evaluation, Fuzzy Sets and Systems 35 (1990) 1-21.
[8]Koen-Myung Lee, Kyung-Mi Lee, et al, A Fuzzy Decision Tree Induction Method for Fuzzy Data, IEEE International Fuzzy Systems Conference Proceedings (1999).
[9]M. Mehta, R. Agrawal, and J. Riassnen, SLIQ: A fast scalable classifier for data mining, Extending Database Technology. (1996) 160-169.
[10]B. Chandra, P. Paul Varghese, On Improving Efficiency of SLIQ Decision Tree Algorithm. Proceedings of International Joint Conference on Neural Networks (2007).
[11]Xizhao Wang, Bin Chen, Guoliang Qian, Feng Ye, On the optimization of fuzzy decision trees, Fuzzy Sets and Systems 112 (2000) 117-125.
[12]Motohide Umano, Hirotaka Okanoto, et al, Fuzzy Decision Trees by Fuzzy ID3 Algorithm and Its Application to Diagnosis System, IEEE (1994) 2113-2118.
[13]Malcolm J. Beynon, Michael J. Peel, Yu-Cheng Tang, The application of fuzzy decision tree analysis in an exposition of the antecedents of audit fees, Omega 32 (2004) 231-244.
[14]Chih-Chung Yang, N. K. Bose, Generating fuzzy membership function with self-organizing feature map, Pattern Recognition Letters 27 (2006) 356-363.
[15]Robert Lowen, Fuzzy Set Theory: Basic Concepts, Techniques and Bibliography. Springer; 1 edition (May 31, 1996).
[16]Hongwen Yan, Rui Ma, Xiaojiao Tong, SLIQ in data mining and application in the generation unit’s bidding decision system of electricity market, The 7th International Power Engineering Conference (2005).
[17]Chandra, B., Mazumdar, S., Arena, V., Parimi, N., Elegant decision tree algorithm for classification in data mining, Proceedings of the Third International Conference on Web Information Systems Engineering (Workshops) (2002).
[18]B. Chandra, P. Paul Varghese, Fuzzy SLIQ Decision Tree Algorithm, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics (2008) 1294-1301.
[19]Shu-Cherng Fang, et al, An efficient and flexible mechanism for constructing membership functions, European Journal of Operational Research 139 (2002) 84–95.
[20]Tzung-Pei Hong, Jyh-Bin Chen, Finding relevant attributes and membership functions, Fuzzy Sets and Systems 103 (1999) 389-404.
[21]B. Apolloni, G. Zamponi, A.M. Zanaboni, Learning fuzzy decision trees, Neural Networks 11 (1998) 885–895.
[22]Quinlan, J. R, Introduction of decision tree, Machine Learning 1, (1986) 81–106.
[23]Quinlan, J. R., Improved use of continuous attributes in C4.5, Journal of Artificial Intelligence Research 4, (1996) 7–90.
[24]R. Weber, Fuzzy ID3: a class of methods for automatic knowledge acquisition, Proceedings of the 2nd International Conference on Fuzzy Logic and Neural Networks, Iizuka, Japan, (1992) 265–268.
[25]De Luca, A., Termini, S. A definition of a non probabilistic entropy in the setting of fuzzy sets theory, Information and Control 20 (1976) 301–312.
[26]Klir, G. J., Higashi, M., Measures of uncertainty and information based on possibility distributions, International Journal of General Systems 9. (1983) 43–58.
[27]Myung, W. K., Khil, A., Joung, W. R.., Efficient fuzzy rules for classification, In Proceedings of the IEEE international workshop on integrating ai and data mining (2006).
[28]X. Boyen, L. Wehenkel, Automatic induction of continuous decision trees, To appear in Proc. of IPMU96, Info. Proc. and Manag. of Uncertainty in Knowledge-Based Systems, Granada (SP)(1995).
[29]Louis Wehenkel, On Uncertainty Measures Used for Decision Tree Induction, in: Proc. IPMU’96 , Information Processing and Management of Uncertainty in Knowledge-Based Systems, Granada, July 1996, pp. 413-418.