# Introduction

With the rapid development of information technology and network technology, different trades produce large amounts of data every year. The data itself cannot bring direct benefits, so hidden information must be mined effectively from this huge amount of data. Data mining deals with searching for interesting patterns or knowledge in massive data; it turns a large collection of data into knowledge. Data mining is an essential step in the process of knowledge discovery (Lakshmi & Raghunandhan, 2011). It has become a unique tool for analyzing data from different perspectives and converting it into useful and meaningful information. Data mining has been widely applied in areas such as medical diagnosis, intrusion detection systems, education, banking, and fraud detection.

Classification is a form of supervised learning. Prediction and classification are two forms of data analysis in data mining that are used to extract models describing data classes or to predict future data trends. The classification process has two phases. The first is the learning phase, in which the training data sets are analyzed by a classification algorithm; the learned model, or classifier, is represented in the form of classification rules or patterns. The second phase is the use of the model for classification, in which test data sets are used to estimate the accuracy of the classification rules.
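The two phases can be illustrated with a minimal Python sketch. The rule learner below (one majority-class rule per attribute value) is only a stand-in chosen for brevity, not the classifier studied in this paper; the function names and the toy tuples are hypothetical.

```python
from collections import Counter, defaultdict

def learn_rules(training_set):
    """Phase 1 (learning): derive one rule per attribute value,
    mapping the value to the majority class seen in training."""
    labels_by_value = defaultdict(list)
    for value, label in training_set:
        labels_by_value[value].append(label)
    return {
        value: Counter(labels).most_common(1)[0][0]
        for value, labels in labels_by_value.items()
    }

def accuracy(rules, test_set, default="unknown"):
    """Phase 2 (classification): fraction of test tuples that the
    learned rules classify correctly. Values never seen in training
    have no rule and receive the `default` label."""
    correct = sum(
        1 for value, label in test_set if rules.get(value, default) == label
    )
    return correct / len(test_set)

# Hypothetical toy data: (attribute value, class label).
training_set = [
    ("sunny", "no"), ("sunny", "no"), ("sunny", "yes"),
    ("rain", "yes"), ("rain", "yes"),
]
test_set = [
    ("sunny", "no"), ("rain", "yes"), ("rain", "no"), ("overcast", "yes"),
]

rules = learn_rules(training_set)
test_accuracy = accuracy(rules, test_set)
print(rules, test_accuracy)
```

In this sketch the test tuple with the value `overcast` matches no learned rule, so it counts as an error when the accuracy is estimated.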

Authors: Department of Computer Science and Engineering, College of Technology and Engineering, Maharana Pratap University of Agriculture and Technology, Udaipur, Rajasthan, India. E-mails: dharm@mpuat.ac.in, naveenc121@yahoo.com, jullysamota304@gmail.com

With the rise of data mining, decision trees play an important role in data mining and data analysis. Decision tree learning uses a set of training data to generate a decision tree that correctly classifies the training data itself. If the learning process works, this decision tree will then correctly classify new input data as well. Decision trees differ along several dimensions, such as the splitting criterion, stopping rules, branch condition (univariate or multivariate), style of branch operation, and type of final tree (Han, Kamber & Pei, 2012).

The best-known decision tree induction algorithm is ID3, a simple decision tree learning algorithm developed by Ross Quinlan. Its predecessor is the CLS algorithm. ID3 follows a greedy, top-down, recursive, divide-and-conquer approach and uses information gain as its attribute selection measure. ID3 is known for its easy construction and strong learning ability. It has a problem, however: it is biased toward selecting attributes with many distinct values, which are not necessarily the best attributes. This problem limits its practicality. The ID3 algorithm also does not backtrack while searching: once a certain level of the tree chooses an attribute to test, it never reconsiders this choice. Attribute selection greatly affects the accuracy of a decision tree (Quinlan, 1986).
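The information gain measure and its bias toward many-valued attributes can be seen in a small Python sketch. It uses the standard entropy-based definition of information gain; the attribute names (`id`, `outlook`) and the four toy tuples are hypothetical and are not taken from the data sets used in this paper.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def information_gain(rows, labels, attribute):
    """Reduction in entropy obtained by splitting the tuples on `attribute`."""
    total = len(labels)
    partitions = {}
    for row, label in zip(rows, labels):
        partitions.setdefault(row[attribute], []).append(label)
    # Weighted entropy of the partitions produced by the split.
    remainder = sum(len(p) / total * entropy(p) for p in partitions.values())
    return entropy(labels) - remainder

# Hypothetical toy data: `id` is unique per tuple, `outlook` has two values.
rows = [
    {"id": 1, "outlook": "sunny"},
    {"id": 2, "outlook": "sunny"},
    {"id": 3, "outlook": "rain"},
    {"id": 4, "outlook": "rain"},
]
labels = ["no", "yes", "yes", "yes"]

gains = {a: information_gain(rows, labels, a) for a in ("id", "outlook")}
best = max(gains, key=gains.get)  # greedy choice made at one tree node
print(gains, best)
```

Because every value of `id` isolates a single tuple, each partition is pure and `id` receives the largest possible gain, although it is useless for classifying new data. This is the bias described above.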

The rest of the paper is organized as follows. Section 2 gives a brief introduction to related work in the area of decision tree classification. Section 3 briefly introduces the proposed work. Section 4 presents the experimental results and comparison. Section 5 concludes the paper.


# II. Related Work

The structure of a decision tree classifier is easy to understand, so decision trees are especially used when we need to understand the structure of trained knowledge models. If irrelevant attributes are selected, all results suffer. The selection space of the data is very small, and if the space is increased the selection procedure suffers, so attribute selection is a problem in classification. There have been many efforts to achieve better classification accuracy.

Weighted and simplified entropy in decision tree classification is proposed for the problems of multiple-value property selection, selection criteria, and property value vacancy. The method improves efficiency and precision (Li & Zhang, 2010). A comparison of attribute selection techniques that rank attributes is presented. If irrelevant, redundant, and noisy attributes are added during model construction, predictive performance is affected, so useful attributes need to be chosen along with background knowledge (Hall & Holmes, 2003). To improve the classification accuracy rate and the depth of the tree, an adaptive step forward/decision tree (ASF/DT) is proposed. The method considers not just one attribute but two, which can find a larger information gain ratio (Tan & Liang, 2012). A new heuristic technique for attribute selection criteria is introduced. The best attribute, the one with the least heuristic function value, is taken. The method can be extended to larger databases with the best splitting criteria for attribute selection (Raghu, Venkata Raju & Raja Jacob, 2012). An interval-based algorithm is proposed. The algorithm has two phases for attribute selection: the first phase ranks the attributes, and the second selects the subset of attributes with the highest accuracy. The proposed method is applied to a real-life data set (Salama, El Bendary, Hassanien, Revett & Fahmy, 2011).

Large training data sets have millions of tuples. Decision tree techniques have the restriction that the tuples should reside in memory, so the construction process becomes inefficient due to swapping of tuples in and out of memory. More scalable approaches are required to handle such data (Changala, Gummadi, Yedukondalu & Raju, 2012). An improved learning algorithm based on uncertainty deviation is developed. The rationality of the attribute selection test is improved, and the improved method shows better performance and stability (Sun & Hu, 2012).

An equivalence between multilayer neural networks and decision trees is presented. The advantage of this mapping is that it provides a self-configuration capability to the design process. It is possible to restructure a given decision tree as a multilayered network (Sethi, 1990). A comparison of different types of neural network techniques for classification is presented. Evaluation and comparison are done with three benchmark data sets on the basis of accuracy (Jeatrakul & Wong, 2009).

The computation may be too heavy if there is no preprocessing in the input phase, and some attributes are not relevant. To rank the importance of attributes, a novel separability correlation measure (SCM) is proposed. Different attribute subsets are used in the input phase. Irrelevant attributes are those that increase the validation error (Fu & Wang, 2003).


# III. Proposed Method

The input processing of the training phase is a data sampling technique for the classifier. Single-layer radial basis function (RBF) networks can learn virtually any input-output relationship (Kubat, 1998). The cascade-layer network has connections from the input to all cascaded layers; these additional connections can improve the speed. Artificial neural networks (ANNs) can find internal representations of dependencies within data that are not given explicitly. The short response time of ANNs and the simplicity of building them encouraged their application to the task of attribute selection.
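To make the notion of a single-layer RBF network concrete, the following Python sketch computes the forward pass of such a network. It assumes Gaussian basis functions with a shared width, which is a common choice but is not stated in this paper; the centers, weights, and function names are hypothetical, and no training procedure or cascade connection is shown.

```python
import math

def rbf_activations(x, centers, width):
    """Gaussian response of each hidden unit to the input vector x
    (assumed basis function; the response decays with distance to the center)."""
    return [
        math.exp(-sum((xi - ci) ** 2 for xi, ci in zip(x, c)) / (2 * width ** 2))
        for c in centers
    ]

def rbf_output(x, centers, width, weights, bias=0.0):
    """Network output: a linear combination of the hidden-unit responses."""
    hidden = rbf_activations(x, centers, width)
    return bias + sum(w * h for w, h in zip(weights, hidden))

# Hypothetical two-unit network: one center per class region.
centers = [(0.0, 0.0), (1.0, 1.0)]
weights = [1.0, -1.0]

score_near_first = rbf_output((0.0, 0.0), centers, width=0.5, weights=weights)
score_near_second = rbf_output((1.0, 1.0), centers, width=0.5, weights=weights)
print(score_near_first, score_near_second)
```

An input close to the first center yields a positive score and an input close to the second center yields a negative one, so the sign of the output can serve as a two-class decision in this toy setting.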


# Process Method
![Figure 1 : Process block diagram of modified ID3-CRBF](image-2.png "")
![Figure 2 : Trained data, test data and unclassified region classification](image-3.png "Figure 2 :")
Table 1

| Cross Fold Ratio | Accuracy: ID3 | Accuracy: ID3_CRBF |
|---|---|---|
| 5 | 86.18 | 97.18 |
| 6 | 81.95 | 92.95 |
| 7 | 81.95 | 92.95 |

Figure 3 : Graphical representation of Table 1
© 2013 Global Journals Inc. (US) Global Journal of Computer Science and Technology
		
		
## VI. Conclusion

In this paper, we have experimented with a cascaded model of RBF with ID3 classification. The standard presentation of each attribute selected by ID3 is calculated and used to classify the given data. From the experiments, we can say that the cascaded model of RBF with the ID3 approach provides better accuracy and reduces the unclassified region. The increased classification region improves the performance of the classifier.


## References