# I.

I ntroduction s per high pace rise in software applications and major dependency on it, the fault prediction has become one of the inevitable parts of software development life cycle (SDLC) that can play significant role in reducing the probability of software failure.

Software defect prediction (SDP) can be performed while planning to identify fault-prone modules in software product that as a result can provide the insight to the need for increased quality of monitoring during software development. In addition, it can also facilitate necessary approaches to incorporate certain proper fault verification schemes leading to enhanced software quality [1,2,3,4] and reliability. SDP can be functional based on certain software metrics [3,4,5], such as source code changes, previous defects, etc. In fact software metrics are the quantitative data that are employed for characterizing the properties of source code and can be significant for predicting software quality. The efforts made through many generations have facilitated a number of schemes to mitigate defects, but the continuation of researches still indicates towards search for certain optimal SDP solution to ensure optimal performance, reliability, cost optimization and minimal maintenance. A number of efforts have been made for SDP using machine learning and neural network [6,7,8,9,10], clustering techniques, statistical method, mining and random forest [44,45,50] etc. In recent years, majority of software are being developed based on Object-Oriented (OO) paradigm. Thus, the quality of the software can be optimally assessed by employing software metrics, such as Abreu MOOD metric suite [11], QMOOD metrics suite [12], Bieman and Kang [13], Briand et al. [14], Etzkorn et al. [15], Halstead [16], Henderson-sellers [17], Li and Henry [18], McCabe [19], Tegarden et al. [20], Lorenz and Kidd [21] and CK metric [22] suite. These software metrics plays significant role in assessing the quality of software such as precision, accuracy, fault-resilience and sensitivity etc. The significance of these object oriented software metrics lies in their capability to predict the software quality in terms of adaptability, functionality, usability, portability, supportability, reliability and cost effectiveness. Predominantly two data driven algorithms, support vector machine (SVM) and artificial neural network (ANN) algorithms have been employed for fault detection. ANN approach functions on the basis of the human brain behaviorand possesses neurons and directed edges with certain weights existing between input and output layers. ANN employs output as the input so as to learn complex non-linear input-output relationship and can be stated to be a complex nonlinear mapping model between input and output layer. The processes in ANN comprise data sets to enhance the weight parameters, risk minimization scheme for stopping training as soon as the learning error enters in expected margin level. In fact, ANN has been employed in numerous utilities, but still it possesses certain limitations in terms of slow learning ability, local minima etc and hence require further optimization to achieve certain optimal SDP efficiency and performance. Thus, there is the requirement of further optimization of ANN approaches to accomplish a potential SDP solution. Some researches [23,24] advocate the implementation of evolutionary computing techniques for SDP optimization. This paper proposes a novel evolutionary computing based enhanced ANN algorithmnamed Hybrid Evolutionary Computation


# Year ( )


# D

based Neural Network (HENN) for defect prediction and classification. HENN system employs Adaptive Genetic Algorithm (A-GA) for optimal weight estimation so as to enhance weight update and learning efficiency of the ANN.In this paper, the object oriented software metrics, CK metrics [22] have been employed as a fault classification data and the respective performance has been analyzed using confusion matrix.

The remaining sections discusses, related work in Section II, problem definition is briefed in Section III, which has been followed by proposed research discussion in Section IV. Section V presents the results and analysis and conclusion has been discussed in Section VI.


# II.


# Related Work

The emergence of software applications and associated need of quality and reliability has motivated software practitioners as well as academia to develop certain novel scheme for defect prediction.With an objective to examine the relation between software metrics and associated faults some initiatives were made in [25,26,27,28,29,30] where machine learning mechanism were used for fault detection. With an enthuse to compare the performance of varied other schemes such as decision trees, naïve Bayes, and 1rule [31] performed fault detection using NASA MDP project. Chug et al [32] performed data mining based fault estimation using conventional J48, Random Forest, and Naive Bayesian Classifier (NBC) schemes but still couldn't employ the benefits of advanced classification schemes. With an objective to enhance conventional schemes Pushphavathi et al [33] introduced hybrid scheme of random forest (RF) and Fuzzy C Means (FCM) clustering. Then while, these systems were found limited for unbalanced data sets, which motivated author [34] to propose an approach called AdaBoost.NC that explored varied kinds of class imbalance learning schemes comprising resampling techniques, threshold moving, and ensemble algorithms. With an objective to explore SVM optimization in [35,36] a dynamic SVM model was proposed for fault detection in source code using with error data and faulty code execution. Researcher in [37] developed an ANN based SDP system. This is the matter of fact that SVM refers the functional paradigm of single layer perceptron's NN which on addition with kernels behaves like multilayered perceptron's [38]. Till available systems based neural network with conventional learning and weight estimation suffers from local optima and convergence issue, which has not been discussed dominantly. On contrary, these days the software are developed and examined for faults using object oriented software metrics which even being significant has not been explored in depth to ensure optimal solution for reliability oriented defect prediction. This paper intends to provide an optimal solution for software defect prediction using evolutionary computing based neural network for efficient fault classification.


# III.


# Problem Definition

In software development life cycle the reliability assurance is of great significance and to achieve it, the defect prediction is an inevitable need. The defect prediction can be performed using software metrics data, in which either it is predicted whether the code is defective or not or the magnitude of the probable defect and its severity is examined. In this research work, the predominant questions are whether evolutionary computing schemes, specifically GA can optimize neural network based artificial intelligence (AI) to achieve optimal software defect prediction. An another question that this research paper considers is that whether the conventional Genetic Algorithm can be further enhanced to deal with a scenario where multiple chromosomes are having similar fitness, and how this enhancement would perform classification or fault prediction?. In order to explore the answers of this significant question, in this paper it has been intended to optimize ANN learning and respective optimal weight estimation using GA, which has further being optimized to behave as an Adaptive GA (A-GA) scheme that ensures adaptive GA parameters (Crossover and mutation) estimation. Here, considering requirements of object oriented software metrics, CK metrics [22] have been considered that characterizes overall features of software in terms of varied component features. In this paper, the key software metrics considered are WMC, NOC,DIT,CBO,RFC,LCOM, which can be considered for defect prediction in certain class or data model. Based on the proposed model, the defect can be predicted which can be useful for ensuring quality and reliability of the software product. Given a training data, certain learning model can be developed that can classify the data for its faulty or non-faulty status. The artificial intelligence technique neural network has been used extensively so far for classification utilities, but being conventional these approaches do suffer from local minima and weight update issues. Thus, to enhance the systems, certain global optimization schemes like evolutionary computing can be considered. Since Particle Swarm Optimization suffers due to optimal minima and convergence issues, here we proposed an adaptive GA (A-GA) for ANN weight estimation where the weights are estimated dynamically in each iteration. Here, mean square error has been considered as the fitness value for A-GA. Further, the GA parameters such as crossover probability and mutation probability can be adaptively updated to make the overall system more robust and efficient. The optimization of ANN with A-GA can make it more effective and can be a potential candidate for fault detection in SDLC applications. The


# Global Journal of C omp uter S cience and T echnology

Volume XV Issue I Version I Year ( ) D performance evaluation for these two approaches can be done in terms of accuracy, precision, recall, specificity etc.


# IV.


# Proposed System

This section discusses the proposed evolutionary computing based hybrid neural network (HENN) for software defect prediction. HENN: Evolutionary Computing Based Neural Network for Software Defect Prediction Neural networks (NN) have seen an explosion of interest over the years, and are being successfully applied across a range of problem domains. Indeed, anywhere dealing with the problem of classification and prediction, neural networks are being used. For software defect prediction, ANN can be employed with learning approaches such as Gradient Descent (GD), Gauss Newton, and Levenberg Marquardt (LM) etc. Unlike conventional approach, in this paper, we have proposed an evolutionary computing technique called Adaptive Genetic Algorithm for ANN learning optimization and weight estimation, which has been further employed for fault prediction. Here, we intend to find relation between object oriented software metrics and fault prone classes and six CK metrics; WMC, NOC, DIT, RFC, CBO, LCOM have been taken as independent variable while fault data has been considered as dependent data. To design ANN, six inputs have been considered which do receive CK metrics individually as input having multiple classes, as per benchmark data (here PROMISE data). In this paper we have considered 8 hidden layers. Since, in the proposed SDP model, only FAULTY and NON-FAULTY are the results expected for prediction, therefore only one output node. The overall design of the proposed ANN model can be presented as follows: The above mentioned figure illustrates the architecture of ANN containing three layers i.e., input layer, hidden layer and output layer. In the considered ANN model, the linear activation function has been used for input layer i.e., the output of the output layer is treated as input of the input layer(?? ?? = ?? ?? ). Further, the sigmoid function has been employed for hidden layer?? ? . Hence, the result of the hidden nodes ?? ? with the fed input of ?? ? is estimated mathematically as ?? ? = 1 1+?? ??? ? and final outcome of output nodes O o is presented mathematically by?? ?? = 1 1+?? ??? ?? . In general, ANN is represented by a function ?? ? = ð??"ð??"(??, ??) where ??? represents the output vector and?? and ?? are the weight vector and the input vector respectively. In process, the weight factor ??is updated in each iterations to reduce Mean Square Error (MSE), which is estimated as follows:
?????? = 1 ?? ? ??? ?? ? ? ?? ?? ? 2 ?? ??=1 (1)
Where ?? represents the actual output while the expected output is given by?? ?? ? . In order to process the datasets using ANN, at first the normalization of data is required. A discussion of the proposed data normalization technique in this paper is given as follows:


# a) Data normalization

In the proposed model, initially the normalization has been performed before data processing that strengthens the system for better readability and defect prediction. Here the data normalization has been done over the range of [0, 1] for adjusting the defined range of input feature value and avoid the saturation of neurons. A number of schemes such as Min-Max normalization, Z-Score normalization and decimal scaling can be employed for the purpose of data normalization. In this paper, Min-Max normalization approach has been used that performs a linear transformation on the original data and maps each of the actual data ?? ?? of attribute ?? to normalized value ??? ?? that exists in the range of [0, 1]. The Min-Max based normalized data has been obtained by the following expression:
????????????????????(?? ?? ) = ?? ?? " = ?? ?? ??????? (??) ?????? (??)??????? (??)(2)
Where ??????(??) and ??????(??) represent the maximum and minimum value of the attribute ?? respectively. Performing data normalization the ANN model has been employed for fault classification and SDP functions.

In ANN based artificial intelligence systems, the efficient weight estimation is of great significance and till existing approaches have explored techniques such as Gauss Newton, Gradient descent, Levenberg Marquardt etc. Unfortunately these approaches couldn't be enhanced by scientific society to make weight estimation effective by means of certain global optimization techniques such as Genetic Algorithm.


# Efficient weight estimation during ANN learning can


# Global Journal of C omp uter S cience and T echnology

Volume XV Issue I Version I Year ( ) D make classification optimal. This requirement motivated us to employ genetic algorithm for dynamic weight estimation during ANN learning. A brief discussion of the proposed Adaptive Genetic Algorithm (A-GA) is given in the following section.


# b) Adaptive Genetic Algorithm(A-GA)

Genetic Algorithm (GA) is an adaptive search method for finding optimal or near optimal solutions, premised on the evolutionary ideas of natural selection. The fundamental concept of GA is emphasized on simulating processes in the natural system required for evolution, distinctively those that consider the Charles Darwin principles representing the terms of the survival of the fittest. Considering procedural flow, GA at first generates the initial population arbitrarily, where population refers a set of solutions. These solutions are nothing else but a chromosome that possesses a form of binary strings where all the comprising parameters are supposed to be encoded. Generating the population, GA estimates the fitness function of individual chromosome. Here the fitness function states toward a user-defined function that returns the evaluation results of each chromosome, thus a higher fitness value means its chromosome is a dominant gene. As per retrieved fitness values, offspring are generated using genetic operators-crossover and mutation. Applying these genetic operators the generations of the population are repeated iteratively until the stopping criteria are satisfied and an optimal solution is achieved. As illustrated in Figure -1, in this paper, the proposed HENN model comprises?? ? ? ? ?? network configuration with ?? input layer, ? hidden layer and ?? output layer or neurons. In the proposed ANN model, all the six considered CK metrics or feature vector are fed as input to the individual input node, where each feature vector metrics accompanies the number of classes available in datasets. Considering Figure-1 and relevant network configuration, there is N weight required to be estimated. Mathematically, the number of weight vectors is:
?? = (?? + ??) * ?(3)
Here, the individual weight, which is considered as gene in the chromosomes of the A-GA, is a real number. Considering the gene length or the number of digits be??. Then the length of the chromosome ?? ????????? can be estimated by the following expression:
?? ????????? = ?? * ?? = (?? + ??) * ? * ??(4)
These all chromosomes are considered as the population of the genetic algorithm. In the proposed model to estimate the fitness value of the individual chromosome, the weights are required to be extracted from the individual chromosome. In our proposed model, the weights (?? ?? ) are estimated by the following expression: 
?? ?? = ? ? ? ? ? ??ð??"ð??" 0 ? ?? ???? +1 < 5 ? ?? ???? +2 * 10
In order to process the Adaptive Genetic Algorithm (A-GA), the fitness values for each chromosome are required to be estimated. The fitness generation algorithm for the proposed A-GA system is given in Figure -2 
?? ?? = 1 ?? ?? = 1 ? ? ?? ?? ?? =?? ?? =1
?? Figure 2 : Algorithm for Fitness generation using A-GA This is the matter of fact that the evolutionary computing scheme named Genetic Algorithm has established itself as a potential optimization technique for various application scenarios, still this approach possess scopes for further optimization that specifically depends on the working environment. In this paper, there might be the possibility that after every generation to achieve optimal fitness, certain new population would be generated and thus the processing data might be increased after each iterations, thus resulting into certain


# Global Journal of C omp uter S cience and T echnology

Volume XV Issue I Version I Year ( ) D restraints such as premature convergence caused due to local optima and low convergence speed, which is common in other evolutionary techniques such as Particle Swarm Optimization. In order to alleviate these issues, the parameters like cross over probability (?? ?? ) and mutation probability (?? ?? ) can be made dynamic and weight adaptive. In addition, such novelty can deal with a common scenario, where there is the possibility of multiple chromosomes having similar fitness value, causing degraded classification accuracy. Taking into consideration of these all factors and motivations, in this paper a weight adaptive genetic algorithm (A-GA) has been developed where the genetic parameters (Crossover and mutation) are updated dynamically. In the proposed approach the parameters ?? ?? and ?? ?? have been dynamically updated by means of the following mathematical model:
(?? ?? ) ??+1 = (?? ?? ) ?? ? ?? 1 * ?? 5 (6) (?? ?? ) ??+1 = (?? ?? ) ?? ? ?? 2 * ??5
Where (?? ?? ) ??+1 and (?? ?? ) ??+1 represent the updated probability of cross over and mutation, (?? ?? ) ?? and (?? ?? ) ?? are the current probability of cross over and mutation, ?? 1 and ?? 2 can be the positive constant and?? is the number of chromosome having same fitness value. Thus, implementing these discussed approaches, if the final output estimated is greater than 0.5, then the class is labeled as FAULTY otherwise NON-FAULTY. Figure -3 represents the overall process of software defect prediction using Adaptive Genetic Algorithm (A-GA).  Weight Estimation: Obtained the weight vector W_kfor each chromosome as the input to hiddenlayer and hidden layer to output layer and thus the weight of input to hidden node and hidden node to output are estimated using equation 5. Fitness Estimation: On the basis of weights retrieved, the fitness value is estimated for each chromosome, where the proposed HENN intends to minimize the mean square error as defined in Figure 2. Ranking of Chromosomes: Perform the ranking of each chromosomes based on respective fitness value and substitute the chromosomes with minimum fitness value by the chromosomes with highest fitness value chromosome.

Perform two point crossover processdynamically vary the GA parameters P c and P m till reaching optimal criteria using equation (6).

In the simulation model, the initial P c and P m are 0.6 and 0.1 respectively and n signifies the number of chromosome having similar fitness value. Stopping Criteria: The developed system terminates once the 95% chromosomes in the gene pool accomplishes its unique fitness value and beyond this the fitness level of chromosomes gets saturated. Classify Faults: If the final weight is greater than 0.5, then the class is labeled as FAULTY otherwise NON-FAULTY. Confusion Matrix Generate the confusion matrix for each classes of OO-SM and classify fault/non-fault distribution for performance evaluation.

Thus, employing the proposed HENN model, the fault classification and prediction has been done. The simulation, results and discussion is provided in the following section.

V.


# Result and Analysis

This section discusses the research variables, simulation setups, results obtained and respective performance analysis.


# a) Data collection

In this paper, the CK metric suites have been employed which have been defined for varied objectives such as software fault detection/prediction, effort evaluation, re-usability and maintenance. Considering the robustness of CK metric suite [27], it has been used as object oriented software metrics which has been processed using Chidamber and Kemerer Java Metrics tool (CKJM) tool that extracts software metrics by executing byte code of compiled Java cases and assigns a definite weight of the comprising classes having feature vectors. In this paper, PROMISE fault benchmark data [39] and NASA MDP datasets [40] and PROMISE repository to evaluate the performance of the proposed fault prediction scheme. We intended to establish the relationship between Object-Oriented software metrics (OO-SM) and the fault proneness at the class level. In order to perform defect prediction using regression analysis paradigm, we have considered fault as a dependent variable while the CK metric as the independent variable. The predominant OO-SM metrics are given in Table-1 In our work, we have developed a function to explore the relation between Object-Oriented software metrics (OO-SM) (WMC, NOC, DIT, RFC, CBO and LCOM) and faults existing in class under consideration. The minimization of faults can be of great significance towards optimization of software equality, and to ensure optimal defect prediction, the fault has been derived as the function of software metrics as illustrated as follows: ???????????? = ð??"ð??"(??????, ??????, ??????, ??????, ??????, ????????)

We used four public domain defect datasets from the PROMISE repository [9][39]. The considered data sets are JEdit, IVY, Ant and Camel which contain static code measures along with varied modules sizes, defective modules and defect rates. In our simulation model, the respective extracted weights and features of the data classes are taken as input. The datasets with respective classes or modules are given in Table-2. In this paper, HENN algorithm has been developed for simulation using MATLAB 2012b software tool having artificial intelligence and ANN toolboxes. The proposed models examined defect datasets and the FAULTY and NON FAULTY data have been classified. Here on the basis of FAULT distribution by proposed model, a confusion matrix has been generated that encompasses two rows and columns comprising true negatives, true positive, false negative and false positive variables. The respective values of True negatives (TN) refer the modules which are NON FAULTY or fault-free on the other hand, true positives (TP) represents for those modules which are classified as FAULTY. False negatives (FN) are those modules which are FAULTY and are classified incorrectly as NON FAULTY. Similarly, false positives (FP) modules are those modules which are faultless but are classified incorrectly as FAULTY. A matrix presentation of confusion matrix is given in Table 3. Generally, the meanings of the values of the binary variables are not needed to be defined, however, in our work, especially for performance assessment the variables have been labeled as positive and negative. The positive levels refer towards the results as FAULTY in that specific simulation scenario. In this paper, we have measured the performance of the proposed HENN SDP in terms of correctness, precision, F-measures, accuracy, recall, specification and cost factor analysis. A brief mathematical definition of these variables is given as follows: 91.8 --C4.5 [47] 88.39 --J 48 [47] 90.90 Levenberg-Marquardt-NN [47] 88.0 --NNEP-Evolutionary [43] 88.8 81.2 -PSO [46] 78.78 --PSO-NN [48] 97.75 --HENN SDP * 97.9 * 1 98.9


# *-The best performance of HENN

Thus, the results obtained exhibit that the optimization made by means of Adaptive Genetic Algorithm has enhanced ANN learning for fault detection. The ultimate results obtained for HENN represents the most effective and optimal results as compared to other existing approaches, especially neural network based SDP models. The performance analysis for the proposed systems is given in Table-6.  


# Conclusion

Software defect prediction has become an inevitable need for organizations to ensure quality and reliability of software products. The early defect prediction can facilitate managers to rectify and enrich reliability of product. Approaches such as machine learning and neural network have become eminent solution for training and classification of data and can be significant for defect prediction. However, these approaches need optimization in terms of weight update, parametric enhancement while performing defect prediction. The local minima and convergence issue of ANN can be significantly dealt with employing evolutionary computing schemes and the implementation of genetic algorithm can be the dominant candidate. In this paper, Adaptive Genetic Algorithm (A-GA) has been used for ANN optimization, where A-GA functions for optimal weight estimation. The proposed HENN model has been tested with PROMISE data sets, where the average accuracy for HENN was retrieved as 87.23 % while the best classification performance was observed with JEdit datasets where HENN exhibited 97.99% accuracy while ensuring 100% precision. Performance in terms of F-measure using HENN was obtained as 98.97%. The results also depicted  
1![Figure 1 : ANN model for Defect prediction](image-2.png "Figure 1 :")


10 ???2??ð??"ð??" 5 <= ?? ???? +?? <= 9+?? ???? +2 *  10 ???2 +?? ???? +3 *  10 ???3 +?+?? (??+1)?? 10 ???2
Algorithm for Fitness EstimationInput:?? ? ?? = (?? 1?? , ?? ?? = ? ? ? ? ? ? ? ? ?? ???? +2 Phase-4: Estimate Root mean square error (RMSE) of ??ð??"ð??" 0 ? ?? ???? +1 < 5chromosome?? ???? ?? = ?? ?? =?? ?? =1 ???? ??Where ?? is the total number of training data setPhase-5: Estimate the fitnessvalue for chromosome?? ??* 10 ???2 + ?? ???? +3 * 10 ???3 + ? + ?? (??+1)?? 10 ???2 ??ð??"ð??" 5 <= ?? ???? +?? <= 9 + ?? ???? +2 * 10 ???2 + ?? ???? +3 * 10 ???3 + ? + ?? (??+1)?? 10 ???2
1[22])WMCOverall complexities of the methods incomprising classesNOCNumber of sub-classes subordinate to a classin the class hierarchyDITMaximum height of the class hierarchyCBONumber of other classes to which it is alliedwithRFCA set of approaches that can be executed inresponse to a message received by an objectof that classLCOMDissimilarity measurement of varied methods ina class using instanced attributes/variablesNOMNumber of methods (in a class)NOANumber of attribute (in a class)NOAINumber of attributes inherited by subclasses.NOMINumber of methods inherited by subclasses.Fan-inTotal number of local flows in certain processand data structures from where it retrievesinformationFan-outTotal number of local flows in certain processand data structures from where it retrievesinformationNOPMTotal number of private methods in a classNOPATotal number of private attribute in a classNOPMTotal number of public methods in a classNOPATotal number of public attribute in a classNLOCSize of program by counting the number oflines in the source code.
2PROMISEJEditIVYAntCamelNumber of492352744965modules
3PredictedPredicted DefectDefectiveFreeFAULTYTrue PositiveFalse NegativeNON-FAULTY False PositiveTrue Negative
4ConstructMathematicalDescriptionExpressionRecallTP/(TP+FN)Proportionofdefectiveunitscorrectly classifiedPrecisionTP/(TP+FP)Proportion of Unitscorrectly predictedas defectiveSpecification TN/(TN+FP)Proportionofcorrectly classifiednon defective units
62015YearVolume XV Issue I Version I( ) DGlobal Journal of C omp uter S cience and T echnologyTechniqueDataModules Accuracy PrecisionF-MeasureRecallSpecificationHENNJEdit4920.979910.989710.9756HENNIVY3520.88350.99360.93800.88830.3333HENNAnt7440.81450.93430.88670.84380.6346HENNCamel9650.811410.89520.81021
5

			© 2015 Global Journals Inc. (US) 1
			© 2015 Global Journals Inc. (US)
			Adaptive Genetic Algorithm Based Artificial Neural Network for Software Defect Prediction
		
		
* 
	
		A Framework of Software Measurement
		
			HZuse
		
		
			1998
			Walter de Grutger Publish
		
	
* 
	
		
			LRosenberg
		
		
			SBSheppard
		
		Metrics in Software Process Assessment, Quality Assurance and Risk Assessment
				London
		
			October, 1994
		
	
	2nd International Symposium on Software Metrics


* 
	
		
			BWBoehm
		
	
		Software Engineering Economics
		
			1981
			Prentice-Hall
		
	
* 
	
		Assessing the Applicability of Fault-Proneness Models Across Object-Oriented Software Projects
		
			LCBriand
		
		
			WLMelo
		
		
			JWu
		
		
			St
		
	
		IEEE Trans. Software Eng
		
			28
			7
			
			July 2002
		
	
* 
	
		A Machine Learning Based Model for Software Defect Prediction
		
			OKutlubay
		
		
			ABener
		
		
			2005
		
		
			Boaziçi University, Computer Engineering Department
		
	
	working paer


* 
	
		A study on software reliability prediction based on support vector machines
		
			Bo
		
		
			XiangYang
		
		
			Li
		
	
		The Annual IEEE International Conference on Industrial Engineering and Engineering Management
				
			2-4 Dec. 2007
			
		
* 
	
		Intelligence System for Software Maintenance Severity Prediction
		
			ParvinderSandhu
		
		
			SunilSingh
		
		
			HardeepKumar
		
		
			Singh
		
	
		Journal of Computer Science
		
			3
			5
			
			2007
		
	
* 
	
		Applying machine learning to software fault-proneness prediction
		
			Gondra
		
	
		Journal of Systems and Software
		
			81
			2
			
			Feb. 2008
		
	
* 
	
		Early software reliability prediction with extended ANN model
		
			QHu
		
		
			YSDai
		
		
			MXie
		
		
			SHNg
		
	
		Proceedings of the 30th Annual International Computer Software and Applications Conference (COMPSAC'06)
				the 30th Annual International Computer Software and Applications Conference (COMPSAC'06)
		
			September 2006
			2
			
		
* 
	
		Object-Oriented software engineering: Measuring and controlling the development process
		
			FB EAbreu
		
		
			RCarapuca
		
	
		Proceedings of the 4th International Conference on Software Quality
				the 4th International Conference on Software Quality
		
			1994
			186
		
	
* 
	
		A hierarchical model for Object-Oriented design quality assessment
		
			JBansiya
		
		
			CGDavis
		
	
		ACM Transactions on Programming Languages and Systems
		
			128
			
			August 2002
		
	
* 
	
		Cohesion and reuse in an Object-Oriented system
		
			BKKang
		
		
			JMBieman
		
	
		Proceedings of the ACM SIGSOFT Symposium on software reusability
				the ACM SIGSOFT Symposium on software reusabilitySeattle
		
			March 1995
			
		
* 
	
		Exploring the relationships between design measures and software quality in Object-Oriented systems
		
			LCBriand
		
		
			JWust
		
		
			JWDaly
		
		
			DVPorter
		
	
		The Journal of Systems and Software
		
			51
			
			May 2000
		
	
* 
	
		Design and code complexity metrics for Object-Oriented classes
		
			LEtzkorn
		
		
			JBansiya
		
		
			CDavis
		
	
		Object-Oriented Programming
		
			12
			10
			
			1999
		
	
* 
	
		
			MHalstead
		
		Elements of Software Sciencel
				New York, USA
		
			Elsevier Science
			1977
		
	
* 
	
		
			BHenderson-Sellers
		
		
			SoftwareMetrics
		
		
			1996
			Prentice-Hall
			UK
		
	
* 
	
		Maintenance metrics for the Object-Oriented paradigm
		
			WLi
		
		
			SHenry
		
	
		Proceedings of First International Software Metrics Symposium
				First International Software Metrics Symposium
		
			1993
			
		
* 
	
		A complexity measure
		
			TJMccabe
		
	
		IEEE Transactions on Software Engineering
		
			2
			
			December 1976
		
	
* 
	
		A software complexity model of Object-Oriented systems
		
			DPTegarden
		
		
			SDSheetz
		
		
			DEMonarchi
		
	
		Decision Support Systems
		
			13
			3
			
			1995
		
	
* 
	
		Object-Oriented Software Metrics
		
			MLorenz
		
		
			JKidd
		
		
			1994
			Prentice-Hall
			NJ, Englewood
		
	
* 
	
		A metrics suite for Object-Oriented design
		
			SRChidamber
		
		
			CFKemerer
		
	
		IEEE Transactions on Software Engineering
		
			20
			
			June 1994
		
	
* 
	
		Why the Virtual Nature of Software makes it Ideal for Search Based Optimization
		
			MHarman
		
	
		Fundamental Approaches to Software Engineering
				
			2010
		
	
* 
	
		
			CGrosan
		
		
			AAbraham
		
	
		Hybrid Evolutionary Algorithms: Methodologies, Architectures, and Reviews
				
			2011
			75
			
		
* 
	
		Issues in Validating Object-Oriented Metrics for Early Risk Prediction
		
			SaidaBenlarbi
		
		
			KhaledElEmam
		
		
			NishithGeol
		
	
		Cistel Technology
		
			210
			1999
		
	
* 
	
		
		Colonnade Road Suite
		
			204
		
	
* 
	
		Comparing Models for Identifying Fault-Prone Software Components
		
			FLanubile
		
		
			ALonigro
		
		
			GVisaggio
		
	
		Proceedings of Seventh International Conference on Software Engineering and Knowledge Engineering
				Seventh International Conference on Software Engineering and Knowledge Engineering
		
			June 1995
			
		
* 
	
		A Critique of Software Defect Prediction Models
		
			NEFenton
		
		
			MNeil
		
		
			IBellini
		
		
			PBruno
		
		
			DNesi
		
		
			Rogai
		
	
		IEEE Trans. Softw. Engineering
		
			25
			5
			
			1999
		
		
			University of Florence
		
	
* 
	
		Estimating Software Fault-Proneness for Tuning Testing Activities
		
			GiovanniDenaro
		
	
		Proceedings of the 22nd International Conference on Software Engineering
				the 22nd International Conference on Software EngineeringLimerick, Ireland
		
			June 2000
		
	
* 
	
		Prediction Model and the Size Factor for Fault-proneness of Object Oriented Systems
		
			ManasiDeodhar
		
		
			Dec. 2002
		
		
			Michigan Tech. University
		
	
	MS Thesis


* 
	
		Comparing Fault-Proneness Estimation Models
		
			PBellini
		
	
		10th IEEE International Conference on Engineering of Complex Computer Systems (ICECCS'05)
				
			2005
			
		
* 
	
		An Application of Zero-Inflated Poisson Regression for Software Fault Prediction. Software Reliability Engineering
		
			TMKhoshgoftaar
		
		
			KGao
		
		
			RMSzabo
		
	
		ISSRE 2001. Proceedings of 12th International Symposium
				
			27-30 Nov. 2001
			
		
* 
	
		Software defect prediction using supervised learning algorithm and unsupervised learning algorithm
		
			AChug
		
		
			SDhall
		
	
		Confluence 2013: The Next Generation Information Technology Summit
				
			Sept. 2013
			
		
* 
	
		A novel method for software defect prediction: Hybrid of FCM and random forest
		
			TPPushphavathi
		
		
			VSuma
		
		
			VRamaswamy
		
	
		Electronics and Communication Systems (ICECS)
				
			2014
		
	
* 
	
		International Conference
				
			5
			
		
* 
	
		Using Class Imbalance Learning for Software Defect Prediction
		
			SWang
		
		
			XYao
		
	
		Reliability, IEEE Transactions
				
			June 2013
			62
			
		
* 
	
		Finding Latent Code Errors via Machine Learning over Program Executions
		
			YBrun
		
		
			DEMichael
		
	
		Proceedings of the 26th International Conference on Software Engineering
				the 26th International Conference on Software Engineering
		
			May, 2004
		
	
* 
	
		A novel method for early software quality prediction based on support vector machine
		
			FXing
		
		
			PGuo
		
		
			MRLyu
		
	
		Software Reliability Engineering, International Symposium
				
			2005
			
		
* 
	
		
			KCai
		
	
		0n the Neura1 Network Approach in Software Reliability Modeling
				
			2001
			
		
* 
	
		Adapting multiple kernel parameters for support vector machines using genetic algorithms
		
			SARojas
		
		
			DFernandez-Reyes
		
	
		The 2005 IEEE Congress on Evolutionary Computation
				
			September, 2005
			1
			
		
* 
	
		
			ChunShan
		
		
			BoyangChen
		
		
			ChangzhenHu
		
		
			JingfengXue1
		
		
			NingLi
		
		SOFTWARE DEFECT PREDICTION MODEL BASED ON LLE AND SVM" Communications Security Conference; pp 1-5
				
			May 2014
			
		
* 
	
		A new metrics selection method for software defect prediction
		
			YeXia
		
		
			GuoyingYan
		
		
			XingweiJiang
		
		
			YanyanYang
		
	
		Progress in Informatics and Computing (PIC), International Conference
				
			May 2014
			
		
* 
	
		On the applicability of evolutionary computation for software defect prediction
		
			RMalhotra
		
		
			NPritam
		
		
			YSingh
		
	
		Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference
				
			Sept. 2014
			
		
* 
	
		Software defect prediction using supervised learning algorithm and unsupervised learning algorithm
		
			AChug
		
		
			SDhall
		
	
		Confluence 2013: The Next Generation Information Technology Summit
				
			Sept. 2013
			
		
* 
	
		Multilevel data preprocessing for software defect prediction
		
			GKArmah
		
		
			GuangchunLuo
		
		
			KeQin
		
	
		6th International Conference
				
			2013. Nov. 2013
			2
			
		
	ICIII)


* 
	
		Software defect prediction using two level data pre-processing
		
			RVerma
		
		
			AGupta
		
	
		Recent Advances in Computing and Software Systems (RACSS), International Conference
				
			April 2012
			
		
* 
	
		Software Defect Prediction Tool based on Neural Network
		
			MalkitSingh
		
		
			DalwinderSingh Salaria
		
	
		International Journal of Computer Applications
		
			70
			
			May 2013
		
	
* 
	
		A Hybrid Model of Soft Computing Technique for Software
		
			AnuragShrivastava
		
		
			VishalShrivastava