# I. I ntroduction s per high pace rise in software applications and major dependency on it, the fault prediction has become one of the inevitable parts of software development life cycle (SDLC) that can play significant role in reducing the probability of software failure. Software defect prediction (SDP) can be performed while planning to identify fault-prone modules in software product that as a result can provide the insight to the need for increased quality of monitoring during software development. In addition, it can also facilitate necessary approaches to incorporate certain proper fault verification schemes leading to enhanced software quality [1,2,3,4] and reliability. SDP can be functional based on certain software metrics [3,4,5], such as source code changes, previous defects, etc. In fact software metrics are the quantitative data that are employed for characterizing the properties of source code and can be significant for predicting software quality. The efforts made through many generations have facilitated a number of schemes to mitigate defects, but the continuation of researches still indicates towards search for certain optimal SDP solution to ensure optimal performance, reliability, cost optimization and minimal maintenance. A number of efforts have been made for SDP using machine learning and neural network [6,7,8,9,10], clustering techniques, statistical method, mining and random forest [44,45,50] etc. In recent years, majority of software are being developed based on Object-Oriented (OO) paradigm. Thus, the quality of the software can be optimally assessed by employing software metrics, such as Abreu MOOD metric suite [11], QMOOD metrics suite [12], Bieman and Kang [13], Briand et al. [14], Etzkorn et al. [15], Halstead [16], Henderson-sellers [17], Li and Henry [18], McCabe [19], Tegarden et al. [20], Lorenz and Kidd [21] and CK metric [22] suite. These software metrics plays significant role in assessing the quality of software such as precision, accuracy, fault-resilience and sensitivity etc. The significance of these object oriented software metrics lies in their capability to predict the software quality in terms of adaptability, functionality, usability, portability, supportability, reliability and cost effectiveness. Predominantly two data driven algorithms, support vector machine (SVM) and artificial neural network (ANN) algorithms have been employed for fault detection. ANN approach functions on the basis of the human brain behaviorand possesses neurons and directed edges with certain weights existing between input and output layers. ANN employs output as the input so as to learn complex non-linear input-output relationship and can be stated to be a complex nonlinear mapping model between input and output layer. The processes in ANN comprise data sets to enhance the weight parameters, risk minimization scheme for stopping training as soon as the learning error enters in expected margin level. In fact, ANN has been employed in numerous utilities, but still it possesses certain limitations in terms of slow learning ability, local minima etc and hence require further optimization to achieve certain optimal SDP efficiency and performance. Thus, there is the requirement of further optimization of ANN approaches to accomplish a potential SDP solution. Some researches [23,24] advocate the implementation of evolutionary computing techniques for SDP optimization. This paper proposes a novel evolutionary computing based enhanced ANN algorithmnamed Hybrid Evolutionary Computation # Year ( ) # D based Neural Network (HENN) for defect prediction and classification. HENN system employs Adaptive Genetic Algorithm (A-GA) for optimal weight estimation so as to enhance weight update and learning efficiency of the ANN.In this paper, the object oriented software metrics, CK metrics [22] have been employed as a fault classification data and the respective performance has been analyzed using confusion matrix. The remaining sections discusses, related work in Section II, problem definition is briefed in Section III, which has been followed by proposed research discussion in Section IV. Section V presents the results and analysis and conclusion has been discussed in Section VI. # II. # Related Work The emergence of software applications and associated need of quality and reliability has motivated software practitioners as well as academia to develop certain novel scheme for defect prediction.With an objective to examine the relation between software metrics and associated faults some initiatives were made in [25,26,27,28,29,30] where machine learning mechanism were used for fault detection. With an enthuse to compare the performance of varied other schemes such as decision trees, naïve Bayes, and 1rule [31] performed fault detection using NASA MDP project. Chug et al [32] performed data mining based fault estimation using conventional J48, Random Forest, and Naive Bayesian Classifier (NBC) schemes but still couldn't employ the benefits of advanced classification schemes. With an objective to enhance conventional schemes Pushphavathi et al [33] introduced hybrid scheme of random forest (RF) and Fuzzy C Means (FCM) clustering. Then while, these systems were found limited for unbalanced data sets, which motivated author [34] to propose an approach called AdaBoost.NC that explored varied kinds of class imbalance learning schemes comprising resampling techniques, threshold moving, and ensemble algorithms. With an objective to explore SVM optimization in [35,36] a dynamic SVM model was proposed for fault detection in source code using with error data and faulty code execution. Researcher in [37] developed an ANN based SDP system. This is the matter of fact that SVM refers the functional paradigm of single layer perceptron's NN which on addition with kernels behaves like multilayered perceptron's [38]. Till available systems based neural network with conventional learning and weight estimation suffers from local optima and convergence issue, which has not been discussed dominantly. On contrary, these days the software are developed and examined for faults using object oriented software metrics which even being significant has not been explored in depth to ensure optimal solution for reliability oriented defect prediction. This paper intends to provide an optimal solution for software defect prediction using evolutionary computing based neural network for efficient fault classification. # III. # Problem Definition In software development life cycle the reliability assurance is of great significance and to achieve it, the defect prediction is an inevitable need. The defect prediction can be performed using software metrics data, in which either it is predicted whether the code is defective or not or the magnitude of the probable defect and its severity is examined. In this research work, the predominant questions are whether evolutionary computing schemes, specifically GA can optimize neural network based artificial intelligence (AI) to achieve optimal software defect prediction. An another question that this research paper considers is that whether the conventional Genetic Algorithm can be further enhanced to deal with a scenario where multiple chromosomes are having similar fitness, and how this enhancement would perform classification or fault prediction?. In order to explore the answers of this significant question, in this paper it has been intended to optimize ANN learning and respective optimal weight estimation using GA, which has further being optimized to behave as an Adaptive GA (A-GA) scheme that ensures adaptive GA parameters (Crossover and mutation) estimation. Here, considering requirements of object oriented software metrics, CK metrics [22] have been considered that characterizes overall features of software in terms of varied component features. In this paper, the key software metrics considered are WMC, NOC,DIT,CBO,RFC,LCOM, which can be considered for defect prediction in certain class or data model. Based on the proposed model, the defect can be predicted which can be useful for ensuring quality and reliability of the software product. Given a training data, certain learning model can be developed that can classify the data for its faulty or non-faulty status. The artificial intelligence technique neural network has been used extensively so far for classification utilities, but being conventional these approaches do suffer from local minima and weight update issues. Thus, to enhance the systems, certain global optimization schemes like evolutionary computing can be considered. Since Particle Swarm Optimization suffers due to optimal minima and convergence issues, here we proposed an adaptive GA (A-GA) for ANN weight estimation where the weights are estimated dynamically in each iteration. Here, mean square error has been considered as the fitness value for A-GA. Further, the GA parameters such as crossover probability and mutation probability can be adaptively updated to make the overall system more robust and efficient. The optimization of ANN with A-GA can make it more effective and can be a potential candidate for fault detection in SDLC applications. The # Global Journal of C omp uter S cience and T echnology Volume XV Issue I Version I Year ( ) D performance evaluation for these two approaches can be done in terms of accuracy, precision, recall, specificity etc. # IV. # Proposed System This section discusses the proposed evolutionary computing based hybrid neural network (HENN) for software defect prediction. HENN: Evolutionary Computing Based Neural Network for Software Defect Prediction Neural networks (NN) have seen an explosion of interest over the years, and are being successfully applied across a range of problem domains. Indeed, anywhere dealing with the problem of classification and prediction, neural networks are being used. For software defect prediction, ANN can be employed with learning approaches such as Gradient Descent (GD), Gauss Newton, and Levenberg Marquardt (LM) etc. Unlike conventional approach, in this paper, we have proposed an evolutionary computing technique called Adaptive Genetic Algorithm for ANN learning optimization and weight estimation, which has been further employed for fault prediction. Here, we intend to find relation between object oriented software metrics and fault prone classes and six CK metrics; WMC, NOC, DIT, RFC, CBO, LCOM have been taken as independent variable while fault data has been considered as dependent data. To design ANN, six inputs have been considered which do receive CK metrics individually as input having multiple classes, as per benchmark data (here PROMISE data). In this paper we have considered 8 hidden layers. Since, in the proposed SDP model, only FAULTY and NON-FAULTY are the results expected for prediction, therefore only one output node. The overall design of the proposed ANN model can be presented as follows: The above mentioned figure illustrates the architecture of ANN containing three layers i.e., input layer, hidden layer and output layer. In the considered ANN model, the linear activation function has been used for input layer i.e., the output of the output layer is treated as input of the input layer(?? ?? = ?? ?? ). Further, the sigmoid function has been employed for hidden layer?? ? . Hence, the result of the hidden nodes ?? ? with the fed input of ?? ? is estimated mathematically as ?? ? = 1 1+?? ??? ? and final outcome of output nodes O o is presented mathematically by?? ?? = 1 1+?? ??? ?? . In general, ANN is represented by a function ?? ? = ð??"ð??"(??, ??) where ??? represents the output vector and?? and ?? are the weight vector and the input vector respectively. In process, the weight factor ??is updated in each iterations to reduce Mean Square Error (MSE), which is estimated as follows: ?????? = 1 ?? ? ??? ?? ? ? ?? ?? ? 2 ?? ??=1 (1) Where ?? represents the actual output while the expected output is given by?? ?? ? . In order to process the datasets using ANN, at first the normalization of data is required. A discussion of the proposed data normalization technique in this paper is given as follows: # a) Data normalization In the proposed model, initially the normalization has been performed before data processing that strengthens the system for better readability and defect prediction. Here the data normalization has been done over the range of [0, 1] for adjusting the defined range of input feature value and avoid the saturation of neurons. A number of schemes such as Min-Max normalization, Z-Score normalization and decimal scaling can be employed for the purpose of data normalization. In this paper, Min-Max normalization approach has been used that performs a linear transformation on the original data and maps each of the actual data ?? ?? of attribute ?? to normalized value ??? ?? that exists in the range of [0, 1]. The Min-Max based normalized data has been obtained by the following expression: ????????????????????(?? ?? ) = ?? ?? " = ?? ?? ??????? (??) ?????? (??)??????? (??)(2) Where ??????(??) and ??????(??) represent the maximum and minimum value of the attribute ?? respectively. Performing data normalization the ANN model has been employed for fault classification and SDP functions. In ANN based artificial intelligence systems, the efficient weight estimation is of great significance and till existing approaches have explored techniques such as Gauss Newton, Gradient descent, Levenberg Marquardt etc. Unfortunately these approaches couldn't be enhanced by scientific society to make weight estimation effective by means of certain global optimization techniques such as Genetic Algorithm. # Efficient weight estimation during ANN learning can # Global Journal of C omp uter S cience and T echnology Volume XV Issue I Version I Year ( ) D make classification optimal. This requirement motivated us to employ genetic algorithm for dynamic weight estimation during ANN learning. A brief discussion of the proposed Adaptive Genetic Algorithm (A-GA) is given in the following section. # b) Adaptive Genetic Algorithm(A-GA) Genetic Algorithm (GA) is an adaptive search method for finding optimal or near optimal solutions, premised on the evolutionary ideas of natural selection. The fundamental concept of GA is emphasized on simulating processes in the natural system required for evolution, distinctively those that consider the Charles Darwin principles representing the terms of the survival of the fittest. Considering procedural flow, GA at first generates the initial population arbitrarily, where population refers a set of solutions. These solutions are nothing else but a chromosome that possesses a form of binary strings where all the comprising parameters are supposed to be encoded. Generating the population, GA estimates the fitness function of individual chromosome. Here the fitness function states toward a user-defined function that returns the evaluation results of each chromosome, thus a higher fitness value means its chromosome is a dominant gene. As per retrieved fitness values, offspring are generated using genetic operators-crossover and mutation. Applying these genetic operators the generations of the population are repeated iteratively until the stopping criteria are satisfied and an optimal solution is achieved. As illustrated in Figure -1, in this paper, the proposed HENN model comprises?? ? ? ? ?? network configuration with ?? input layer, ? hidden layer and ?? output layer or neurons. In the proposed ANN model, all the six considered CK metrics or feature vector are fed as input to the individual input node, where each feature vector metrics accompanies the number of classes available in datasets. Considering Figure-1 and relevant network configuration, there is N weight required to be estimated. Mathematically, the number of weight vectors is: ?? = (?? + ??) * ?(3) Here, the individual weight, which is considered as gene in the chromosomes of the A-GA, is a real number. Considering the gene length or the number of digits be??. Then the length of the chromosome ?? ????????? can be estimated by the following expression: ?? ????????? = ?? * ?? = (?? + ??) * ? * ??(4) These all chromosomes are considered as the population of the genetic algorithm. In the proposed model to estimate the fitness value of the individual chromosome, the weights are required to be extracted from the individual chromosome. In our proposed model, the weights (?? ?? ) are estimated by the following expression: ?? ?? = ? ? ? ? ? ??ð??"ð??" 0 ? ?? ???? +1 < 5 ? ?? ???? +2 * 10 In order to process the Adaptive Genetic Algorithm (A-GA), the fitness values for each chromosome are required to be estimated. The fitness generation algorithm for the proposed A-GA system is given in Figure -2 ?? ?? = 1 ?? ?? = 1 ? ? ?? ?? ?? =?? ?? =1 ?? Figure 2 : Algorithm for Fitness generation using A-GA This is the matter of fact that the evolutionary computing scheme named Genetic Algorithm has established itself as a potential optimization technique for various application scenarios, still this approach possess scopes for further optimization that specifically depends on the working environment. In this paper, there might be the possibility that after every generation to achieve optimal fitness, certain new population would be generated and thus the processing data might be increased after each iterations, thus resulting into certain # Global Journal of C omp uter S cience and T echnology Volume XV Issue I Version I Year ( ) D restraints such as premature convergence caused due to local optima and low convergence speed, which is common in other evolutionary techniques such as Particle Swarm Optimization. In order to alleviate these issues, the parameters like cross over probability (?? ?? ) and mutation probability (?? ?? ) can be made dynamic and weight adaptive. In addition, such novelty can deal with a common scenario, where there is the possibility of multiple chromosomes having similar fitness value, causing degraded classification accuracy. Taking into consideration of these all factors and motivations, in this paper a weight adaptive genetic algorithm (A-GA) has been developed where the genetic parameters (Crossover and mutation) are updated dynamically. In the proposed approach the parameters ?? ?? and ?? ?? have been dynamically updated by means of the following mathematical model: (?? ?? ) ??+1 = (?? ?? ) ?? ? ?? 1 * ?? 5 (6) (?? ?? ) ??+1 = (?? ?? ) ?? ? ?? 2 * ??5 Where (?? ?? ) ??+1 and (?? ?? ) ??+1 represent the updated probability of cross over and mutation, (?? ?? ) ?? and (?? ?? ) ?? are the current probability of cross over and mutation, ?? 1 and ?? 2 can be the positive constant and?? is the number of chromosome having same fitness value. Thus, implementing these discussed approaches, if the final output estimated is greater than 0.5, then the class is labeled as FAULTY otherwise NON-FAULTY. Figure -3 represents the overall process of software defect prediction using Adaptive Genetic Algorithm (A-GA). Weight Estimation: Obtained the weight vector W_kfor each chromosome as the input to hiddenlayer and hidden layer to output layer and thus the weight of input to hidden node and hidden node to output are estimated using equation 5. Fitness Estimation: On the basis of weights retrieved, the fitness value is estimated for each chromosome, where the proposed HENN intends to minimize the mean square error as defined in Figure 2. Ranking of Chromosomes: Perform the ranking of each chromosomes based on respective fitness value and substitute the chromosomes with minimum fitness value by the chromosomes with highest fitness value chromosome. Perform two point crossover processdynamically vary the GA parameters P c and P m till reaching optimal criteria using equation (6). In the simulation model, the initial P c and P m are 0.6 and 0.1 respectively and n signifies the number of chromosome having similar fitness value. Stopping Criteria: The developed system terminates once the 95% chromosomes in the gene pool accomplishes its unique fitness value and beyond this the fitness level of chromosomes gets saturated. Classify Faults: If the final weight is greater than 0.5, then the class is labeled as FAULTY otherwise NON-FAULTY. Confusion Matrix Generate the confusion matrix for each classes of OO-SM and classify fault/non-fault distribution for performance evaluation. Thus, employing the proposed HENN model, the fault classification and prediction has been done. The simulation, results and discussion is provided in the following section. V. # Result and Analysis This section discusses the research variables, simulation setups, results obtained and respective performance analysis. # a) Data collection In this paper, the CK metric suites have been employed which have been defined for varied objectives such as software fault detection/prediction, effort evaluation, re-usability and maintenance. Considering the robustness of CK metric suite [27], it has been used as object oriented software metrics which has been processed using Chidamber and Kemerer Java Metrics tool (CKJM) tool that extracts software metrics by executing byte code of compiled Java cases and assigns a definite weight of the comprising classes having feature vectors. In this paper, PROMISE fault benchmark data [39] and NASA MDP datasets [40] and PROMISE repository to evaluate the performance of the proposed fault prediction scheme. We intended to establish the relationship between Object-Oriented software metrics (OO-SM) and the fault proneness at the class level. In order to perform defect prediction using regression analysis paradigm, we have considered fault as a dependent variable while the CK metric as the independent variable. The predominant OO-SM metrics are given in Table-1 In our work, we have developed a function to explore the relation between Object-Oriented software metrics (OO-SM) (WMC, NOC, DIT, RFC, CBO and LCOM) and faults existing in class under consideration. The minimization of faults can be of great significance towards optimization of software equality, and to ensure optimal defect prediction, the fault has been derived as the function of software metrics as illustrated as follows: ???????????? = ð??"ð??"(??????, ??????, ??????, ??????, ??????, ????????) We used four public domain defect datasets from the PROMISE repository [9][39]. The considered data sets are JEdit, IVY, Ant and Camel which contain static code measures along with varied modules sizes, defective modules and defect rates. In our simulation model, the respective extracted weights and features of the data classes are taken as input. The datasets with respective classes or modules are given in Table-2. In this paper, HENN algorithm has been developed for simulation using MATLAB 2012b software tool having artificial intelligence and ANN toolboxes. The proposed models examined defect datasets and the FAULTY and NON FAULTY data have been classified. Here on the basis of FAULT distribution by proposed model, a confusion matrix has been generated that encompasses two rows and columns comprising true negatives, true positive, false negative and false positive variables. The respective values of True negatives (TN) refer the modules which are NON FAULTY or fault-free on the other hand, true positives (TP) represents for those modules which are classified as FAULTY. False negatives (FN) are those modules which are FAULTY and are classified incorrectly as NON FAULTY. Similarly, false positives (FP) modules are those modules which are faultless but are classified incorrectly as FAULTY. A matrix presentation of confusion matrix is given in Table 3. Generally, the meanings of the values of the binary variables are not needed to be defined, however, in our work, especially for performance assessment the variables have been labeled as positive and negative. The positive levels refer towards the results as FAULTY in that specific simulation scenario. In this paper, we have measured the performance of the proposed HENN SDP in terms of correctness, precision, F-measures, accuracy, recall, specification and cost factor analysis. A brief mathematical definition of these variables is given as follows: 91.8 --C4.5 [47] 88.39 --J 48 [47] 90.90 Levenberg-Marquardt-NN [47] 88.0 --NNEP-Evolutionary [43] 88.8 81.2 -PSO [46] 78.78 --PSO-NN [48] 97.75 --HENN SDP * 97.9 * 1 98.9 # *-The best performance of HENN Thus, the results obtained exhibit that the optimization made by means of Adaptive Genetic Algorithm has enhanced ANN learning for fault detection. The ultimate results obtained for HENN represents the most effective and optimal results as compared to other existing approaches, especially neural network based SDP models. The performance analysis for the proposed systems is given in Table-6. # Conclusion Software defect prediction has become an inevitable need for organizations to ensure quality and reliability of software products. The early defect prediction can facilitate managers to rectify and enrich reliability of product. Approaches such as machine learning and neural network have become eminent solution for training and classification of data and can be significant for defect prediction. However, these approaches need optimization in terms of weight update, parametric enhancement while performing defect prediction. The local minima and convergence issue of ANN can be significantly dealt with employing evolutionary computing schemes and the implementation of genetic algorithm can be the dominant candidate. In this paper, Adaptive Genetic Algorithm (A-GA) has been used for ANN optimization, where A-GA functions for optimal weight estimation. The proposed HENN model has been tested with PROMISE data sets, where the average accuracy for HENN was retrieved as 87.23 % while the best classification performance was observed with JEdit datasets where HENN exhibited 97.99% accuracy while ensuring 100% precision. Performance in terms of F-measure using HENN was obtained as 98.97%. The results also depicted 1![Figure 1 : ANN model for Defect prediction](image-2.png "Figure 1 :") 10 ???2??ð??"ð??" 5 <= ?? ???? +?? <= 9+?? ???? +2 * 10 ???2 +?? ???? +3 * 10 ???3 +?+?? (??+1)?? 10 ???2 Algorithm for Fitness EstimationInput:?? ? ?? = (?? 1?? , ?? ?? = ? ? ? ? ? ? ? ? ?? ???? +2 Phase-4: Estimate Root mean square error (RMSE) of ??ð??"ð??" 0 ? ?? ???? +1 < 5chromosome?? ???? ?? = ?? ?? =?? ?? =1 ???? ??Where ?? is the total number of training data setPhase-5: Estimate the fitnessvalue for chromosome?? ??* 10 ???2 + ?? ???? +3 * 10 ???3 + ? + ?? (??+1)?? 10 ???2 ??ð??"ð??" 5 <= ?? ???? +?? <= 9 + ?? ???? +2 * 10 ???2 + ?? ???? +3 * 10 ???3 + ? + ?? (??+1)?? 10 ???2 1[22])WMCOverall complexities of the methods incomprising classesNOCNumber of sub-classes subordinate to a classin the class hierarchyDITMaximum height of the class hierarchyCBONumber of other classes to which it is alliedwithRFCA set of approaches that can be executed inresponse to a message received by an objectof that classLCOMDissimilarity measurement of varied methods ina class using instanced attributes/variablesNOMNumber of methods (in a class)NOANumber of attribute (in a class)NOAINumber of attributes inherited by subclasses.NOMINumber of methods inherited by subclasses.Fan-inTotal number of local flows in certain processand data structures from where it retrievesinformationFan-outTotal number of local flows in certain processand data structures from where it retrievesinformationNOPMTotal number of private methods in a classNOPATotal number of private attribute in a classNOPMTotal number of public methods in a classNOPATotal number of public attribute in a classNLOCSize of program by counting the number oflines in the source code. 2PROMISEJEditIVYAntCamelNumber of492352744965modules 3PredictedPredicted DefectDefectiveFreeFAULTYTrue PositiveFalse NegativeNON-FAULTY False PositiveTrue Negative 4ConstructMathematicalDescriptionExpressionRecallTP/(TP+FN)Proportionofdefectiveunitscorrectly classifiedPrecisionTP/(TP+FP)Proportion of Unitscorrectly predictedas defectiveSpecification TN/(TN+FP)Proportionofcorrectly classifiednon defective units 62015YearVolume XV Issue I Version I( ) DGlobal Journal of C omp uter S cience and T echnologyTechniqueDataModules Accuracy PrecisionF-MeasureRecallSpecificationHENNJEdit4920.979910.989710.9756HENNIVY3520.88350.99360.93800.88830.3333HENNAnt7440.81450.93430.88670.84380.6346HENNCamel9650.811410.89520.81021 5 © 2015 Global Journals Inc. (US) 1 © 2015 Global Journals Inc. (US) Adaptive Genetic Algorithm Based Artificial Neural Network for Software Defect Prediction * A Framework of Software Measurement HZuse 1998 Walter de Grutger Publish * LRosenberg SBSheppard Metrics in Software Process Assessment, Quality Assurance and Risk Assessment London October, 1994 2nd International Symposium on Software Metrics * BWBoehm Software Engineering Economics 1981 Prentice-Hall * Assessing the Applicability of Fault-Proneness Models Across Object-Oriented Software Projects LCBriand WLMelo JWu St IEEE Trans. Software Eng 28 7 July 2002 * A Machine Learning Based Model for Software Defect Prediction OKutlubay ABener 2005 Boaziçi University, Computer Engineering Department working paer * A study on software reliability prediction based on support vector machines Bo XiangYang Li The Annual IEEE International Conference on Industrial Engineering and Engineering Management 2-4 Dec. 2007 * Intelligence System for Software Maintenance Severity Prediction ParvinderSandhu SunilSingh HardeepKumar Singh Journal of Computer Science 3 5 2007 * Applying machine learning to software fault-proneness prediction Gondra Journal of Systems and Software 81 2 Feb. 2008 * Early software reliability prediction with extended ANN model QHu YSDai MXie SHNg Proceedings of the 30th Annual International Computer Software and Applications Conference (COMPSAC'06) the 30th Annual International Computer Software and Applications Conference (COMPSAC'06) September 2006 2 * Object-Oriented software engineering: Measuring and controlling the development process FB EAbreu RCarapuca Proceedings of the 4th International Conference on Software Quality the 4th International Conference on Software Quality 1994 186 * A hierarchical model for Object-Oriented design quality assessment JBansiya CGDavis ACM Transactions on Programming Languages and Systems 128 August 2002 * Cohesion and reuse in an Object-Oriented system BKKang JMBieman Proceedings of the ACM SIGSOFT Symposium on software reusability the ACM SIGSOFT Symposium on software reusabilitySeattle March 1995 * Exploring the relationships between design measures and software quality in Object-Oriented systems LCBriand JWust JWDaly DVPorter The Journal of Systems and Software 51 May 2000 * Design and code complexity metrics for Object-Oriented classes LEtzkorn JBansiya CDavis Object-Oriented Programming 12 10 1999 * MHalstead Elements of Software Sciencel New York, USA Elsevier Science 1977 * BHenderson-Sellers SoftwareMetrics 1996 Prentice-Hall UK * Maintenance metrics for the Object-Oriented paradigm WLi SHenry Proceedings of First International Software Metrics Symposium First International Software Metrics Symposium 1993 * A complexity measure TJMccabe IEEE Transactions on Software Engineering 2 December 1976 * A software complexity model of Object-Oriented systems DPTegarden SDSheetz DEMonarchi Decision Support Systems 13 3 1995 * Object-Oriented Software Metrics MLorenz JKidd 1994 Prentice-Hall NJ, Englewood * A metrics suite for Object-Oriented design SRChidamber CFKemerer IEEE Transactions on Software Engineering 20 June 1994 * Why the Virtual Nature of Software makes it Ideal for Search Based Optimization MHarman Fundamental Approaches to Software Engineering 2010 * CGrosan AAbraham Hybrid Evolutionary Algorithms: Methodologies, Architectures, and Reviews 2011 75 * Issues in Validating Object-Oriented Metrics for Early Risk Prediction SaidaBenlarbi KhaledElEmam NishithGeol Cistel Technology 210 1999 * Colonnade Road Suite 204 * Comparing Models for Identifying Fault-Prone Software Components FLanubile ALonigro GVisaggio Proceedings of Seventh International Conference on Software Engineering and Knowledge Engineering Seventh International Conference on Software Engineering and Knowledge Engineering June 1995 * A Critique of Software Defect Prediction Models NEFenton MNeil IBellini PBruno DNesi Rogai IEEE Trans. Softw. Engineering 25 5 1999 University of Florence * Estimating Software Fault-Proneness for Tuning Testing Activities GiovanniDenaro Proceedings of the 22nd International Conference on Software Engineering the 22nd International Conference on Software EngineeringLimerick, Ireland June 2000 * Prediction Model and the Size Factor for Fault-proneness of Object Oriented Systems ManasiDeodhar Dec. 2002 Michigan Tech. University MS Thesis * Comparing Fault-Proneness Estimation Models PBellini 10th IEEE International Conference on Engineering of Complex Computer Systems (ICECCS'05) 2005 * An Application of Zero-Inflated Poisson Regression for Software Fault Prediction. Software Reliability Engineering TMKhoshgoftaar KGao RMSzabo ISSRE 2001. Proceedings of 12th International Symposium 27-30 Nov. 2001 * Software defect prediction using supervised learning algorithm and unsupervised learning algorithm AChug SDhall Confluence 2013: The Next Generation Information Technology Summit Sept. 2013 * A novel method for software defect prediction: Hybrid of FCM and random forest TPPushphavathi VSuma VRamaswamy Electronics and Communication Systems (ICECS) 2014 * International Conference 5 * Using Class Imbalance Learning for Software Defect Prediction SWang XYao Reliability, IEEE Transactions June 2013 62 * Finding Latent Code Errors via Machine Learning over Program Executions YBrun DEMichael Proceedings of the 26th International Conference on Software Engineering the 26th International Conference on Software Engineering May, 2004 * A novel method for early software quality prediction based on support vector machine FXing PGuo MRLyu Software Reliability Engineering, International Symposium 2005 * KCai 0n the Neura1 Network Approach in Software Reliability Modeling 2001 * Adapting multiple kernel parameters for support vector machines using genetic algorithms SARojas DFernandez-Reyes The 2005 IEEE Congress on Evolutionary Computation September, 2005 1 * ChunShan BoyangChen ChangzhenHu JingfengXue1 NingLi SOFTWARE DEFECT PREDICTION MODEL BASED ON LLE AND SVM" Communications Security Conference; pp 1-5 May 2014 * A new metrics selection method for software defect prediction YeXia GuoyingYan XingweiJiang YanyanYang Progress in Informatics and Computing (PIC), International Conference May 2014 * On the applicability of evolutionary computation for software defect prediction RMalhotra NPritam YSingh Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference Sept. 2014 * Software defect prediction using supervised learning algorithm and unsupervised learning algorithm AChug SDhall Confluence 2013: The Next Generation Information Technology Summit Sept. 2013 * Multilevel data preprocessing for software defect prediction GKArmah GuangchunLuo KeQin 6th International Conference 2013. Nov. 2013 2 ICIII) * Software defect prediction using two level data pre-processing RVerma AGupta Recent Advances in Computing and Software Systems (RACSS), International Conference April 2012 * Software Defect Prediction Tool based on Neural Network MalkitSingh DalwinderSingh Salaria International Journal of Computer Applications 70 May 2013 * A Hybrid Model of Soft Computing Technique for Software AnuragShrivastava VishalShrivastava