n recent years, handwritten Marathi character recognition has grabbed a lot of attention as Marathi being primary official language in Maharashtra has wide application in areas like passport, railways, postal address reading etc. Handwritten character recognition [4] consist of six main steps.
Handwritten drawn of character ii.
Training on handwritten characters. iii.
Testing on handwritten characters. ? Pixel rating an Image ? Image smoothing ? Features extraction. ? Image segmentation. ? Patterns matching ? Result display as per their % of pattern matched.
Earlier, traditional classifiers such as Nearest Neighbor (NN), Hidden Markov Models (HMM) etc. were adopted for character recognition, however they exhibit certain limitations. Machine learning (ML) algorithms [6] provide a promising alternative in character recognition based on the feature set given to them. Each character image sample can be expressed in terms of some 26
Author ? : M.E (Scholar), Prof.Ram Meghe Institute of Research & Technology, Badnera. E-mail : [email protected] Author ? : Associate Prof. Prof. Ram Meghe Institute of Research & Technology, Bandera. E-mail : [email protected] Computer Science & Information Technology (CS & IT) quantifiable attributes called features. A variety of features can be extracted such as primitives, profiles etc. Multi Layer (ML) algorithm is then trained with this list of measured features, so that it maps these input features onto a class among certain predefined classes [1,2]. Then the classifier can be used to determine the class of unknown samples used for testing.
ii.
Character recognition task has been attempted through many different approaches like template matching, statistical techniques like NN, HMM, Quadratic Discriminant function (QDF) etc. Template matching works effectively for recognition of standard fonts, but gives poor performance with handwritten characters and when the size of dataset grows. It is not an effective technique if there is font discrepancy [4]. HMM models achieved great success in the field of speech recognition in past decades, however developing a 2-D HMM model for character recognition is found difficult and complex [5]. NN is found very computationally expensive in recognition purpose [6]. N. Araki et al. [7] applied Bayesian filters based on Bayes Theorem for handwritten character recognition. Later, discriminative classifiers such as Artificial Neural Network (ANN) and Support Vector Machine (SVM) grabbed a lot of attention. In [3] G. Vamvakas et al. compared the performance of three classifiers: Naive Bayes, K-NN and SVM and attained best performance with SVM. However SVM suffers from limitation of selection of kernel. ANNs can adapt to changes in the data and learn the characteristics of input signal [8].Also, ANNs consume less storage and computation than SVMs [9]. Mostly used classifiers based on ANN are MLP and RBFN. B.K. Verma [10] presented a system for HCR using MLP and RBFN networks in the task of handwritten Hindi character recognition. The error back propagation algorithm was used to train the MLP networks. J. Sutha et al. in [11] showed the effectiveness of MLP for Tamil HCR using the Fourier descriptor features. R. Gheroie et al. in [12] proposed handwritten Farsi character recognition using MLP trained with error back propagation algorithm. Computer Science & Information Technology (CS & IT) 27 similar shaped characters are difficult to differentiate because of very minor variations in their structures. In [13] T. Wakabayashi et al. proposed an F-Ratio (Fisher Ratio) based feature extraction method to improve results of similar shaped characters. They considered pairs of similar shaped characters of different scripts like English, Arabic/Persian, Devnagri, etc. and used QDF for recognition purpose. QDF suffers from limitation of minimum required size of dataset. F. Yang et al. in [14] proposed a method that combines both structural and statistical features of characters for similar handwritten Chinese character recognition. As it can be seen that various feature extraction methods and classifiers have been used for character recognition by researchers that are suitable for their work, we propose a novel feature set that is expected to perform well for this application. In this paper, the features are extracted on the basis of character geometry, which are then fed to each of the selected ML algorithms for recognition of SSHMC.
iii.
In this paper, we proposed a novel method based on combinations on pixel rating an image, feature extractions and Image pattern matching. Proposed method gives considerable expected outputs than previous proposed character recognition algorithms like HMM, NN, ML etc. Proposed method consists of following phases. ----------------- ----------Eq.3. From the study of Literature survey and proposed method, we conclude that, proposed method gives considerable and expected accuracy than previous character recognitions techniques like HMM, ML, NBP etc. Experiment results shows that, proposed method achieved an accuracy nearer to 98%provided no. of training samples per standard Marathi images should be maximum as possible as.
In the process of recognizing handwritten character, human brains may fails that's why to keep an expectations to achieve 100% accuracy is not expectable. A future work is needed to correctly analyze segments patterns and fuzzy rules mentioned in an equation 3.5.1 to achieve better accuracy which should be independent of no. of training set images.
Input handwritten image | Input binary image |
Input binary image | Pixel Rate Image |
D D D D ) |
( |
Handwritten Hindi Character recognition Using Multilayer Perceptron and Radial Basis Function Neural Networks. Proceedings of IEEE International conference on Neural Networks, (IEEE International conference on Neural Networks) 1995. 4 p. .
Performance evaluation of pattern classifiers for handwritten character recognition. International Journal on Document Analysis and Recognition (IJDAR) 2002. 4 p. .
Comparison of SVM and ANN performance for handwritten character classification. Proceedings of the IEEE 12th Signal Processing and Communications Applications Conference, (the IEEE 12th Signal Processing and Communications Applications Conference) 2004. p. .
An Improved Method for Similar Handwritten Chinese Character Recognition. IEEE Third International Symposium on Intelligent Information Technology and Security Informatics (IITSI), 2010. p. .
Figure 2. Misclassification Rate of Bayesian Network and C4.5 with FULL,CFS and CON Features Sets 34 Computer Science & Information Technology. IEEE Ninth International Conference on Document Analysis and Recognition(ICDAR ), 2007. CS & IT. 2 p. . (An Efficient Feature Extraction and Dimensionality Reduction Scheme for Isolated Greek Handwritten Character Recognition)
An HMMRF-Based Statistical Approach for Off-line Handwritten Character Recognition. IEEE Proceedings of the 13 th International Conference on Pattern Recognition, 1996. 2 p. .
Offline handwritten character recognition of Gujrati script using pattern matching. IEEE 3rd International Conference on Anti-counterfeiting, Security, and Identification in Communication, 2009. p. .
Neural Network Based Offline Tamil Handwritten Character Recognition System. IEEE International Conference on Computational Intelligence and Multimedia Applications, 2007. 2 p. .
A Statistical Approach for Handwritten Character Recognition Using Bayesian Filter. IEEE 3rd International Conference on Innovative Computing Information and Control, 2008. p. .
An Overview Of Character Recognition Focused On Off-line Handwriting. IEEE Transactions on Systems, Man, and Cybernetics 2001. 31 p. .
WEKA 3: Data Mining With Open Source Machine Learning Software. http://www.cs.waikato.ac.nz/ml/weka/ JAVA 1998. 16. Robotics Lab, Deptt of Computer Science, Stanford University (An early draft of a proposed textbook)
Handwritten Farsi Character Recognition using Artificial Neural Network. International Journal of Computer Science and Information Security 2009. 4.
Fratio Based Weighted Feature Extraction for Similar Shape Character Recognition. IEEE 10th International Conference on Document Analysis and Recognition, 2009. p. .