IJCATR Volume 7 Issue 7

Hangul Recognition Using Support Vector Machine

Rahmatina Hidayati, Moechammad Sarosa , Panca Mudjirahardjo
10.7753/IJCATR0707.1007
keywords : Support Vector Machine; SVM; Kernel Polynomial; Kernel Linear; Kernel Radial Basis Function; Hangul

PDF
The recognition of Hangul Image is more difficult compared with that of Latin. It could be recognized from the structural arrangement. Hangul is arranged from two dimensions while Latin is only from the left to the right. The current research creates a system to convert Hangul image into Latin text in order to use it as a learning material on reading Hangul. In general, image recognition system is divided into three steps. The first step is preprocessing, which includes binarization, segmentation through connected component-labeling method, and thinning with Zhang Suen to decrease some pattern information. The second is receiving the feature from every single image, whose identification process is done through chain code method. The third is recognizing the process using Support Vector Machine (SVM) with some kernels. It works through letter image and Hangul word recognition. It consists of 34 letters, each of which has 15 different patterns. The whole patterns are 510, divided into 3 data scenarios. The highest result achieved is 94,7% using SVM kernel polynomial and radial basis function. The level of recognition result is influenced by many trained data. Whilst the recognition process of Hangul word applies to the type 2 Hangul word with 6 different patterns. The difference of these patterns appears from the change of the font type. The chosen fonts for data training are such as Batang, Dotum, Gaeul, Gulim, Malgun Gothic. Arial Unicode MS is used to test the data. The lowest accuracy is achieved through the use of SVM kernel radial basis function, which is 69%. The same result, 72 %, is given by the SVM kernel linear and polynomial.
@artical{r772018ijcatr07071007,
Title = "Hangul Recognition Using Support Vector Machine",
Journal ="International Journal of Computer Applications Technology and Research(IJCATR)",
Volume = "7",
Issue ="7",
Pages ="281 - 285",
Year = "2018",
Authors ="Rahmatina Hidayati, Moechammad Sarosa , Panca Mudjirahardjo"}
  • The paper proposes to recognize Hangul image into Latin text
  • Preprocessing consists of binarization, segmentation thinning
  • The feature extraction used in the current research is chain code
  • Recognition process uses Support Vector Machine (SVM).