To search, Click
below search items.
|
|
All
Published Papers Search Service
|
Title
|
Methods of Arabic Language Baseline Detection ? The State of Art
|
Author
|
Atallah AL-Shatnawi, Khairuddin Omar
|
Citation |
Vol. 8 No. 10 pp. 137-143
|
Abstract
|
Preprocessing is the most important stage in the Arabic OCR system; it has a direct effect on the reliability and efficiency of the segmentation and feature extraction stages. It is worth mentioning that Arabic language is cursively written, and its characters have between 2 to 4 shapes. An Arabic word likely consists of two or more characters which are connected through an imaginary line called baseline. Detecting baseline is one of the main majorities in preprocessing Arabic OCR system. The baseline can be used for both skew normalization and character segmentation. This paper aims to provide a comprehensive review of the methods proposed by researchers to detect Arabic baseline. The Arabic baseline detection methods are categorized into four methods: (a) based on horizontal projection methods, (b) based on word skeleton method, (c) based on contour tracing method, and (d) based on principle component analysis method. Each of these methods has its own advantages and drawbacks.
|
Keywords
|
Preprocessing, OCR, Handwritten, Offline, Arabic Baseline
|
URL
|
http://paper.ijcsns.org/07_book/200810/20081021.pdf
|
|