Loading...

Page segmentation of Persian/Arabic printed text using ink spread effect

Shirali Shahreza, S ; Sharif University of Technology | 2006

284 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/SICE.2006.315618
  3. Publisher: 2006
  4. Abstract:
  5. Nowadays, OCR (Optical Character Recognition) is widely used for converting written documents to digital documents. One of the OCR phases is page segmentation. In page segmentation, text regions must be found in input image. In addition, text parts like text columns must be separated. In this paper, a new method for segmenting Persian/Arabic printed text is proposed. This method is based on Ink Spread Effect idea, a new idea that has particular features. Main features of Persian/Arabic scripts are considered in designing this method. This method is skew resistant and can segment text within frames and tables or regions with gray background. © 2006 ICASE
  6. Keywords:
  7. Digital documents ; Page segmentation ; Skew resistant ; Text columns ; Feature extraction ; Image reconstruction ; Image segmentation ; Text processing ; Optical character recognition
  8. Source: 2006 SICE-ICASE International Joint Conference, Busan, 18 October 2006 through 21 October 2006 ; 2006 , Pages 259-262 ; 8995003855 (ISBN); 9788995003855 (ISBN)
  9. URL: https://ieeexplore.ieee.org/document/4108835