Dhivehi OCR: Character Recognition of Thaana Script using Machine- Generated Text and Tesseract OCR Engine

Authors

  • Ahmed Ibrahim

DOI:

https://doi.org/10.55712/ijsri.v1i1.23

Keywords:

Dhivehi OCR, Thaana Script, Optical Character Recognition, Tesseract OCR

Abstract

This paper provides technical aspects and the context of recognising Dhivehi characters using Tesseract OCR Engine, which is a freely available OCR engine with remarkable accuracy and support for multiple languages. The experiments that were conducted showed promising results with 69.46% accuracy and, more importantly, highlighted limitations that are unique to Dhivehi. These issues have been discussed in detail and possible directions for future research are presented.

Published

20.03.2018

How to Cite

Ahmed Ibrahim. (2018). Dhivehi OCR: Character Recognition of Thaana Script using Machine- Generated Text and Tesseract OCR Engine. International Journal of Social Research & Innovation, 1(1). https://doi.org/10.55712/ijsri.v1i1.23