Detail publikace

Offline Handwritten Text Recognition Using Support Vector Machines

Originální název

Offline Handwritten Text Recognition Using Support Vector Machines

Anglický název

Offline Handwritten Text Recognition Using Support Vector Machines

Jazyk

en

Originální abstrakt

Comenia script is a novel handwritten text introduced at primary schools in the Czech Republic. This paper describes a method for handwritten text recognition (HWR) of this font. In particular it proposes a method for preprocessing and normalization of data and optical character recognition based on SVM classifier. We have trained and statistically evaluated several models, where we have focused on recognition of different styles of writing of the same characters - for the forensic purposes and identification of the author of a document. The best model has achieved 92.86 % accuracy without any further postprocessing, e.g. a spellchecker. We also proposed using more than one classification model for character recognition that has shown to increase accuracy when compared to a single model approach.

Anglický abstrakt

Comenia script is a novel handwritten text introduced at primary schools in the Czech Republic. This paper describes a method for handwritten text recognition (HWR) of this font. In particular it proposes a method for preprocessing and normalization of data and optical character recognition based on SVM classifier. We have trained and statistically evaluated several models, where we have focused on recognition of different styles of writing of the same characters - for the forensic purposes and identification of the author of a document. The best model has achieved 92.86 % accuracy without any further postprocessing, e.g. a spellchecker. We also proposed using more than one classification model for character recognition that has shown to increase accuracy when compared to a single model approach.

BibTex


@inproceedings{BUT133620,
  author="Martin {Rajnoha} and Radim {Burget} and Malay Kishore {Dutta} and Ashish {Issac}",
  title="Offline Handwritten Text Recognition Using Support Vector Machines",
  annote="Comenia script is a novel handwritten text introduced at primary schools in the Czech Republic. This paper describes a method for handwritten text recognition (HWR) of this font. In particular it proposes a method for preprocessing and normalization of data and optical character recognition based on SVM classifier. We have trained and statistically evaluated several models, where we have focused on recognition of different styles of writing of the same characters - for the forensic purposes and identification of the author of a document. The best model has achieved 92.86 % accuracy without any further postprocessing, e.g. a spellchecker. We also proposed using more than one classification model for character recognition that has shown to increase accuracy when compared to a single model approach.",
  booktitle="2017 4th International Conference on Signal Processing and Integrated Networks (SPIN)",
  chapter="133620",
  doi="10.1109/SPIN.2017.8049930",
  howpublished="electronic, physical medium",
  year="2017",
  month="february",
  pages="132--136",
  type="conference paper"
}