Xaren Xaren - 3 months ago 17
C# Question

Can Windows ocr recognize custom symbols/font?

I'm developing for UWP, Windows has an OCR engine: Windows.Media.Ocr

My question is: someone knows if the Windows OCR can be trained to recognize new characters or use a custom font? if yes, how i can do this?

what i want to achieve is to recognize non alphabetical symbols. I want to recognize per example the character ⌰ (unicode: U+2330) or ⌖ (U+2316).

The characters that i want to recognize are symbols not for any language.

Answer

I used Windows.Media.Ocr library in my WUP application and here some test result with different font


Arial

Font - Arial
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - Hello @ World
Accuracy - 100%

enter image description here


Agency FB

Font - Agency FB
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - Hello World
Accuracy - 84.6% (Missed - @ symbol and one space)

enter image description here


Modern

Font - Modern
Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - Hello @ world
Accuracy - 92.3% (W recognised as w)

enter image description here


Lucida Handwriting

Font - Lucida Handwriting Test Words - Hello @ World
Expected Result - Hello @ World
Original Result - HeUe@ worw
Accuracy - 46.1%
enter image description here


Update [1]

Arial Unicode MS

Font - Arial Unicode MS
Test Symbols - ⌰ ⌖
Expected Result - ⌰ ⌖
Original Result - (Unable to Recognize)
Accuracy - 0%

enter image description here


Update 2

enter image description here

Hope this helpful to you.

Comments