Do not let its name fool you – VietOCR.NET is not limited to recognizing Vietnamese text from a scanned document and turning it into searchable text; actually, it works with any language that Tesseract can work with. This open-source tool is actually a GUI for Tesseract, an open-source OCR engine that supports dozens of languages – all you have to do is download the corresponding module from the program and use it for the language you need.
The fact that its developer is apparently a Vietnamese person, that the program comes with Vietnamese and English as the only supported languages, and that it is presented as an OCR tool for the Vietnamese language should not stop you from trying this free OCR utility. Tesseract recognizes more than 100 languages, including all the most widely used. Note, however, that to turn any scanned document into PDF files – regardless of the language – you will need to have GPL Ghostscript installed on your computer.
Praising the quality of the text rendered by the OCR engine would be praising the many qualities of Tesseract (the most renowned open-source OCR engine out there), which I think it is outside the scope of this review. As a GUI for that engine, VietOCR.NET is also a neat piece of work, though it still needs polishing certain areas, such as the OCR language selection, whose drop-down menu mixes languages and even disappears from view sometimes. more
Comments