OCR for ancient Greek: Difference between revisions

From The Digital Classicist Wiki
Jump to navigation Jump to search
m (moved keying out of ocr list)
(Google Docs OCR)
Line 3: Line 3:
* [http://finereader.abbyy.com/ ABBYY FineReader] can be made to work with ancient Greek with extensive training
* [http://finereader.abbyy.com/ ABBYY FineReader] can be made to work with ancient Greek with extensive training
* The [http://gamera.informatik.hsnr.de/ Gamera] toolkit for analysing and scanning complex texts includes some experiments with polytonic Greek
* The [http://gamera.informatik.hsnr.de/ Gamera] toolkit for analysing and scanning complex texts includes some experiments with polytonic Greek
* Google Docs now allows you to have it do [http://googledocs.blogspot.com/2011/02/optical-character-recognition-ocr-in-34.html OCR on uploaded documents in a variety of languages], and you can get some results by specifying "Greek" and uploading a PDF (images seem not to work). Quality is about on the level of Google Books OCR of printed ancient Greek.


===alternatives===
===alternatives===

Revision as of 18:05, 1 March 2011

  • Federico Boschetti has been experimenting with adapting/training Google's OCR engine tesseract to ancient Greek texts: http://www.himeros.eu/ (related paper)
  • The commercial OCR software Anagnostis (€585) can handle ancient Greek, though apparently poorly
  • ABBYY FineReader can be made to work with ancient Greek with extensive training
  • The Gamera toolkit for analysing and scanning complex texts includes some experiments with polytonic Greek
  • Google Docs now allows you to have it do OCR on uploaded documents in a variety of languages, and you can get some results by specifying "Greek" and uploading a PDF (images seem not to work). Quality is about on the level of Google Books OCR of printed ancient Greek.

alternatives

  • AccessTEI is a service for members of the TEI for manual keying of texts which can handle ancient Greek

External links