Posts - Page 3 of 3
From Handwritten to Metal Type and Back to Handwritten: The Trajectory of Nasta’līq Printing in the 19th Century Islamicate World
In the course of our work locating and analyzing typefaces and typeface frequency across the last two centuries of Arabic script mechanical print we made some fascinating historical discoveries, including some striking typefaces whose existence we would not have previously suspected. Among these…
Challenges of Layout Analysis across Arabic-Script Training Data
Layout Analysis is the process of identifying regions (e.g., title, body text, footnotes, etc.) on a page of text before sending it through the OCR engine. Preparing documents to train our OCR models involves several distinct steps, including semantic annotation, fixing segmentation errors, and editing faulty transcriptions. eScriptorium allows users to associate specific labels with regions…


