The OpenITI is involved in a number of exciting projects. See a list of our projects (past and present) below:
The Automatic Collation for Diversifying Corpora (ACDC) project, funded by a Level III Digital Humanities Advanced Grant from the National Endowment for the Humanities, aims to significantly improve the accuracy of handwritten text recognition (HTR) for Arabic-script manuscripts. Our team will develop a collation tool to automatically create large amounts of training data from existing digital texts and manuscript images without time-consuming human annotation of individual manuscripts.
OpenITI has begun piloting the production of the first digital publications of Persian and Arabic works, taken straight from their original manuscript form into a digital publication without a print intermediary.
Funded through two grants from The Andrew W. Mellon Foundation, Phase One of the Open Islamicate Texts Initiative Arabic-script OCR Catalyst Project (OpenITI AOCP) is the first undertaking of its kind to tackle the technical and organizational barriers that historically have stymied the development of Arabic-script OCR and digital text production for Islamicate Studies.
Descriptions of many projects that are (or have been) affiliated with OpenITI