Coptic SCRIPTORIUM (Sahidic Corpus Research: Internet Platform for Interdisciplinary multilayer Methods) is a project being developed by Caroline T. Schroeder (the University of the Pacific) and Amir Zeldes (Georgetown University) to create an interdisciplinary platform for search and computational research in texts in the Coptic language, particularly the Sahidic dialect.
As a fully open-source platform, the Coptic SCRIPTORIUM technologies and corpus will function as a collaborative environment for digital research by any scholars working in Coptic. It will provide tools to process texts, a searchable richly-annotated corpus of texts processed in part with such tools, a collaborative platform for scholars to use and contribute to the project, and research results generated from the tools and corpus.
The project has developed the first freely available part-of-speech tagger for any phase of the Egyptian language. It also has character converters to convert text from some older Coptic fonts into Coptic Unicode. It includes a search page for the Comprehensive Coptic Lexicon (v1.2) containing 11263 entries and 31847 forms of Egyptian-Coptic and Greek-Coptic datasets. The lexicon has been produced together with the Thesaurus Linguae Aegyptiae project ("Strukturen und Transformationen des Wortschatzes der ägyptischen Sprache", BBAW) and the “Database and Dictionary of Greek Loanwords in Coptic” (DDGLC, Freie Universität Berlin).