Difference between revisions of "Coptic SCRIPTORIUM"

From The Digital Classicist Wiki
Jump to navigation Jump to search
(adding contact and editors)
(added Digiclass seminar link)
 
Line 20: Line 20:


The project has developed the first freely available part-of-speech tagger for any phase of the Egyptian language.  It also has character converters to convert text from some older Coptic fonts into [[Coptic (Unicode)|Coptic]] Unicode.  It includes a search page for the Comprehensive Coptic Lexicon (v1.2) containing 11263 entries and 31847 forms of Egyptian-Coptic and Greek-Coptic datasets.  The lexicon has been produced together with the Thesaurus Linguae Aegyptiae project ("Strukturen und Transformationen des Wortschatzes der ägyptischen Sprache", BBAW) and the “Database and Dictionary of Greek Loanwords in Coptic” (DDGLC, Freie Universität Berlin).  
The project has developed the first freely available part-of-speech tagger for any phase of the Egyptian language.  It also has character converters to convert text from some older Coptic fonts into [[Coptic (Unicode)|Coptic]] Unicode.  It includes a search page for the Comprehensive Coptic Lexicon (v1.2) containing 11263 entries and 31847 forms of Egyptian-Coptic and Greek-Coptic datasets.  The lexicon has been produced together with the Thesaurus Linguae Aegyptiae project ("Strukturen und Transformationen des Wortschatzes der ägyptischen Sprache", BBAW) and the “Database and Dictionary of Greek Loanwords in Coptic” (DDGLC, Freie Universität Berlin).  
==References and presentations==
* Amir Zeldes, Caroline T. Schroeder & Lance Martin, 2021. “Leveraging non-named entities in Coptic antiquity.” ''Digital Classicist London seminar'' 2021-0-10. Available: <https://www.youtube.com/watch?v=Web8vaAbx-M>


[[Category:projects]]
[[Category:projects]]

Latest revision as of 17:33, 10 September 2021

Available

Contact

  • contact@copticscriptorium.org

Editors

  • Caroline T. Schroeder
  • Amir Zeldes

Description

Coptic SCRIPTORIUM (Sahidic Corpus Research: Internet Platform for Interdisciplinary multilayer Methods) is a project being developed by Caroline T. Schroeder (the University of the Pacific) and Amir Zeldes (Georgetown University) to create an interdisciplinary platform for search and computational research in texts in the Coptic language, particularly the Sahidic dialect.

As a fully open-source platform, the Coptic SCRIPTORIUM technologies and corpus will function as a collaborative environment for digital research by any scholars working in Coptic. It will provide tools to process texts, a searchable richly-annotated corpus of texts processed in part with such tools, a collaborative platform for scholars to use and contribute to the project, and research results generated from the tools and corpus.

The project has developed the first freely available part-of-speech tagger for any phase of the Egyptian language. It also has character converters to convert text from some older Coptic fonts into Coptic Unicode. It includes a search page for the Comprehensive Coptic Lexicon (v1.2) containing 11263 entries and 31847 forms of Egyptian-Coptic and Greek-Coptic datasets. The lexicon has been produced together with the Thesaurus Linguae Aegyptiae project ("Strukturen und Transformationen des Wortschatzes der ägyptischen Sprache", BBAW) and the “Database and Dictionary of Greek Loanwords in Coptic” (DDGLC, Freie Universität Berlin).

References and presentations

  • Amir Zeldes, Caroline T. Schroeder & Lance Martin, 2021. “Leveraging non-named entities in Coptic antiquity.” Digital Classicist London seminar 2021-0-10. Available: <https://www.youtube.com/watch?v=Web8vaAbx-M>