==Available==
* Description: https://www.turing.ac.uk/research/publications/diorisis-ancient-greek-corpus
* Data: https://figshare.com/articles/The_Diorisis_Ancient_Greek_Cor...
From the article abstract (accessed 2019-08-06):
The Diorisis Ancient Greek Corpus is a digital collection of ancient Greek texts (from Homer to the early fifth century AD) compiled for linguistic analyses, and specifically with the purpose of developing a computational model of semantic change in Ancient Greek. The corpus consists of 820 texts sourced from open access digital libraries. The texts have been automatically enriched with morphological information for each word. The automatic assignment of words to the correct dictionary entry (lemmatization) has been disambiguated with the implementation of a part-of-speech tagger (a computer programme that may select the part of speech to which an ambiguous word belongs).
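
The abstract describes lemmatization being disambiguated with a part-of-speech tagger. The following is only a minimal sketch of that general idea, not the corpus authors' implementation: it assumes that each ambiguous word form comes with a list of candidate analyses (lemma plus part of speech) and that a tagger has already predicted one part of speech for the token. The names <code>Analysis</code> and <code>disambiguate</code>, the part-of-speech labels, and the placeholder lemmas are all hypothetical and do not come from the Diorisis data or tools.

```python
from typing import List, NamedTuple, Optional


class Analysis(NamedTuple):
    """One candidate reading of a word form (hypothetical structure)."""
    lemma: str
    pos: str


def disambiguate(candidates: List[Analysis], predicted_pos: str) -> Optional[str]:
    """Return the lemma whose part of speech matches the tagger's prediction.

    Returns None when no candidate matches, or when several different
    lemmas share the predicted part of speech (still ambiguous).
    """
    matching = {a.lemma for a in candidates if a.pos == predicted_pos}
    if len(matching) == 1:
        return next(iter(matching))
    return None


# Placeholder data: an invented word form with two possible dictionary entries.
candidates = [
    Analysis(lemma="lemma_a", pos="noun"),
    Analysis(lemma="lemma_b", pos="verb"),
]

# Assume the tagger predicted "verb" for this token.
chosen = disambiguate(candidates, "verb")
print(chosen)
```

In this sketch the part of speech acts purely as a filter over dictionary candidates; a form whose candidates all share the same part of speech would remain ambiguous and need another strategy.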