Syntacticus
Available
Description
Adapted from the project description of Syntacticus (accessed 24-07-2025):
Syntacticus is an umbrella project for the Pragmatic Resources in Old Indo-European Languages, the Tromsø Old Russian and OCS Treebank (TOROT) and the Information Structure and Word Order Change in Germanic and Romance Languages (ISWOC) Treebank, which all use the same annotation system and share similar linguistic priorities.
Syntacticus provides easy access to around a million morphosyntactically annotated tokens from 10 early Indo-European languages
The texts were manually annotated by specialists. The project has its own annotation style, based on the principles of dependency grammar. Each text was annotated as follows (adapted from the Annotation Principles, accessed 24-07-2025):
- split into words,
- lemmatised (i.e. linked to its dictionary entry),
- assigned a part of speech (i.e. classified as noun, verb etc.),
- assigned morphological features (e.g. tagged with its case form or its tense), and
- given a syntactic function and linked to one or more other words (e.g. the subject of a verb has been labelled a subject and linked to the verb.