Digital Corpus of Sanskrit

From The Digital Classicist Wiki
Jump to navigation Jump to search

Available

Editor

  • Oliver Hellwig

Description

From the project website (accessed 2025-04-28):

The Digital Corpus of Sanskrit (DCS) is a Sandhi-split corpus of Sanskrit texts with full morphological and lexical analysis. The DCS is designed for text-historical research in Sanskrit linguistics and philology. Users can search for lexical units (words) and their collocations in a corpus of about 4,800,000 manually tagged words in 650,000 text lines. The DCS offers two main entry points for research:

  1. Words can be retrieved from the dictionary through a simple query or a dictionary page. For each lexical unit contained in the corpus, DCS provides the complete set of occurrences and a statistical evaluation based on historical principles.
  2. The text interface shows all contained texts along with their interlinear lexical and morphological analysis.