Beta Code: Difference between revisions

From The Digital Classicist Wiki
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
Line 1: Line 1:


Beta Code is a system for representing ancient Greek and other non-Latin scripts using the standard ASCII character set. Developed in the 1980s by the Thesaurus Linguae Graecae (TLG) project at the University of California, Irvine, its purpose was to enable the electronic encoding, searching, and transmission of polytonic Greek texts on computers that could not yet display Greek fonts.
Beta Code is a system for representing ancient Greek and other non-Latin scripts using the standard ASCII character set. Developed in the 1980s by the Thesaurus Linguae Graecae (TLG) project at the University of California, Irvine, it aimed to enable the electronic encoding, searching, and transmission of polytonic Greek texts on computers that were not able yet to display Greek fonts.


In Beta Code, every Greek letter and diacritical mark, such as accents, breathings, and iota subscripts, is represented by a combination of Latin letters and symbols. For example:
In Beta Code, every Greek letter and diacritical mark, including accents, breathings, and iota subscripts, is represented by a combination of Latin letters and symbols. For example:


λόγος → LO/GOS
λόγος → LO/GOS
Line 8: Line 8:
ἄνθρωπος → A)/NQRWPOS
ἄνθρωπος → A)/NQRWPOS


This standardized encoding allowed large digital text corpora, including the TLG, the Packard Humanities Institute (PHI) Greek Inscriptions, and the Duke Databank of Documentary Papyri, to store and exchange Greek texts in plain-text form before the widespread adoption of Unicode.
This standardized encoding allowed large digital text corpora, including the TLG, the Packard Humanities Institute (PHI) Greek Inscriptions, and the Duke Databank of Documentary Papyri, to store and exchange Greek texts in plain-text form (before the widespread adoption of Unicode).


Today, most databases have migrated to Unicode, but Beta Code remains important for data preservation, scholarly citation, and compatibility with legacy digital classics projects.
Today, the vast majority of databases support Unicode, but Beta Code still is important for data preservation, scholarly citation, and compatibility with legacy digital classics projects.

Revision as of 18:57, 11 October 2025

Beta Code is a system for representing ancient Greek and other non-Latin scripts using the standard ASCII character set. Developed in the 1980s by the Thesaurus Linguae Graecae (TLG) project at the University of California, Irvine, it aimed to enable the electronic encoding, searching, and transmission of polytonic Greek texts on computers that were not able yet to display Greek fonts.

In Beta Code, every Greek letter and diacritical mark, including accents, breathings, and iota subscripts, is represented by a combination of Latin letters and symbols. For example:

λόγος → LO/GOS

ἄνθρωπος → A)/NQRWPOS

This standardized encoding allowed large digital text corpora, including the TLG, the Packard Humanities Institute (PHI) Greek Inscriptions, and the Duke Databank of Documentary Papyri, to store and exchange Greek texts in plain-text form (before the widespread adoption of Unicode).

Today, the vast majority of databases support Unicode, but Beta Code still is important for data preservation, scholarly citation, and compatibility with legacy digital classics projects.