Citation in digital scholarship

From The Digital Classicist Wiki

Introduction

This page suggests best practices for making citations in digital scholarship and documents a set of conventions intended to promote greater interoperability. It will also identify tools for identifying, processing, and presenting citations in server- and client-side environments. It also lists resources that are creating stable URLs relevant to digital scholarship, with a focus on the humanities disciplines. The term 'citation' is meant very generally as the encoding of a reference to an external entity in support of, as illustration of, or otherwise in relationship to a work of digital scholarship. Scholars cite resources including primary texts, contemporary scholarship, museum objects, people, places, and a wide range of other entities and categories of information.

This effort takes as its starting point that the conventions described for citations should:

  1. Be automatically parsable. Automatic agents should be able to recognize that a citation is being made, and to identify what is being cited.
  2. Encourage reuse of existing naming schemes. A consistently applied convention should allow distinct and independent citations of the same entity to be recognized by third parties. For all the examples below, but particularly for sites creating stable IDs (e.g. Pleiades), the concern is for a generic, interoperable, author-friendly convention for referring to those resources in ways that the sites themselves will recognize. "If you make a reference to Pleiades, how does Pleiades know that you've done so?"
  3. Support user interaction. Client-side operations, such as "show me a map of all geographic entities in a document" can be facilitated by a robust citation convention.
  4. Recognize that various standards already exist and not take unnecessary steps to interfere with the deployment of those standards.

The above list is based upon previous scholarship in the field of digital citation ([1], [2]).

Preliminary Notes

  • An XML environment, with examples implemented in (X)HTML and TEI, is assumed.
  • While this page does assert categories, those are also up for discussion. What is the theoretical and practical difference between a "primary source" and "secondary scholarship"? It is reasonable to cite the 9th century scholar Photius as both.

The Process of Digital Citation in Prose Works

1. Plain-text citations

Sample text: Herodotus (1.78) describes Babylon as the strongest and most famous city in Assyria. It is likely that this city was subsequently the mint from which Alexander issued a series of coins depicting eastern warriors on the obverse and an elephant on the reverse (e.g. ANS 1995.51.68). See discussion by Martin Price (1991).

Is it possible to establish a robust convention that allows unambiguous machine-recognizable linking to the cited text, to Alexander, to Babylon, to a description of the coin in the collection of the American Numismatic Society, and to the article "Circulation at Babylon in 323 B.C."?

2. Indicating the Presence of a Citation (@*="citation")

HTML: <span class="citation">Herodotus (1.78)</span>

TEI: [example not preserved on this page]

In both these usages, an xpath selector "//*[@*='citation']" will create a set of all the citations in a text. That is robust.
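As a rough illustration of that selector, here is a minimal Python sketch using only the standard library. ElementTree's XPath subset cannot express "@*", so the function below emulates it by inspecting every attribute value. The sample markup, including the TEI-style <ref type="citation"> element and the choice of a "type" attribute, is an assumption made for illustration, not something this page prescribes.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample mixing an HTML-style and a TEI-style element.
SAMPLE = """<div>
  <span class="citation">Herodotus (1.78)</span> describes
  <span class="place">Babylon</span>; see also
  <ref type="citation">Hdt. 1.78</ref>.
</div>"""


def find_citations(root):
    """Return elements having any attribute whose value is exactly 'citation'.

    Emulates the XPath //*[@*='citation'], which ElementTree's limited
    XPath support cannot express directly.
    """
    return [el for el in root.iter() if "citation" in el.attrib.values()]


root = ET.fromstring(SAMPLE)
for el in find_citations(root):
    print(el.tag, el.text)
```

Like the XPath expression itself, this is an exact-value comparison: an element carrying class="citation primary" would not be selected.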

3. Normalizing the plain text citation

HTML: <span class="citation" lang="en" title="Herodotus Histories 1.78">Herodotus (1.78)</span>

TEI: [example not preserved on this page]

Normalization will assist tools that can automatically recognize plain text citations.

If the value of the 'title' attribute would be identical to the text representation of the element it is attached to, it can be left out.

Note: in the HTML5 spec, elements without @title inherit the value from any ancestor that has @title. That should not happen in the case of a citation.
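A consumer of this convention might resolve the normalized form roughly as follows. This is a sketch under the rules stated above: the element's own 'title' is used when present, otherwise its text content, and ancestors are deliberately never consulted. The function name is hypothetical.

```python
import xml.etree.ElementTree as ET


def normalized_citation(el):
    """Return the element's own title, falling back to its text content.

    Only the element itself is inspected, so a title on an ancestor
    is never mistaken for the normalized citation.
    """
    title = el.get("title")
    if title:
        return title
    return "".join(el.itertext()).strip()


with_title = ET.fromstring(
    '<span class="citation" lang="en" title="Herodotus Histories 1.78">'
    "Herodotus (1.78)</span>"
)
without_title = ET.fromstring('<span class="citation">Hom. Il. 2.345</span>')

print(normalized_citation(with_title))
print(normalized_citation(without_title))
```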

4. Be explicit about language

Both "Herodotus Histories 1.78" and "Hdt. 1.78" can be considered English representations of the citation of that text. The German equivalent of the first is "Herodot Historien 1.78"; the Latin, still with Arabic numerals, is "Herodotus Historiae 1.78". If the language of the citation is the same as its prose context, it is not necessary to mark up the citation further. It is common practice in some disciplines to cite the title of a work in its original language or in a widely accepted academic language, such as Latin titles for Greek works in Classics.

HTML: Herodotus (<a class="citation" title="Herodotus Historiae 1.78" lang="la">Historiae 1.78</a>) describes...

The 'lang' attribute specifies the language of the element to which it is attached, and there is no separate way to declare the language of the 'title' attribute. Therefore, the 'title' value and the element's text must be in the same language.
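The language rule can be sketched in Python as a lookup that uses a citation's own 'lang' and otherwise falls back to the nearest ancestor that declares one. The enclosing <p lang="en"> and the second, unmarked citation in the sample are assumptions added for illustration.

```python
import xml.etree.ElementTree as ET

# The <p lang="en"> wrapper is a hypothetical prose context.
SAMPLE = """<p lang="en">Herodotus (<a class="citation"
  title="Herodotus Historiae 1.78" lang="la">Historiae 1.78</a>) describes
  <a class="citation">Babylon</a>.</p>"""


def citation_languages(root):
    """Map each citation's text to its own lang or the nearest ancestor's."""
    # ElementTree has no parent pointers, so build a child -> parent map.
    parent = {child: node for node in root.iter() for child in node}
    result = {}
    for el in root.iter():
        if el.get("class") != "citation":
            continue
        node = el
        while node is not None and node.get("lang") is None:
            node = parent.get(node)
        result[el.text] = node.get("lang") if node is not None else None
    return result


print(citation_languages(ET.fromstring(SAMPLE)))
```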

5. Choosing a URL

Ideally, citations in digital scholarship that are accompanied by a link to an online resource should use a URI that is persistent and has clear semantics. Such URIs do not always exist, which is one reason to put a plain-text reference in the 'title' attribute.

HTML: <a class="citation" title="Hdt. 1.78" href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:text:1999.01.0125:book%3D1:chapter%3D78">Herodotus (1.78)</a>

TEI: [example not preserved on this page]
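To show what "clear semantics" can mean in practice, the sketch below takes apart the Perseus URL from the HTML example using Python's standard library. That the 'doc' parameter always consists of a three-part text identifier followed by key=value pairs is an assumption drawn from this single URL, not a documented Perseus scheme.

```python
from urllib.parse import parse_qs, urlsplit

URL = ("http://www.perseus.tufts.edu/hopper/text"
       "?doc=Perseus:text:1999.01.0125:book%3D1:chapter%3D78")


def perseus_passage(url):
    """Split the 'doc' query parameter into a text id and its key=value parts."""
    # parse_qs percent-decodes the value, turning %3D into '='.
    doc = parse_qs(urlsplit(url).query)["doc"][0]
    fields = doc.split(":")
    text_id = ":".join(fields[:3])  # assumed shape: Perseus:text:<number>
    passage = dict(part.split("=", 1) for part in fields[3:])
    return text_id, passage


print(perseus_passage(URL))
```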

For other categories of evidence, persistent URLs do exist.

HTML: <a class="citation" href="http://atlantides.org/batlas/babylon-91-f5">Babylon</a>

More complete markup

Extrapolating from the truncated steps above gives the following markup for the sample text:

HTML: <span><a class="citation" title="Herodotus Histories 1.78" href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:text:1999.01.0125:book%3D1:chapter%3D78">Herodotus (1.78)</a> describes <a class="citation" href="http://atlantides.org/batlas/babylon-91-f5">Babylon</a> as the strongest and most famous city in Assyria. It is likely that this city was subsequently the mint from which <a class="citation" title="Alexander III of Macedon" href="http://en.wikipedia.org/wiki/Alexander_the_Great">Alexander</a> issued a series of coins depicting eastern warriors on the obverse and an elephant on the reverse (e.g. <a class="citation" href="http://numismatics.org/collection/1995.51.68">ANS 1995.51.68</a>). See discussion by Martin Price (<a class="citation" title="Martin Price. 'Circulation at Babylon in 323 B.C.' in Mnemata : papers in memory of Nancy M. Waggoner" href="http://www.worldcat.org/title/mnemata-papers-in-memory-of-nancy-m-waggoner/oclc/24342025">1991</a>).</span>

Notes: The reference to the M. Price article is insufficient.

TEI: to come.
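HTML marked up along these lines could be harvested by a third party, for example to answer the question of which resources a document cites at a given site. The following Python sketch groups citation links by host name; it uses a shortened, hypothetical excerpt of the sample text and assumes well-formed (X)HTML.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict
from urllib.parse import urlsplit

# Shortened excerpt of the sample markup, for illustration only.
SAMPLE = """<span><a class="citation" title="Herodotus Histories 1.78"
 href="http://www.perseus.tufts.edu/hopper/text?doc=Perseus:text:1999.01.0125:book%3D1:chapter%3D78">Herodotus (1.78)</a>
 describes <a class="citation" href="http://atlantides.org/batlas/babylon-91-f5">Babylon</a>
 (e.g. <a class="citation" href="http://numismatics.org/collection/1995.51.68">ANS 1995.51.68</a>).</span>"""


def citations_by_host(root):
    """Group the normalized form of each linked citation by its URL host."""
    grouped = defaultdict(list)
    for a in root.iter("a"):
        href = a.get("href")
        if "citation" in a.get("class", "").split() and href:
            # Prefer the normalized title; fall back to the visible text.
            grouped[urlsplit(href).netloc].append(a.get("title") or a.text)
    return dict(grouped)


for host, cited in citations_by_host(ET.fromstring(SAMPLE)).items():
    print(host, cited)
```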

Adding other markup schemes to conformant citations

The 'class="citation" title="<normalized plain text citation>"' HTML pattern is designed so that it can be easily used with other markup schemes. The global 'class' attribute in HTML is a space-separated list, so other, unrelated values can be present without interfering with the identification of an element as a citation. The global 'title' attribute is directly suitable for the role envisioned here, so it should not clash with other conforming uses.
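Because 'class' is a list, a tool that wants to tolerate additional values has to compare tokens rather than the whole attribute string. A minimal Python sketch follows; the extra class name "primary-source" is hypothetical and not part of the convention.

```python
import xml.etree.ElementTree as ET


def has_citation_class(el):
    """True if 'citation' is one of the space-separated class tokens."""
    return "citation" in el.get("class", "").split()


# "primary-source" is a hypothetical extra class used only for this demo.
examples = [
    '<a class="citation">Hdt. 1.78</a>',
    '<a class="primary-source citation">Hdt. 1.78</a>',
    '<a class="citations">Hdt. 1.78</a>',
]
for markup in examples:
    print(markup, has_citation_class(ET.fromstring(markup)))
```

Token matching accepts the first two elements and rejects the third, whereas an exact string comparison would also reject the second.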

Content creators may choose to add additional markup. Links to guidelines for doing so are listed here.

OpenURL/Coins/Zotero

RDFa

Categories of resources that can be cited

Ancient Mediterranean Primary Texts

"Classics" has well-established abbreviations for authors and works. They are neither complete nor unambiguous, but they are well established.

  • Plain text: "Hom. Il. 2.345", "Homer, Iliad 2.345"

The following examples illustrate that the same text can appear in different places.

This example does not address the presence and/or capabilities of the Canonical Text Services (CTS) protocol and URN scheme under development at the Center for Hellenic Studies.
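Plain-text forms such as "Hom. Il. 2.345" could be recognized and expanded with a small lookup table, as in the Python sketch below. The table, the regular expression, and the output style (modelled on "Herodotus Histories 1.78") are all illustrative assumptions; real abbreviation lists are far longer and, as noted above, not unambiguous.

```python
import re

# Tiny hypothetical lookup table keyed by (author, work) abbreviations.
ABBREVIATIONS = {
    ("Hom.", "Il."): ("Homer", "Iliad"),
    ("Hdt.", None): ("Herodotus", "Histories"),
}

# Author abbreviation, optional work abbreviation, dotted passage numbers.
CITATION = re.compile(
    r"^(?P<author>\w+\.)\s+"
    r"(?:(?P<work>\w+\.)\s+)?"
    r"(?P<passage>\d+(?:\.\d+)*)$"
)


def expand(citation):
    """Expand an abbreviated citation, or return None if it is not recognized."""
    match = CITATION.match(citation.strip())
    if match is None:
        return None
    key = (match.group("author"), match.group("work"))
    if key not in ABBREVIATIONS:
        return None
    author, work = ABBREVIATIONS[key]
    return f"{author} {work} {match.group('passage')}"


for text in ["Hom. Il. 2.345", "Hdt. 1.78", "Martin Price (1991)"]:
    print(text, "->", expand(text))
```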

Geographic Entities

Within the Ancient Mediterranean, the Pleiades Project is establishing short URLs as identifiers for geographic entities (but see their own discussion for details). Geonames.org is a worldwide list of identifiers.

Bibliographic Data

WorldCat is one candidate, but there may be licensing issues.

What is the relationship between citing a work and citing its bibliographic record? Is that a necessary distinction?

Museum Objects

Or any cataloged object with stable id?

HTML: <a class="citation" href="http://numismatics.org/collection/1968.34.40">ANS 1968.34.40</a>.

Egyptian Papyri

The sites http://papyri.info and http://trismegistos.org (e.g. http://www.trismegistos.org/tm/detail.php?tm=23 ) are islands of stability here.

HTML: <a class="citation" title="Trismegistos Number 23" href="http://www.trismegistos.org/tm/detail.php?tm=23">TM23</a>
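Markup of this kind can be generated rather than typed by hand. The Python sketch below builds a conformant anchor and omits 'title' when it would merely repeat the link text. The helper names are hypothetical, and the assumption that other Trismegistos numbers follow the same URL pattern is drawn only from the single example above.

```python
from html import escape


def citation_anchor(text, href, title=None):
    """Build an <a class="citation"> element as a string, escaping all values."""
    attrs = [("class", "citation")]
    # Leave out the title when it would duplicate the visible text.
    if title and title != text:
        attrs.append(("title", title))
    attrs.append(("href", href))
    rendered = " ".join(f'{k}="{escape(v, quote=True)}"' for k, v in attrs)
    return f"<a {rendered}>{escape(text)}</a>"


def trismegistos_citation(tm_number):
    """Cite a Trismegistos record; the URL pattern is assumed from one example."""
    return citation_anchor(
        f"TM{tm_number}",
        f"http://www.trismegistos.org/tm/detail.php?tm={tm_number}",
        title=f"Trismegistos Number {tm_number}",
    )


print(trismegistos_citation(23))
```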

References

  1. Romanello 2008
  2. Smith 2009