Dataset Integration Hack: Difference between revisions

From The Digital Classicist Wiki
Jump to navigation Jump to search
mNo edit summary
(→‎The problem: added GoogleDoc link)
Line 2: Line 2:


How to integrate several distributed but Open Access and Open Licensed datasets so that they can be served via a metadata portal from a single web service.
How to integrate several distributed but Open Access and Open Licensed datasets so that they can be served via a metadata portal from a single web service.
The datasets: [https://spreadsheets.google.com/ccc?key=0As1AGmWRrRdUdDBBbXItYmZ3NWJ0RHUtZk1waXd5N3c Open Access Classical Data]


== Platform ==
== Platform ==

Revision as of 18:15, 3 November 2010

The problem

How to integrate several distributed but Open Access and Open Licensed datasets so that they can be served via a metadata portal from a single web service.

The datasets: Open Access Classical Data

Platform

OAI-PMH server and DC metadata. (JN, MR, JMV: more info please?)

Metadata

Harvesting

Metadata will be harvested on a case-by-case basis from the source data, with additional global parameters provided from local knowledge as required. Ideally, and eventually, individual datasets would provide their own OAI service to expose this metadata. (We may try to illustrate this with IAph and IRT at some point.)

Schema

OAI-PMH in Dublin Core

  • dc:title
    • title of resource
  • dc:creator
    • harvest (or known?)
  • dc:subject
    • ??
  • dc:description
    • if any free prose
  • dc:publisher
    • harvest
  • dc:contributor
    • harvest if given
  • dc:date
    • harvest
  • dc:type
    • closed list (edition|photograph|commentary|database|linked data|other)
  • dc:format
    • filetypes?
  • dc:identifier
    • URI and/or URL?
  • dc:source
    • ??
  • dc:language
    • = modern language
  • dc:relation
    • ??
  • dc:coverage
    • ??
  • dc:rights
    • = license (in spreadsheet)

What's next?

Set up OAIPMH server.

Create sample metadata for each dataset.

Next meeting.