Extracting Information from Classics Scholarly Texts (Romanello): Difference between revisions
No edit summary |
No edit summary |
||
Line 9: | Line 9: | ||
== Abstract == | == Abstract == | ||
The project is an ongoing Computational Linguistic and Text Analytic study of how the language and structure of explicitly encoded data sources can be used to help mining texts of unencoded corpora. | |||
The two corpora being currently considered contain respectively OCRed journal papers and working papers about Classical(Greek and Latin) texts. | |||
The presented project aims at showing how - and with which gain in terms of accuracy - information extracted from structured data sources can be used to automatically extract information from an unstructured | |||
corpus. The extracted information is meant to be used in order to provide semantic access over the corpus itself. | |||
== Presentations == | == Presentations == | ||
* poster presentation at the Arts and Humanities Week 2009, King's College London : [[http://www.slideshare.net/56k/extracting-information-from-classics-scholarly-texts poster]] | |||
* presentation at the PhD Seminar (CCH/KCL) : [[http://www.slideshare.net/56k/stuctured-vs-unstructured-extracting-information-from-classics-scholarly-texts slides]] | |||
== Material == | == Material == |
Revision as of 15:15, 28 April 2010
Provisional Title:
Structured and Unstructured: Extracting Information from Classics Scholarly Texts
Supervisors:
- Willard McCarty (Centre for Computing in the Humanities, King's College London)
- Jonathan Ginzburg (Department of Computer Science, King's College London)
Abstract
The project is an ongoing Computational Linguistic and Text Analytic study of how the language and structure of explicitly encoded data sources can be used to help mining texts of unencoded corpora.
The two corpora being currently considered contain respectively OCRed journal papers and working papers about Classical(Greek and Latin) texts.
The presented project aims at showing how - and with which gain in terms of accuracy - information extracted from structured data sources can be used to automatically extract information from an unstructured corpus. The extracted information is meant to be used in order to provide semantic access over the corpus itself.
Presentations
- poster presentation at the Arts and Humanities Week 2009, King's College London : [poster]
- presentation at the PhD Seminar (CCH/KCL) : [slides]