<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://wiki.digitalclassicist.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=TheaSommerschield</id>
	<title>The Digital Classicist Wiki - User contributions [en-gb]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.digitalclassicist.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=TheaSommerschield"/>
	<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/Special:Contributions/TheaSommerschield"/>
	<updated>2026-04-29T05:28:12Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.41.1</generator>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Lexicon_Iconographicum_Mythologiae_Classicae-France&amp;diff=11882</id>
		<title>Lexicon Iconographicum Mythologiae Classicae-France</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Lexicon_Iconographicum_Mythologiae_Classicae-France&amp;diff=11882"/>
		<updated>2023-10-18T15:05:43Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: Formatted entire page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
* http://www.limc-france.fr/presentation&lt;br /&gt;
&lt;br /&gt;
==Editor==&lt;br /&gt;
* LIMC-France&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
Taken from the project website (Accessed 2023-10-18):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;The French team of the International Foundation for the LIMC gives you access to the digital resources that have been developed around ancient iconography.&lt;br /&gt;
The three databases:&lt;br /&gt;
&lt;br /&gt;
* LIMC-icon contains data relating to ancient Greek, Roman and Etruscan documents bearing a mythological or religious representation.&lt;br /&gt;
&lt;br /&gt;
* LIMC-biblio contains recent bibliographical data to complete the information published in the LIMC volumes.&lt;br /&gt;
&lt;br /&gt;
* LIMC-abrev allows you to find the list of the articles published in the LIMC and the full names of the bibliographical abbreviations used in the LIMC, in the ThesCRA and on this site.&lt;br /&gt;
&lt;br /&gt;
Any person wishing to add any information relating to ancient iconographic documents can contact us so that this information may be entered into the LIMC-France online resources.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Cultural heritage]]&lt;br /&gt;
[[category:Institutions]]&lt;br /&gt;
[[category:images]]&lt;br /&gt;
[[category:paywalled]]&lt;br /&gt;
[[category:mythology]]&lt;br /&gt;
[[category:Etruscan]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11879</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11879"/>
		<updated>2023-10-18T15:02:25Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a Marie Skłodowska-Curie postdoctoral fellow at Ca’ Foscari University of Venice. My research uses machine learning to study the epigraphic cultures of the Mediterranean world, and I also work on the history and inscriptions of ancient Sicily.&lt;br /&gt;
&lt;br /&gt;
Since obtaining my DPhil in Ancient History (University of Oxford) in 2021, I have held fellowships at the British School at Rome, Harvard's Centre for Hellenic Studies, and Google Cloud. In 2024 I will be returning to the UK for a Leverhulme EC Fellowship at the University of Nottingham.&lt;br /&gt;
I co-led the [[Pythia]] (2019) and [[Ithaca]] (2022) projects, and am publishing the new inscriptions from Himera (Sicily).&lt;br /&gt;
&lt;br /&gt;
I live in Venice, but I'm often out on the Mediterranean trail.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} fmail {dot} com&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Vici.org&amp;diff=11877</id>
		<title>Vici.org</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Vici.org&amp;diff=11877"/>
		<updated>2023-10-18T14:33:49Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: Added access date and checked contents&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://vici.org/&lt;br /&gt;
&lt;br /&gt;
==Author==&lt;br /&gt;
&lt;br /&gt;
* René Voorburg&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
Taken from the project website (Accessed 2023-10-18):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;&lt;br /&gt;
'''Vici.org''' is the archaeological atlas of classical antiquity. It is a community driven archaeological map, inspired by and modelled after Wikipedia. The first version of Vici.org went online in May 2012. It was preceded by a sister website Omnesviae.org, a roman routeplanner based on the Peutinger map. Since its start, Vici.org has grown a lot. At the time to this writing, over 140 contributors have added nearly 20,000 locations, approximately 1,000 line tracings and over 3,000 images.&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[category:geography]]&lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:Crowdsourcing]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11778</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11778"/>
		<updated>2023-09-11T11:18:29Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;A historian and epigrapher, my research uses machine learning to study the epigraphic cultures of the ancient Mediterranean. &lt;br /&gt;
&lt;br /&gt;
I am a Marie Skłodowska-Curie Fellow at Ca' Foscari University of Venice ([https://pric.unive.it/projects/pythiaplus/home PythiaPlus project]), collaborating with Google Deepmind (London), AUEB and the IMSI Athena Research Centre (Athens). In 2021-2022, I am a CHS Fellow in Hellenic Studies at Harvard University; in 2022-2023 I am part of the second cohort of Google Cloud Research Innovators.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study IUSS Pavia. I then moved to Oxford for my MSt in Greek History. I completed my DPhil in Ancient History at the University of Oxford in January 2021. In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome.&lt;br /&gt;
&lt;br /&gt;
I am co-lead and co-first author of the [[Pythia]] (2019) and [[Ithaca]] (2022) projects.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://pric.unive.it/projects/pythiaplus/home PythiaPlus]&lt;br /&gt;
* [https://www.unive.it/data/persone/25105303 UniVe Profile]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11777</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11777"/>
		<updated>2023-09-11T11:18:02Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: updated job link&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian and epigrapher, my research uses machine learning to study the epigraphic cultures of the ancient Mediterranean. &lt;br /&gt;
&lt;br /&gt;
I am a Marie Skłodowska-Curie Fellow at Ca' Foscari University of Venice ([https://pric.unive.it/projects/pythiaplus/home PythiaPlus project]), collaborating with Google Deepmind (London), AUEB and the IMSI Athena Research Centre (Athens). In 2021-2022, I am a CHS Fellow in Hellenic Studies at Harvard University; in 2022-2023 I am part of the second cohort of Google Cloud Research Innovators.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study IUSS Pavia. I then moved to Oxford for my MSt in Greek History. I completed my DPhil in Ancient History at the University of Oxford in January 2021. In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome.&lt;br /&gt;
&lt;br /&gt;
I am co-lead and co-first author of the [[Pythia]] (2019) and [[Ithaca]] (2022) projects.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://pric.unive.it/projects/pythiaplus/home PythiaPlus]&lt;br /&gt;
* [https://www.unive.it/data/persone/25105303 UniVe Profile]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11776</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11776"/>
		<updated>2023-09-11T11:17:38Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a Marie Skłodowska-Curie postdoctoral fellow at Ca’ Foscari University of Venice working on my [https://pric.unive.it/projects/pythiaplus/home PythiaPlus] project.&lt;br /&gt;
&lt;br /&gt;
My research uses machine learning to study the epigraphic cultures of the Mediterranean world, and I also work on the history and inscriptions of ancient Sicily.&lt;br /&gt;
Since obtaining my DPhil in Ancient History (University of Oxford) in 2021, I have held fellowships at the British School at Rome, Harvard's Centre for Hellenic Studies, and Google Cloud. In 2024 I will be returning to the UK for a Leverhulme EC Fellowship at the University of Nottingham.&lt;br /&gt;
&lt;br /&gt;
I co-led the Pythia (2019) and Ithaca (2022) projects (the latter featured on the cover of the scientific journal ''Nature''), and am publishing the new inscriptions from Himera, Sicily. I live in Venice, but I'm often out on the Mediterranean trail.&lt;br /&gt;
&lt;br /&gt;
&amp;quot;Computers aren't the thing, they're the thing that gets us to the thing&amp;quot; (J. MacMillan).&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} gmail {dot} com&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11192</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11192"/>
		<updated>2022-05-31T16:41:04Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian and epigrapher, my research uses machine learning to study the epigraphic cultures of the ancient Mediterranean. &lt;br /&gt;
&lt;br /&gt;
I am a Marie Skłodowska-Curie Fellow at Ca' Foscari University of Venice ([https://pric.unive.it/projects/pythiaplus/home PythiaPlus project]), collaborating with Google Deepmind (London), AUEB and the IMSI Athena Research Centre (Athens). In 2021-2022, I am a CHS Fellow in Hellenic Studies at Harvard University; in 2022-2023 I am part of the second cohort of Google Cloud Research Innovators.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study IUSS Pavia. I then moved to Oxford for my MSt in Greek History. I completed my DPhil in Ancient History at the University of Oxford in January 2021. In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome.&lt;br /&gt;
&lt;br /&gt;
I am co-lead and co-first author of the [[Pythia]] (2019) and [[Ithaca]] (2022) projects.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://pric.unive.it/projects/pythiaplus/home PythiaPlus]&lt;br /&gt;
* [https://www.unive.it/data/persone/25105303 UniVe Profile]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11191</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11191"/>
		<updated>2022-05-31T16:39:28Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian and epigrapher, my research uses machine learning to study the epigraphic cultures of the ancient Mediterranean. &lt;br /&gt;
&lt;br /&gt;
I am a Marie Skłodowska-Curie Fellow at Ca' Foscari University of Venice ([https://pric.unive.it/projects/pythiaplus/home PythiaPlus project]), collaborating with Google Deepmind (London), AUEB and the IMSI Athena Research Centre (Athens). In 2021-2022, I am a CHS Fellow in Hellenic Studies at Harvard University; in 2022-2023 I am part of the second cohort of Google Cloud Research Innovators.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study IUSS Pavia. I then moved to Oxford for my MSt in Greek History. I completed my DPhil in Ancient History at the University of Oxford in January 2021. In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome.&lt;br /&gt;
&lt;br /&gt;
I am co-lead and co-first author of the [[Project:Pythia|Pythia]] and [[Project:Ithaca|Ithaca]] projects.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://pric.unive.it/projects/pythiaplus/home PythiaPlus]&lt;br /&gt;
* [https://www.unive.it/data/persone/25105303 UniVe Profile]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Ithaca&amp;diff=11190</id>
		<title>Ithaca</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Ithaca&amp;diff=11190"/>
		<updated>2022-05-31T16:37:15Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added link&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Links==&lt;br /&gt;
&lt;br /&gt;
* DOI: https://www.nature.com/articles/s41586-022-04448-z&lt;br /&gt;
* Ithaca's free online interface: https://ithaca.deepmind.com/&lt;br /&gt;
* I.PHI codebase: https://github.com/sommerschield/iphi&lt;br /&gt;
* Ithaca's training and inference code: https://github.com/deepmind/ithaca &lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
Yannis Assael*, [[User:TheaSommerschield|Thea Sommerschield]]*, Brendan Shillingford, Mahyar Bordbar, John Pavlopoulos, Marita Chatzipanagiotou, Ion Androutsopoulos, Jonathan Prag &amp;amp; Nando de Freitas.&lt;br /&gt;
&lt;br /&gt;
==Institutions and Partners==&lt;br /&gt;
DeepMind, Ca' Foscari University of Venice (PythiaPlus Project, Marie Skłodowska-Curie grant agreement No. 101026185), Athens University of Economics and Business, University of Oxford, Google Cloud, Google Arts and Culture.&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
'''Ithaca: Restoring and attributing ancient texts using deep neural networks'''&lt;br /&gt;
&lt;br /&gt;
Ancient history relies on disciplines such as epigraphy—the study of inscribed texts known as inscriptions—for evidence of the thought, language, society and history of past civilizations. However, over the centuries, many inscriptions have been damaged to the point of illegibility, transported far from their original location and their date of writing is steeped in uncertainty. Here we present Ithaca, a deep neural network for the textual restoration, geographical attribution and chronological attribution of ancient Greek inscriptions. Ithaca is designed to assist and expand the historian’s workflow. The architecture of Ithaca focuses on collaboration, decision support and interpretability. While Ithaca alone achieves 62% accuracy when restoring damaged texts, the use of Ithaca by historians improved their accuracy from 25% to 72%, confirming the synergistic effect of this research tool. Ithaca can attribute inscriptions to their original location with an accuracy of 71% and can date them to less than 30 years of their ground-truth ranges, redating key texts of Classical Athens and contributing to topical debates in ancient history. This research shows how models such as Ithaca can unlock the cooperative potential between artificial intelligence and historians, transformationally impacting the way that we study and write about one of the most important periods in human history.&lt;br /&gt;
&lt;br /&gt;
Learn more:&lt;br /&gt;
* Nature video: https://www.youtube.com/watch?v=rq0Ex_qCKeQ&lt;br /&gt;
* DeepMind research blog: https://deepmind.com/blog/article/Predicting-the-past-with-Ithaca&lt;br /&gt;
* Nature podcast: https://www.nature.com/articles/d41586-022-00701-7 &lt;br /&gt;
* Nature 'News and Views' piece (by Prof. Charlotte Roueché): https://www.nature.com/articles/d41586-022-00641-2&lt;br /&gt;
&lt;br /&gt;
To cite this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Shillingford, B. et al. Restoring and attributing ancient texts using deep neural networks. Nature 603, 280–283 (2022). https://doi.org/10.1038/s41586-022-04448-z.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
* [https://elpais.com/ciencia/2022-05-24/una-herramienta-de-inteligencia-artificial-ayuda-a-la-restauracion-y-atribucion-de-textos-antiguos.html El Paìs]&lt;br /&gt;
* [https://www.smithsonianmag.com/smart-news/a-new-ai-can-help-historians-decipher-damaged-ancient-greek-texts-180979736/ The Smithsonian]&lt;br /&gt;
* [https://www.ox.ac.uk/news/2022-03-09-oxford-historians-apply-state-art-ai-transform-study-ancient-texts University of Oxford]&lt;br /&gt;
* [https://www.newscientist.com/article/2311657-ai-can-help-historians-restore-ancient-texts-from-damaged-inscriptions/ New Scientist]&lt;br /&gt;
* [https://www.theguardian.com/science/2022/mar/09/ai-could-decipher-gaps-in-ancient-greek-texts-say-researchers The Guardian]&lt;br /&gt;
* [https://www.thetimes.co.uk/article/ai-fills-the-gaps-in-ancient-texts-to-clear-pericles-3qccs25fv The Times]&lt;br /&gt;
* [https://www.lastampa.it/tuttoscienze/2022/04/06/news/arriva_ithaca_l_occhio_dell_ia_che_aiuta_gli_archeologi_a_riscrivere_il_passato-2914070/ La Stampa]&lt;br /&gt;
* [https://digitalculture.gov.gr/2022/03/ithaki-anakaliptontas-tin-archea-ellada-mesa-apo-tin-techniti-noimosini/ Greek Ministry of Culture]&lt;br /&gt;
* [https://www.kathimerini.gr/society/561754495/apokleistiko-ithaki-ekei-poy-i-techniti-noimosyni-synanta-tin-archaia-ellada/ Kathimerini]&lt;br /&gt;
* [https://www.lemonde.fr/sciences/article/2022/03/15/l-intelligence-artificielle-dechiffre-le-grec-ancien_6117639_1650684.html Le Monde]&lt;br /&gt;
* [https://www.unive.it/pag/14024/?tx_news_pi1%5Bnews%5D=12038&amp;amp;cHash=ee3ef41bb70b14cbda7f8dba3f65f07f Ca' Foscari University of Venice]&lt;br /&gt;
&lt;br /&gt;
==Corresponding authors==&lt;br /&gt;
* thea {dot} sommerschield {at} unive {dot} it&lt;br /&gt;
* assael {at} google {dot} com&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Ithaca&amp;diff=11187</id>
		<title>Ithaca</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Ithaca&amp;diff=11187"/>
		<updated>2022-05-24T13:56:35Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: affiliations added&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Links==&lt;br /&gt;
&lt;br /&gt;
* DOI: https://www.nature.com/articles/s41586-022-04448-z&lt;br /&gt;
* Ithaca's free online interface: https://ithaca.deepmind.com/&lt;br /&gt;
* I.PHI codebase: https://github.com/sommerschield/iphi&lt;br /&gt;
* Ithaca's training and inference code: https://github.com/deepmind/ithaca &lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
Yannis Assael*, Thea Sommerschield*, Brendan Shillingford, Mahyar Bordbar, John Pavlopoulos, Marita Chatzipanagiotou, Ion Androutsopoulos, Jonathan Prag &amp;amp; Nando de Freitas.&lt;br /&gt;
&lt;br /&gt;
==Institutions and Partners==&lt;br /&gt;
DeepMind, Ca' Foscari University of Venice (PythiaPlus Project, Marie Skłodowska-Curie grant agreement No. 101026185), Athens University of Economics and Business, University of Oxford, Google Cloud, Google Arts and Culture.&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
'''Ithaca: Restoring and attributing ancient texts using deep neural networks'''&lt;br /&gt;
&lt;br /&gt;
Ancient history relies on disciplines such as epigraphy—the study of inscribed texts known as inscriptions—for evidence of the thought, language, society and history of past civilizations. However, over the centuries, many inscriptions have been damaged to the point of illegibility, transported far from their original location and their date of writing is steeped in uncertainty. Here we present Ithaca, a deep neural network for the textual restoration, geographical attribution and chronological attribution of ancient Greek inscriptions. Ithaca is designed to assist and expand the historian’s workflow. The architecture of Ithaca focuses on collaboration, decision support and interpretability. While Ithaca alone achieves 62% accuracy when restoring damaged texts, the use of Ithaca by historians improved their accuracy from 25% to 72%, confirming the synergistic effect of this research tool. Ithaca can attribute inscriptions to their original location with an accuracy of 71% and can date them to less than 30 years of their ground-truth ranges, redating key texts of Classical Athens and contributing to topical debates in ancient history. This research shows how models such as Ithaca can unlock the cooperative potential between artificial intelligence and historians, transformationally impacting the way that we study and write about one of the most important periods in human history.&lt;br /&gt;
&lt;br /&gt;
Learn more:&lt;br /&gt;
* Nature video: https://www.youtube.com/watch?v=rq0Ex_qCKeQ&lt;br /&gt;
* DeepMind research blog: https://deepmind.com/blog/article/Predicting-the-past-with-Ithaca&lt;br /&gt;
* Nature podcast: https://www.nature.com/articles/d41586-022-00701-7 &lt;br /&gt;
* Nature 'News and Views' piece (by Prof. Charlotte Roueché): https://www.nature.com/articles/d41586-022-00641-2&lt;br /&gt;
&lt;br /&gt;
To cite this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Shillingford, B. et al. Restoring and attributing ancient texts using deep neural networks. Nature 603, 280–283 (2022). https://doi.org/10.1038/s41586-022-04448-z.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
* [https://elpais.com/ciencia/2022-05-24/una-herramienta-de-inteligencia-artificial-ayuda-a-la-restauracion-y-atribucion-de-textos-antiguos.html El Paìs]&lt;br /&gt;
* [https://www.smithsonianmag.com/smart-news/a-new-ai-can-help-historians-decipher-damaged-ancient-greek-texts-180979736/ The Smithsonian]&lt;br /&gt;
* [https://www.ox.ac.uk/news/2022-03-09-oxford-historians-apply-state-art-ai-transform-study-ancient-texts University of Oxford]&lt;br /&gt;
* [https://www.newscientist.com/article/2311657-ai-can-help-historians-restore-ancient-texts-from-damaged-inscriptions/ New Scientist]&lt;br /&gt;
* [https://www.theguardian.com/science/2022/mar/09/ai-could-decipher-gaps-in-ancient-greek-texts-say-researchers The Guardian]&lt;br /&gt;
* [https://www.thetimes.co.uk/article/ai-fills-the-gaps-in-ancient-texts-to-clear-pericles-3qccs25fv The Times]&lt;br /&gt;
* [https://www.lastampa.it/tuttoscienze/2022/04/06/news/arriva_ithaca_l_occhio_dell_ia_che_aiuta_gli_archeologi_a_riscrivere_il_passato-2914070/ La Stampa]&lt;br /&gt;
* [https://digitalculture.gov.gr/2022/03/ithaki-anakaliptontas-tin-archea-ellada-mesa-apo-tin-techniti-noimosini/ Greek Ministry of Culture]&lt;br /&gt;
* [https://www.kathimerini.gr/society/561754495/apokleistiko-ithaki-ekei-poy-i-techniti-noimosyni-synanta-tin-archaia-ellada/ Kathimerini]&lt;br /&gt;
* [https://www.lemonde.fr/sciences/article/2022/03/15/l-intelligence-artificielle-dechiffre-le-grec-ancien_6117639_1650684.html Le Monde]&lt;br /&gt;
* [https://www.unive.it/pag/14024/?tx_news_pi1%5Bnews%5D=12038&amp;amp;cHash=ee3ef41bb70b14cbda7f8dba3f65f07f Ca' Foscari University of Venice]&lt;br /&gt;
&lt;br /&gt;
==Corresponding authors==&lt;br /&gt;
* thea {dot} sommerschield {at} unive {dot} it&lt;br /&gt;
* assael {at} google {dot} com&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Ithaca&amp;diff=11186</id>
		<title>Ithaca</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Ithaca&amp;diff=11186"/>
		<updated>2022-05-24T13:52:42Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: Creation of Ithaca page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Links==&lt;br /&gt;
&lt;br /&gt;
* DOI: https://www.nature.com/articles/s41586-022-04448-z&lt;br /&gt;
* Ithaca's free online interface: https://ithaca.deepmind.com/&lt;br /&gt;
* I.PHI codebase: https://github.com/sommerschield/iphi&lt;br /&gt;
* Ithaca's training and inference code: https://github.com/deepmind/ithaca &lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
Yannis Assael*, Thea Sommerschield*, Brendan Shillingford, Mahyar Bordbar, John Pavlopoulos, Marita Chatzipanagiotou, Ion Androutsopoulos, Jonathan Prag &amp;amp; Nando de Freitas.&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
'''Ithaca: Restoring and attributing ancient texts using deep neural networks'''&lt;br /&gt;
&lt;br /&gt;
Ancient history relies on disciplines such as epigraphy—the study of inscribed texts known as inscriptions—for evidence of the thought, language, society and history of past civilizations. However, over the centuries, many inscriptions have been damaged to the point of illegibility, transported far from their original location and their date of writing is steeped in uncertainty. Here we present Ithaca, a deep neural network for the textual restoration, geographical attribution and chronological attribution of ancient Greek inscriptions. Ithaca is designed to assist and expand the historian’s workflow. The architecture of Ithaca focuses on collaboration, decision support and interpretability. While Ithaca alone achieves 62% accuracy when restoring damaged texts, the use of Ithaca by historians improved their accuracy from 25% to 72%, confirming the synergistic effect of this research tool. Ithaca can attribute inscriptions to their original location with an accuracy of 71% and can date them to less than 30 years of their ground-truth ranges, redating key texts of Classical Athens and contributing to topical debates in ancient history. This research shows how models such as Ithaca can unlock the cooperative potential between artificial intelligence and historians, transformationally impacting the way that we study and write about one of the most important periods in human history.&lt;br /&gt;
&lt;br /&gt;
Learn more:&lt;br /&gt;
* Nature video: https://www.youtube.com/watch?v=rq0Ex_qCKeQ&lt;br /&gt;
* DeepMind research blog: https://deepmind.com/blog/article/Predicting-the-past-with-Ithaca&lt;br /&gt;
* Nature podcast: https://www.nature.com/articles/d41586-022-00701-7 &lt;br /&gt;
* Nature 'News and Views' piece (by Prof. Charlotte Roueché): https://www.nature.com/articles/d41586-022-00641-2&lt;br /&gt;
&lt;br /&gt;
To cite this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Shillingford, B. et al. Restoring and attributing ancient texts using deep neural networks. Nature 603, 280–283 (2022). https://doi.org/10.1038/s41586-022-04448-z.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
* [https://elpais.com/ciencia/2022-05-24/una-herramienta-de-inteligencia-artificial-ayuda-a-la-restauracion-y-atribucion-de-textos-antiguos.html El Paìs]&lt;br /&gt;
* [https://www.smithsonianmag.com/smart-news/a-new-ai-can-help-historians-decipher-damaged-ancient-greek-texts-180979736/ The Smithsonian]&lt;br /&gt;
* [https://www.ox.ac.uk/news/2022-03-09-oxford-historians-apply-state-art-ai-transform-study-ancient-texts University of Oxford]&lt;br /&gt;
* [https://www.newscientist.com/article/2311657-ai-can-help-historians-restore-ancient-texts-from-damaged-inscriptions/ New Scientist]&lt;br /&gt;
* [https://www.theguardian.com/science/2022/mar/09/ai-could-decipher-gaps-in-ancient-greek-texts-say-researchers The Guardian]&lt;br /&gt;
* [https://www.thetimes.co.uk/article/ai-fills-the-gaps-in-ancient-texts-to-clear-pericles-3qccs25fv The Times]&lt;br /&gt;
* [https://www.lastampa.it/tuttoscienze/2022/04/06/news/arriva_ithaca_l_occhio_dell_ia_che_aiuta_gli_archeologi_a_riscrivere_il_passato-2914070/ La Stampa]&lt;br /&gt;
* [https://digitalculture.gov.gr/2022/03/ithaki-anakaliptontas-tin-archea-ellada-mesa-apo-tin-techniti-noimosini/ Greek Ministry of Culture]&lt;br /&gt;
* [https://www.kathimerini.gr/society/561754495/apokleistiko-ithaki-ekei-poy-i-techniti-noimosyni-synanta-tin-archaia-ellada/ Kathimerini]&lt;br /&gt;
* [https://www.lemonde.fr/sciences/article/2022/03/15/l-intelligence-artificielle-dechiffre-le-grec-ancien_6117639_1650684.html Le Monde]&lt;br /&gt;
* [https://www.unive.it/pag/14024/?tx_news_pi1%5Bnews%5D=12038&amp;amp;cHash=ee3ef41bb70b14cbda7f8dba3f65f07f Ca' Foscari University of Venice]&lt;br /&gt;
&lt;br /&gt;
==Corresponding authors==&lt;br /&gt;
* thea {dot} sommerschield {at} unive {dot} it&lt;br /&gt;
* assael {at} google {dot} com&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11185</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11185"/>
		<updated>2022-05-24T13:26:34Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian and epigrapher, my research uses machine learning to study the epigraphic cultures of the ancient Mediterranean. &lt;br /&gt;
&lt;br /&gt;
I am a Marie Skłodowska-Curie Fellow at Ca' Foscari University of Venice ([https://pric.unive.it/projects/pythiaplus/home PythiaPlus project]), collaborating with Google Deepmind (London), AUEB and the IMSI Athena Research Centre (Athens). In 2021-2022, I am a CHS Fellow in Hellenic Studies at Harvard University; in 2022-2023 I am part of the second cohort of Google Cloud Research Innovators.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study IUSS Pavia. I then moved to Oxford for my MSt in Greek History. I completed my DPhil in Ancient History at the University of Oxford in January 2021. In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://pric.unive.it/projects/pythiaplus/home PythiaPlus]&lt;br /&gt;
* [https://www.unive.it/data/persone/25105303 UniVe Profile]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11113</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11113"/>
		<updated>2021-12-27T15:37:09Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian and epigrapher, my research uses machine learning to study the epigraphic cultures of the ancient Mediterranean. &lt;br /&gt;
&lt;br /&gt;
I am a Marie Skłodowska-Curie Fellow at Ca' Foscari University of Venice ([https://pric.unive.it/projects/pythiaplus/home PythiaPlus project]), undertaking secondments at Google Deepmind (London), AUEB and the IMSI Athena Research Centre (Athens). In 2021-2022, I am a CHS Fellow in Hellenic Studies at Harvard University.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study IUSS Pavia. I then moved to Oxford for my MSt in Greek History. I completed my DPhil in Ancient History at the University of Oxford in January 2021. In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://pric.unive.it/projects/pythiaplus/home PythiaPlus]&lt;br /&gt;
* [https://www.unive.it/data/persone/25105303 UniVe Profile]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11062</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11062"/>
		<updated>2021-09-23T00:35:44Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian and epigrapher, currently CHS Fellow in Hellenic Studies at Harvard University.&lt;br /&gt;
My research uses Machine Learning to study the written cultures of the ancient Mediterranean.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study (IUSS Pavia). I then moved to Oxford for my MSt in Greek History and DPhil in Ancient History. I completed my doctorate in January 2021.&lt;br /&gt;
&lt;br /&gt;
In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome. In 2021, I am a Fellow at Harvard's Center for Hellenic Studies (CHS), where I will be carrying out my residency in the fall. Starting November 2021 I will begin a Marie Curie Fellowship at Ca' Foscari University of Venice on my project &amp;quot;PYTHIA+: Machine Learning for ancient epigraphic cultures&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
* Co-lead of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Lead of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis title: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11061</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=11061"/>
		<updated>2021-09-23T00:34:44Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: orcid added&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian, epigrapher and digital humanities scientist. My research uses Machine Learning to study the epigraphic cultures of the ancient Mediterranean.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study (IUSS Pavia). I then moved to Oxford for my MSt in Greek History and DPhil in Ancient History. I completed my doctorate in January 2021.&amp;lt;br&amp;gt;&lt;br /&gt;
In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome, and was awarded Visiting Fellowships at the Universities of Cambridge (CREWS Project), Macquarie (CACHE), and Ohio State (Center for Epigraphical and Palaeographical Studies).&amp;lt;br&amp;gt;&lt;br /&gt;
In 2021, I am Fellow in Hellenic Studies at Harvard University (CHS), Associate Researcher at the University of Oxford (Faculty of Classics) and Cultrice della materia at Ca' Foscari University of Venice. Starting November 2021 I will begin a Marie Curie Fellowship at Ca' Foscari University of Venice on my project &amp;quot;PYTHIA+: Machine Learning for ancient epigraphic cultures&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
* Co-lead of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Lead of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis title: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* [https://orcid.org/0000-0002-6965-8105 ORCiD]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10962</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10962"/>
		<updated>2021-08-11T12:16:06Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* Colab: https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
* Wikipedia page: https://en.wikipedia.org/wiki/Pythia_(machine_learning)&lt;br /&gt;
* DOI: https://www.aclweb.org/anthology/D19-1668/&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford].&lt;br /&gt;
It has also received press coverage by [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others.  &lt;br /&gt;
&lt;br /&gt;
The project has been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, at the Oxford Epigraphy Workshop, at the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza.&lt;br /&gt;
&lt;br /&gt;
The project poster is published on the [[Epigraphy.info]] website (on occasion of the V Workshop at KU Leuven) and can be accessed [http://www.lust-auf-rom.de/epigraphy_info/poster/Pythia.pdf here].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]] and [https://eadh.org/projects/pythia-deep-learning-model-automatic-restoration-greek-inscriptions EADH].&lt;br /&gt;
&lt;br /&gt;
==Presentations== &lt;br /&gt;
* Thea Sommerschield (2020), &amp;quot;PYTHIA: a deep neural network model for the automatic restoration of ancient Greek inscriptions&amp;quot;, ''Digital Classicist London Seminar''. Available: https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s&lt;br /&gt;
* Alek Keersmaekers, Marton Ribary, Thea Sommerschield (2020),  &amp;quot;Introduction to Computational Linguistics&amp;quot;, ''SunoikisisDC''. Available: https://www.youtube.com/watch?v=zjkyZUpvhAQ&lt;br /&gt;
&lt;br /&gt;
==Contact==&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10857</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10857"/>
		<updated>2021-07-29T18:31:44Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian, epigrapher and digital humanities scientist. My research uses Machine Learning to study the epigraphic cultures of the ancient Mediterranean.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study (IUSS Pavia). I then moved to Oxford for my MSt in Greek History and DPhil in Ancient History. I completed my doctorate in January 2021.&amp;lt;br&amp;gt;&lt;br /&gt;
In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome, and was awarded Visiting Fellowships at the Universities of Cambridge (CREWS Project), Macquarie (CACHE), and Ohio State (Center for Epigraphical and Palaeographical Studies).&amp;lt;br&amp;gt;&lt;br /&gt;
In 2021, I am Fellow in Hellenic Studies at Harvard University (CHS), Associate Researcher at the University of Oxford (Faculty of Classics) and Cultrice della materia at Ca' Foscari University of Venice. Starting November 2021 I will begin a Marie Curie Fellowship at Ca' Foscari University of Venice on my project &amp;quot;PYTHIA+: Machine Learning for ancient epigraphic cultures&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
* Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10691</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10691"/>
		<updated>2021-02-05T11:39:41Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: obfuscated email&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian, epigrapher and digital humanities scientist.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study (IUSS Pavia). I then moved to Oxford for my MSt in Greek History and DPhil in Ancient History. I completed my doctorate in January 2021.&amp;lt;br&amp;gt;&lt;br /&gt;
In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome, and was awarded Visiting Fellowships at the Universities of Cambridge (CREWS Project), Macquarie (CACHE), and Ohio State (Center for Epigraphical and Palaeographical Studies).&amp;lt;br&amp;gt;&lt;br /&gt;
In 2021, I am Visiting Associate in Hellenic Studies at Harvard University (CHS) and Associate Researcher at the University of Oxford (Faculty of Classics).&lt;br /&gt;
&lt;br /&gt;
* Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
My postdoctoral research project focuses on Digital Epigraphy, exploring how Machine Learning can enable large-scale, in-depth interpretation of the epigraphic cultures of the ancient Mediterranean. I am currently applying for postdocs, so if you are interested in my research and would like to know more, do get in touch!&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* thea {dot} sommerschield {at} classics {dot} ox {dot} ac {dot} uk&lt;br /&gt;
* thea {at} sommerschield {dot} it&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10690</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10690"/>
		<updated>2021-02-03T11:18:47Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: update, finished my DPhil!&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;I am a historian, epigrapher and digital humanities scientist.&lt;br /&gt;
&lt;br /&gt;
I did my BA in Classics at the University of Pavia and a parallel MA in Humanities at the School for Advanced Study (IUSS Pavia). I then moved to Oxford for my MSt in Greek History and DPhil in Ancient History. I completed my doctorate in January 2021.&amp;lt;br&amp;gt;&lt;br /&gt;
In 2020, I was the Ralegh Radford Rome Awardee at the British School at Rome, and was awarded Visiting Fellowships at the Universities of Cambridge (CREWS Project), Macquarie (CACHE), and Ohio State (Center for Epigraphical and Palaeographical Studies).&amp;lt;br&amp;gt;&lt;br /&gt;
In 2021, I am Visiting Associate in Hellenic Studies at Harvard University (CHS) and Associate Researcher at the University of Oxford (Faculty of Classics).&lt;br /&gt;
&lt;br /&gt;
* Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
My postdoctoral research project focuses on Digital Epigraphy, exploring how Machine Learning can enable large-scale, in-depth interpretation of the epigraphic cultures of the ancient Mediterranean. I am currently applying for postdocs, so if you are interested in my research and would like to know more, do get in touch!&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
* [https://theasommerschield.it theasommerschield.it]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10622</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10622"/>
		<updated>2021-01-06T10:03:33Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Finishing DPhil student in Ancient History at the University of Oxford.&lt;br /&gt;
&lt;br /&gt;
Interested in the Classical and Tech(no) side of research - and music.&lt;br /&gt;
Currently working on machine learning for Latin epigraphical text restoration and other fun stuff.&lt;br /&gt;
&lt;br /&gt;
* Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
My postdoctoral research project focuses on Digital Epigraphy, exploring how Machine Learning can enable large-scale, in-depth interpretation of the epigraphic cultures of the ancient Mediterranean. I am currently applying for postdocs, so if you are interested in my research and would like to know more, do get in touch!&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk] | [http://thea@sommerschield.it thea@sommerschield.it]&lt;br /&gt;
* [https://theasommerschield.it theasommerschield.it]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10621</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10621"/>
		<updated>2021-01-06T10:02:51Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added personal website&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Finishing DPhil student in Ancient History at the University of Oxford.&lt;br /&gt;
&lt;br /&gt;
Interested in the Classical and Tech(no) side of research - and music.&lt;br /&gt;
Currently working on machine learning for Latin epigraphical text restoration and other fun stuff.&lt;br /&gt;
&lt;br /&gt;
* Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
My postdoctoral research project focuses on Digital Epigraphy, exploring how Machine Learning can enable large-scale, in-depth interpretation of the epigraphic cultures of the ancient Mediterranean. I am currently applying for postdocs, so if you are interested in my research and would like to know more, do get in touch!&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk] | [http://thea@sommerschield.it thea@sommerschield.it]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Epigraphy.info&amp;diff=10620</id>
		<title>Epigraphy.info</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Epigraphy.info&amp;diff=10620"/>
		<updated>2020-12-27T11:52:56Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added Pythia to collaborators&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* http://epigraphy.info/&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
A Collaborative Environment for Digital Epigraphy&lt;br /&gt;
&lt;br /&gt;
==Steering committee==&lt;br /&gt;
&lt;br /&gt;
Committee 2019-20:&lt;br /&gt;
&lt;br /&gt;
* Chiara Cenati (Austrian Academy of Sciences)&lt;br /&gt;
* Tom Elliott (New York U./ Pleiades)&lt;br /&gt;
* M. Cristina de la Escosura (U. Zaragoza)&lt;br /&gt;
* Tom Gheldof (KU Leuven / Trismegistos)&lt;br /&gt;
* Jonathan Prag (Oxford University)&lt;br /&gt;
* Franziska Weise (Hamburg University)&lt;br /&gt;
* Vincent Razanajao (U. Bordeaux Montaigne / Patrimonium)&lt;br /&gt;
&lt;br /&gt;
==Mission Statement==&lt;br /&gt;
&lt;br /&gt;
Taken from the website (accessed 2019-08-06):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;&amp;lt;p&amp;gt;'''Epigraphy.info''' is an international open community pursuing a collaborative environment for digital epigraphy, which facilitates scholarly communication and interaction. We apply FAIR principles to epigraphic information in order to efficiently create, use and share it among researchers, students and enthusiasts around the globe. The Epigraphy.info community works to gather and enhance the many existing epigraphic efforts, and serves as a landing point for digital tools, practices and methodologies for managing collections of inscriptions.&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;Key concerns include:&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Identification, creation, and dissemination of agreed-upon guidelines, standards, and best practices;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Preservation, reuse, and update of existing and emerging datasets providing the most up-to-date versions of inscriptions;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Providing a toolset for searching, analyzing and editing inscriptions and their metadata by both human and machine;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Services for citation, revision and exchange.&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;Epigraphy.info will not replace existing digital resources; it intends to be a hub for a fruitful exchange of epigraphic data and digital solutions that will benefit all epigraphers.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Workshops==&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_1.html Workshop I]&lt;br /&gt;
&lt;br /&gt;
The 1st Epigraphy.info workshop took place at the Heidelberg Academy of Sciences and Humanities (21st - 23rd March 2018). It brought together leading experts in the field of digital epigraphy to discuss ideas for the creation of an open collaborative platform for Greek and Latin epigraphy.&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_2.html Workshop II]&lt;br /&gt;
&lt;br /&gt;
The 2nd Epigraphy.info workshop took place in Zadar (14th - 16th December 2018). A preliminary mission statement was defined here.&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_3.html Workshop III]&lt;br /&gt;
&lt;br /&gt;
The 3rd Epigraphy.info workshop took place in Vienna (30th May – 1st June 2019). The mission statement was approved by the community and preliminary working groups were formed to manage the tasks discussed here.&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_4.html Workshop IV]&lt;br /&gt;
&lt;br /&gt;
The 4th Epigraphy.info workshop took place in Hamburg (19th - 21st February 2020). It was preceded by a TEI/EpiDoc training (17th-19th February 2020).&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_5.html Workshop V]&lt;br /&gt;
&lt;br /&gt;
The 5th Epigraphy.info workshop took place in Leuven (4th - 6th November 2020). It was preceded by 2 workshops (3rd-4th November 2020). &lt;br /&gt;
&lt;br /&gt;
==Preliminary Partners==&lt;br /&gt;
&lt;br /&gt;
* [[Epigraphic Database Bari]] - Epigraphic documents of Christian patronage&lt;br /&gt;
* [[Epigraphische Datenbank Heidelberg]] - Latin inscriptions of the Roman Empire&lt;br /&gt;
* [[Epigraphic Database Roma]] - Epigraphic documents from Italic Regions&lt;br /&gt;
* [[Hispania Epigraphica]] - Epigraphic patrimony of Portugal and Spain&lt;br /&gt;
* [[PETRAE]] - Latin and Greek inscriptions (Institut Ausonius)&lt;br /&gt;
* [[Last Statues of Antiquity]] - Database for Late Antiquity Statues&lt;br /&gt;
* [[Roman Inscriptions of Britain]] - Epigraphic patrimony of Britain&lt;br /&gt;
* [[Ubi Erat Lupa]] - Images of stone monuments&lt;br /&gt;
* [[Epigraphik-Datenbank Clauss-Slaby]] - Database of Latin inscriptions&lt;br /&gt;
* [[EAGLE Europeana Network]] - Europeana network of Ancient Greek and Latin Epigraphy&lt;br /&gt;
* IDEA - [[International Digital Epigraphy Association]]&lt;br /&gt;
* [[Trismegistos]] - Portal of papyrological and epigraphical resources&lt;br /&gt;
* [[Epigraphische Datenbank zum antiken Kleinasien]] - Greek &amp;amp; Latin inscriptions from Turkey&lt;br /&gt;
* [[Supplementum Epigraphicum Graecum]] (SEG) - Greek inscriptions with critical apparatus&lt;br /&gt;
* [[US Epigraphy]] Project (USEP) - Collections of Greek &amp;amp; Latin inscriptions in the USA&lt;br /&gt;
* [[Inscriptions of Greek Cyrenaica]] - Online Edition of Greek inscriptions from Libya&lt;br /&gt;
* [[Ancient Graffiti Project]] - Digital resource of graffiti of Herculaneum and Pompeii&lt;br /&gt;
* [[Attic Inscriptions Online]] - Inscriptions of ancient Athens and Attica in English translation&lt;br /&gt;
* [[Greek Inscriptions Online]] - Translations of Ancient Greek inscriptions into Modern Greek&lt;br /&gt;
* [[Patrimonium]] - Geography and Economy of the Imperial Properties in the Roman World&lt;br /&gt;
* [[Epigraphica Romana]] - Recent Epigraphic Editions&lt;br /&gt;
* [[Inscriptions of Israel Palestine]] - Inscriptions of Israel/Palestine from the Persian period through the Islamic conquest&lt;br /&gt;
* [[Krateros]] - Digital repository for the collections of epigraphic squeezes&lt;br /&gt;
* [[Epigraphica 3.0]] - Digital corpus of Roman inscriptions from Ourense province&lt;br /&gt;
* [[LLDB]] - Computerized Historical Linguistic Database of the Latin Inscriptions of the Imperial Age&lt;br /&gt;
* [[Romans 1by1]] - Romans One By One&lt;br /&gt;
* [[Epigraphic Database Vernacular]] - Digital Corpus of all Non-Latin Inscriptions from Italy (9th-15th c)&lt;br /&gt;
* [[Pythia]] - A deep learning model for the automatic restoration of Greek inscriptions&lt;br /&gt;
&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:EpiDoc]]&lt;br /&gt;
[[category:Linked open data]]&lt;br /&gt;
[[category:collaboration]]&lt;br /&gt;
[[category:crowdsourcing]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:community]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10619</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10619"/>
		<updated>2020-12-27T11:49:42Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: grammar&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* Colab: https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
* Wikipedia page: https://en.wikipedia.org/wiki/Pythia_(machine_learning)&lt;br /&gt;
* DOI: https://www.aclweb.org/anthology/D19-1668/&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford].&lt;br /&gt;
It has also received press coverage by [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others.  &lt;br /&gt;
&lt;br /&gt;
The project has been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, at the Oxford Epigraphy Workshop, at the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
The project poster is published on the [[Epigraphy.info]] website (on occasion of the V Workshop at KU Leuven) and can be accessed [http://www.lust-auf-rom.de/epigraphy_info/poster/Pythia.pdf here].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]] and [https://eadh.org/projects/pythia-deep-learning-model-automatic-restoration-greek-inscriptions EADH].&lt;br /&gt;
&lt;br /&gt;
==Contact==&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10618</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10618"/>
		<updated>2020-12-27T11:48:12Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added links&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* Colab: https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
* Wikipedia page: https://en.wikipedia.org/wiki/Pythia_(machine_learning)&lt;br /&gt;
* DOI: https://www.aclweb.org/anthology/D19-1668/&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Media==&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford].&lt;br /&gt;
It has also featured in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others.  &lt;br /&gt;
&lt;br /&gt;
The project has been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, at the Oxford Epigraphy Workshop, at the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
The project poster is published on the [[Epigraphy.info]] website (on occasion of the V Workshop at KU Leuven) and can be accessed [http://www.lust-auf-rom.de/epigraphy_info/poster/Pythia.pdf here].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]] and [https://eadh.org/projects/pythia-deep-learning-model-automatic-restoration-greek-inscriptions EADH].&lt;br /&gt;
&lt;br /&gt;
==Contact==&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10617</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10617"/>
		<updated>2020-12-27T11:41:57Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* Media */ added wikipedia page&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others (see Pythia's [https://en.wikipedia.org/wiki/Pythia_(machine_learning)/ Wikipedia page] for more).  &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
The project poster is published on the [[Epigraphy.info]] website (on occasion of the V Workshop at KU Leuven) and can be accessed [http://www.lust-auf-rom.de/epigraphy_info/poster/Pythia.pdf here].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]] and [https://eadh.org/projects/pythia-deep-learning-model-automatic-restoration-greek-inscriptions EADH].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10362</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10362"/>
		<updated>2020-11-03T08:04:24Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added link&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
The project poster is published on the [[Epigraphy.info]] website (on occasion of the V Workshop at KU Leuven) and can be accessed [http://www.lust-auf-rom.de/epigraphy_info/poster/Pythia.pdf here].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]] and [https://eadh.org/projects/pythia-deep-learning-model-automatic-restoration-greek-inscriptions EADH].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10361</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10361"/>
		<updated>2020-10-29T09:22:00Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* Media */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]] and [https://eadh.org/projects/pythia-deep-learning-model-automatic-restoration-greek-inscriptions EADH].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User_talk:TheaSommerschield&amp;diff=10015</id>
		<title>User talk:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User_talk:TheaSommerschield&amp;diff=10015"/>
		<updated>2020-07-02T10:02:38Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* ISicDef */ reply&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== ISicDef ==&lt;br /&gt;
Hi Thea, would you like to create a page for [[ISicDef]] on the wiki as well? (Even if it's not online yet and won't be an independent publication once it migrates…) [[User:GabrielBodard|GabrielBodard]] ([[User talk:GabrielBodard|talk]]) 18:06, 1 July 2020 (BST)&lt;br /&gt;
&lt;br /&gt;
Hi Gabby! (I haven't figured out how to reply to threads on the Wiki yet). That sounds like a great idea, I will get to it as soon as I can - I'm still snowed under with applications and projects at the moment.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10009</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=10009"/>
		<updated>2020-07-01T13:18:48Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added bullet&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Finishing DPhil student in Ancient History at the University of Oxford.&lt;br /&gt;
&lt;br /&gt;
Interested in the Classical and Tech(no) side of research - and music.&lt;br /&gt;
Currently working on machine learning for Latin epigraphical text restoration and other fun stuff.&lt;br /&gt;
&lt;br /&gt;
* Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
* Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
* Doctoral thesis: ''“Breaking boundaries: a study of socio-cultural identities in Archaic and Classical western Sicily”''.&lt;br /&gt;
&lt;br /&gt;
My postdoctoral research project focuses on Digital Epigraphy, exploring how Machine Learning can enable large-scale, in-depth interpretation of the epigraphic cultures of the ancient Mediterranean. I am currently applying for postdocs, so if you are interested in my research and would like to know more, do get in touch!&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk] | [http://thea@sommerschield.it thea@sommerschield.it]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10008</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10008"/>
		<updated>2020-07-01T11:29:43Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Proceedings of the Conference in Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10007</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=10007"/>
		<updated>2020-07-01T11:00:39Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added link&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.ft.com/content/2b72ed2c-907b-11ea-bc44-dbf6756c871a?FTCamp=engage%2FCAPI%2F%2FChannel_signal%2F%2FB2B/ Financial Times], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=9978</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=9978"/>
		<updated>2020-06-11T11:13:21Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: added link&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Finishing DPhil student in Ancient History at the University of Oxford.&lt;br /&gt;
&lt;br /&gt;
Interested in the Classical and Tech(no) side of research - and music! Currently working on machine learning for Latin epigraphical text restoration and other fun stuff.&lt;br /&gt;
&lt;br /&gt;
Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
Author of [https://www.academia.edu/38335343/A_New_Sicilian_Curse_Corpus_A_Blueprint_for_a_Geographical_and_Chronological_Analysis_of_Defixiones_from_Sicily ISicDef] – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk] | [http://thea@sommerschield.it thea@sommerschield.it]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9977</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9977"/>
		<updated>2020-06-11T11:07:41Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* Authors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* [[User:TheaSommerschield|Thea Sommerschield]], University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=9976</id>
		<title>User:TheaSommerschield</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=User:TheaSommerschield&amp;diff=9976"/>
		<updated>2020-06-11T11:01:08Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: Thea short bio + contacts&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Finishing DPhil student in Ancient History at the University of Oxford.&lt;br /&gt;
&lt;br /&gt;
Interested in the Classical and Tech(no) side of research - and music! Currently working on machine learning for Latin epigraphical text restoration and other fun stuff.&lt;br /&gt;
&lt;br /&gt;
Co-author of [[Pythia]] –  a deep learning model for the automatic restoration of Greek inscriptions.&amp;lt;br&amp;gt;&lt;br /&gt;
Author of ISicDef – A MySQL database &amp;amp; digital corpus of Sicilian curse tablets ''(to be migrated to [[I.Sicily]]).'' &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Find me on:&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk] | [http://thea@sommerschield.it thea@sommerschield.it]&lt;br /&gt;
* [https://oxford.academia.edu/TheaSommerschield Academia.edu]&lt;br /&gt;
* [https://github.com/sommerschield GitHub]&lt;br /&gt;
* [https://scholar.google.it/citations?user=dUifv5oAAAAJ&amp;amp;hl=en Google Scholar]&lt;br /&gt;
* [https://twitter.com/TSommerschield Twitter]&lt;br /&gt;
* a Mediterranean island with good internet connection.&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9975</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9975"/>
		<updated>2020-06-11T10:45:03Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* Authors */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* Thea Sommerschield, University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9974</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9974"/>
		<updated>2020-06-11T10:44:44Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* Description */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ&lt;br /&gt;
&lt;br /&gt;
==Authors==&lt;br /&gt;
&lt;br /&gt;
* Thea Sommerschield, University of Oxford&lt;br /&gt;
* Yannis Assael, DeepMind&lt;br /&gt;
* Jonathan R. W. Prag, University of Oxford&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
'''PYTHIA: a deep learning model for the automatic restoration of Greek inscriptions''' is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&lt;br /&gt;
Pythia takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call [[PHI-ML]]. On PHI-ML, Pythia's predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of Pythia, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
&lt;br /&gt;
Article available at:&lt;br /&gt;
* [https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&lt;br /&gt;
* [https://arxiv.org/abs/1910.06262 arXiv preprint]&lt;br /&gt;
&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Pythia offline===&lt;br /&gt;
&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&lt;br /&gt;
====Dependencies====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====PHI-ML dataset generation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
====Training====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Evaluation====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Docker execution====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Media===&lt;br /&gt;
&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
===Contact===&lt;br /&gt;
&lt;br /&gt;
* [http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Epigraphy.info&amp;diff=9969</id>
		<title>Epigraphy.info</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Epigraphy.info&amp;diff=9969"/>
		<updated>2020-06-09T19:41:23Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: /* Preliminary Partners */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Available==&lt;br /&gt;
&lt;br /&gt;
* http://epigraphy.info/&lt;br /&gt;
&lt;br /&gt;
==Description==&lt;br /&gt;
&lt;br /&gt;
A Collaborative Environment for Digital Epigraphy&lt;br /&gt;
&lt;br /&gt;
==Steering committee==&lt;br /&gt;
&lt;br /&gt;
Committee 2019-20:&lt;br /&gt;
&lt;br /&gt;
* Chiara Cenati (Austrian Academy of Sciences)&lt;br /&gt;
* Tom Elliott (New York U./ Pleiades)&lt;br /&gt;
* M. Cristina de la Escosura (U. Zaragoza)&lt;br /&gt;
* Tom Gheldof (KU Leuven / Trismegistos)&lt;br /&gt;
* Andrea Mannocci (CNR Pisa / Eagle Project)&lt;br /&gt;
* Vincent Razanajao (U. Bordeaux Montaigne / Patrimonium)&lt;br /&gt;
&lt;br /&gt;
==Mission Statement==&lt;br /&gt;
&lt;br /&gt;
Taken from the website (accessed 2019-08-06):&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote&amp;gt;&amp;lt;p&amp;gt;'''Epigraphy.info''' is an international open community pursuing a collaborative environment for digital epigraphy, which facilitates scholarly communication and interaction. We apply FAIR principles to epigraphic information in order to efficiently create, use and share it among researchers, students and enthusiasts around the globe. The Epigraphy.info community works to gather and enhance the many existing epigraphic efforts, and serves as a landing point for digital tools, practices and methodologies for managing collections of inscriptions.&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;Key concerns include:&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Identification, creation, and dissemination of agreed-upon guidelines, standards, and best practices;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Preservation, reuse, and update of existing and emerging datasets providing the most up-to-date versions of inscriptions;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Providing a toolset for searching, analyzing and editing inscriptions and their metadata by both human and machine;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Services for citation, revision and exchange.&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;Epigraphy.info will not replace existing digital resources; it intends to be a hub for a fruitful exchange of epigraphic data and digital solutions that will benefit all epigraphers.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Workshops==&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_1.html Workshop I]&lt;br /&gt;
&lt;br /&gt;
The 1st Epigraphy.info workshop took place at the Heidelberg Academy of Sciences and Humanities (21st - 23rd March 2018). It brought together leading experts in the field of digital epigraphy to discuss ideas for the creation of an open collaborative platform for Greek and Latin epigraphy.&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_2.html Workshop II]&lt;br /&gt;
&lt;br /&gt;
The 2nd Epigraphy.info workshop took place in Zadar (14th - 16th December 2018). A preliminary mission statement was defined here.&lt;br /&gt;
&lt;br /&gt;
*[http://epigraphy.info/workshop_3.html Workshop III]&lt;br /&gt;
&lt;br /&gt;
The 3rd Epigraphy.info workshop took place in Vienna (30th May – 1st June 2019). The mission statement was approved by the community and preliminary working groups were formed to manage the tasks discussed here.&lt;br /&gt;
&lt;br /&gt;
*[https://doodle.com/poll/dy8k5bngr2s5e8fb Workshop IV]&lt;br /&gt;
&lt;br /&gt;
The 4th Epigraphy.info workshop will take place in Hamburg (19th - 21st February 2020). It will be preceded by a TEI/EpiDoc training (17th-19th February 2020).&lt;br /&gt;
&lt;br /&gt;
==Preliminary Partners==&lt;br /&gt;
&lt;br /&gt;
* [[Epigraphic Database Bari]] - Epigraphic documents of Christian patronage&lt;br /&gt;
* [[Epigraphische Datenbank Heidelberg]] - Latin inscriptions of the Roman Empire&lt;br /&gt;
* [[Epigraphic Database Roma]] - Epigraphic documents from Italic Regions&lt;br /&gt;
* [[Hispania Epigraphica]] - Epigraphic patrimony of Portugal and Spain&lt;br /&gt;
* [[PETRAE]] - Latin and Greek inscriptions (Institut Ausonius)&lt;br /&gt;
* [[Last Statues of Antiquity]] - Database for Late Antiquity Statues&lt;br /&gt;
* [[Roman Inscriptions of Britain]] - Epigraphic patrimony of Britain&lt;br /&gt;
* [[Ubi Erat Lupa]] - Images of stone monuments&lt;br /&gt;
* [[Epigraphik-Datenbank Clauss-Slaby]] - Database of Latin inscriptions&lt;br /&gt;
* [[EAGLE Europeana Network]] - Europeana network of Ancient Greek and Latin Epigraphy&lt;br /&gt;
* IDEA - [[International Digital Epigraphy Association]]&lt;br /&gt;
* [[Trismegistos]] - Portal of papyrological and epigraphical resources&lt;br /&gt;
* [[Epigraphische Datenbank zum antiken Kleinasien]] - Greek &amp;amp; Latin inscriptions from Turkey&lt;br /&gt;
* [[Supplementum Epigraphicum Graecum]] (SEG) - Greek inscriptions with critical apparatus&lt;br /&gt;
* [[US Epigraphy]] Project (USEP) - Collections of Greek &amp;amp; Latin inscriptions in the USA&lt;br /&gt;
* [[Inscriptions of Greek Cyrenaica]] - Online Edition of Greek inscriptions from Libya&lt;br /&gt;
* [[Ancient Graffiti Project]] - Digital resource of graffiti of Herculaneum and Pompeii&lt;br /&gt;
* [[Attic Inscriptions Online]] - Inscriptions of ancient Athens and Attica in English translation&lt;br /&gt;
* [[Greek Inscriptions Online]] - Translations of Ancient Greek inscriptions into Modern Greek&lt;br /&gt;
* [[Patrimonium]] - Geography and Economy of the Imperial Properties in the Roman World&lt;br /&gt;
* [[Epigraphica Romana]] - Recent Epigraphic Editions&lt;br /&gt;
* [[Inscriptions of Israel Palestine]] - Inscriptions of Israel/Palestine from the Persian period through the Islamic conquest&lt;br /&gt;
* [[Krateros]] - Digital repository for the collections of epigraphic squeezes&lt;br /&gt;
* [[Epigraphica 3.0]] - Digital corpus of Roman inscriptions from Ourense province&lt;br /&gt;
* [[LLDB]] - Computerized Historical Linguistic Database of the Latin Inscriptions of the Imperial Age&lt;br /&gt;
* [[Romans 1by1]] - Romans One By One&lt;br /&gt;
* [[Epigraphic Database Vernacular]] - Digital Corpus of all Non-Latin Inscriptions from Italy (9th-15th c)&lt;br /&gt;
&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:EpiDoc]]&lt;br /&gt;
[[category:Linked open data]]&lt;br /&gt;
[[category:collaboration]]&lt;br /&gt;
[[category:crowdsourcing]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:community]]&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9968</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9968"/>
		<updated>2020-06-09T19:30:59Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;div id=&amp;quot;mw-content-text&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot; class=&amp;quot;mw-content-ltr&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;div class=&amp;quot;mw-parser-output&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Available&amp;quot;&amp;gt;Available&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ PYTHIA: the deep learning model for the automatic restoration of Greek inscriptions]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Authors&amp;quot;&amp;gt;Authors&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Thea Sommerschield, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Yannis Assael, DeepMind&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Jonathan R. W. Prag, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Description&amp;quot;&amp;gt;Description&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt; is the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Bringing together the disciplines of ancient history and deep learning, this research offers a fully automated aid to the epigraphic restoration of fragmentary inscriptions, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt; takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions. The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To train the model, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;. On &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;, &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;'s predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. Follow the instructions on the Colab notebook to restore texts for your own personal research.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;References&amp;quot;&amp;gt;References&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Article available at:&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://arxiv.org/abs/1910.06262 arXiv preprint]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To quote this work:&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Pythia-offline&amp;quot;&amp;gt;Pythia offline&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
* '''Dependencies'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* '''PHI-ML dataset generation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
* '''Training'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* '''Evaluation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* '''Docker execution'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Media&amp;quot;&amp;gt;Media&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Contact&amp;quot;&amp;gt;Contact&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9967</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9967"/>
		<updated>2020-06-09T18:39:15Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;div id=&amp;quot;mw-content-text&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot; class=&amp;quot;mw-content-ltr&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;div class=&amp;quot;mw-parser-output&amp;quot;&amp;gt;&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Available&amp;quot;&amp;gt;Available&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ PYTHIA: the deep learning model for the automatic restoration of Greek inscriptions]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Authors&amp;quot;&amp;gt;Authors&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Thea Sommerschield, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Yannis Assael, DeepMind&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Jonathan R. W. Prag, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Description&amp;quot;&amp;gt;Description&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Ancient History relies on disciplines such as Epigraphy, the study of ancient inscribed texts, for evidence of the recorded past. However, these texts, &amp;quot;inscriptions&amp;quot;, are often damaged over the centuries, and illegible parts of the text must be restored by specialists, known as epigraphists. This work presents a novel assistive method for providing text restorations using deep neural networks.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt; is the first ancient text restoration model that recovers missing characters from a damaged text input.  Bringing together the disciplines of ancient history and deep learning, the present work offers a fully automated aid to the text restoration task, providing ancient historians with multiple textual restorations, as well as the confidence level for each hypothesis. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt; takes a sequence of damaged text as input, and is trained to predict character sequences comprising hypothesised restorations of ancient Greek inscriptions (texts written in the Greek alphabet dating between the seventh century BCE and the fifth century CE). The architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. This makes it applicable to all disciplines dealing with ancient texts (philology, papyrology, codicology) and applies to any language (ancient or modern).&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To train it, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;. On &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;, &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;'s predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;References&amp;quot;&amp;gt;References&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Article available at:&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://arxiv.org/abs/1910.06262 arXiv preprint]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;blockquote&amp;gt;Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Pythia-offline&amp;quot;&amp;gt;Pythia offline&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
* '''Dependencies'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* '''PHI-ML dataset generation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
* '''Training'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* '''Evaluation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* '''Docker execution'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Media&amp;quot;&amp;gt;Media&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Contact&amp;quot;&amp;gt;Contact&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[category:tools]] &lt;br /&gt;
[[category:projects]]&lt;br /&gt;
[[category:Epigraphy]]&lt;br /&gt;
[[category:corpora]]&lt;br /&gt;
[[category:dataset]]&lt;br /&gt;
[[category:deep learning]]&lt;br /&gt;
[[category:digital library]]&lt;br /&gt;
[[category:digitization]]&lt;br /&gt;
[[category:inscriptions]]&lt;br /&gt;
[[category:machine learning]]&lt;br /&gt;
[[category:NLP]]&lt;br /&gt;
[[category:neural networks]]&lt;br /&gt;
[[category:Openaccess]]&lt;br /&gt;
[[category:Opensource]]&lt;br /&gt;
[[category:repositories]]&lt;br /&gt;
[[category:text mining]]&lt;br /&gt;
[[category:web service]]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9966</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9966"/>
		<updated>2020-06-09T17:37:20Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;div id=&amp;quot;mw-content-text&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot; class=&amp;quot;mw-content-ltr&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;div class=&amp;quot;mw-parser-output&amp;quot;&amp;gt;&amp;lt;div id=&amp;quot;toc&amp;quot; class=&amp;quot;toc&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;div class=&amp;quot;toctitle&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;Contents&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-1&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Available&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;1&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Available&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-2&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Authors&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;2&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Authors&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-3&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Description&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;3&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Description&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-4&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#References&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;4&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;References&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-5&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Pythia_offline&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;5&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Pythia offline&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-6&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Media&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;65&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Media&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-7&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Contact&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;65&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Contact&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Available&amp;quot;&amp;gt;Available&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ PYTHIA: the deep learning model for the automatic restoration of Greek inscriptions]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Authors&amp;quot;&amp;gt;Authors&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Thea Sommerschield, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Yannis Assael, DeepMind&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Jonathan R. W. Prag, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Description&amp;quot;&amp;gt;Description&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Ancient History relies on disciplines such as Epigraphy, the study of ancient inscribed texts, for evidence of the recorded past. However, these texts, &amp;quot;inscriptions&amp;quot;, are often damaged over the centuries, and illegible parts of the text must be restored by specialists, known as epigraphists. This work presents a novel assistive method for providing text restorations using deep neural networks.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt; is the first ancient text restoration model that recovers missing characters from a damaged text input. Its architecture is carefully designed to handle long-term context information, and deal efficiently with missing or corrupted character and word representations. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To train it, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;. On &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;, &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;'s predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;References&amp;quot;&amp;gt;References&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Available at:&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://arxiv.org/abs/1910.06262 arXiv preprint]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Pythia-offline&amp;quot;&amp;gt;Pythia offline&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
'''Dependencies'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''PHI-ML dataset generation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
'''Training'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Evaluation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Docker execution'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Media&amp;quot;&amp;gt;Media&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Contact&amp;quot;&amp;gt;Contact&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
	<entry>
		<id>https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9965</id>
		<title>Pythia</title>
		<link rel="alternate" type="text/html" href="https://wiki.digitalclassicist.org/index.php?title=Pythia&amp;diff=9965"/>
		<updated>2020-06-09T17:19:58Z</updated>

		<summary type="html">&lt;p&gt;TheaSommerschield: Created page with &amp;quot;&amp;lt;div id=&amp;quot;mw-content-text&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot; class=&amp;quot;mw-content-ltr&amp;quot;&amp;gt; &amp;lt;div class=&amp;quot;mw-parser-output&amp;quot;&amp;gt;&amp;lt;div id=&amp;quot;toc&amp;quot; class=&amp;quot;toc&amp;quot;&amp;gt;  &amp;lt;div class=&amp;quot;toctitle&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot;...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;div id=&amp;quot;mw-content-text&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot; class=&amp;quot;mw-content-ltr&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;div class=&amp;quot;mw-parser-output&amp;quot;&amp;gt;&amp;lt;div id=&amp;quot;toc&amp;quot; class=&amp;quot;toc&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div class=&amp;quot;toctitle&amp;quot; lang=&amp;quot;en-GB&amp;quot; dir=&amp;quot;ltr&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;Contents&amp;lt;/h2&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-1&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Available&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;1&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Available&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-2&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Authors&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;2&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Authors&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-3&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Description&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;3&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Description&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-4&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#References&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;4&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;References&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-5&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Pythia_offline&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;5&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Pythia offline&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-6&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Media&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;65&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Media&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li class=&amp;quot;toclevel-1 tocsection-7&amp;quot;&amp;gt;&amp;lt;a href=&amp;quot;#Contact&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;tocnumber&amp;quot;&amp;gt;65&amp;lt;/span&amp;gt; &amp;lt;span class=&amp;quot;toctext&amp;quot;&amp;gt;Contact&amp;lt;/span&amp;gt;&amp;lt;/a&amp;gt;&amp;lt;/li&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Available&amp;quot;&amp;gt;Available&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ PYTHIA: the deep learning model for the automatic restoration of Greek inscriptions]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Authors&amp;quot;&amp;gt;Authors&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Thea Sommerschield, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Yannis Assael, DeepMind&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;Jonathan R. W. Prag, University of Oxford&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Description&amp;quot;&amp;gt;Description&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Ancient History relies on disciplines such as Epigraphy, the study of ancient inscribed texts, for evidence of the recorded past. However, these texts, &amp;quot;inscriptions&amp;quot;, are often damaged over the centuries, and illegible parts of the text must be restored by specialists, known as epigraphists. This work presents a novel assistive method for providing text restorations using deep neural networks.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
&amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt; is the first ancient text restoration model that recovers missing characters from a damaged text input. Its architecture is carefully designed to handle long-term context information, and deal efficiently with missing or corrupted character and word representations. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To train it, we wrote a non-trivial pipeline to convert [[PHI Greek Inscriptions|PHI]], the largest digital corpus of ancient Greek inscriptions, to machine actionable text, which we call &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;. On &amp;lt;i&amp;gt;PHI-ML&amp;lt;/i&amp;gt;, &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;'s predictions achieve a 30.1% character error rate, compared to the 57.3% of human epigraphists. Moreover, in 73.5% of cases the ground-truth sequence was among the Top-20 hypotheses of &amp;lt;b&amp;gt;Pythia&amp;lt;/b&amp;gt;, which effectively demonstrates the impact of such an assistive method on the field of digital epigraphy, and sets the state-of-the-art in ancient text restoration.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
To aid further research in the field we created a freely accessible [https://colab.research.google.com/drive/16RfCpZLm0M6bf3eGIA7VUPclFdW8P8pZ online interactive Python notebook], where researchers can query one of our models to get text restorations and visualise the attention weights. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Furthermore, the PHI-ML dataset and processing pipeline have been published on [https://github.com/sommerschield/ancient-text-restoration GitHub], so that any researcher may regenerate PHI-ML and train new models offline. Follow the instructions below under the section ''&amp;quot;Pythia offline&amp;quot;''.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;References&amp;quot;&amp;gt;References&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Assael, Y., Sommerschield, T., Prag, J. “Restoring Ancient Text Using Deep Learning: A Case Study on Greek Epigraphy.” In&lt;br /&gt;
''Empirical Methods in Natural Language Processing (EMNLP - IJCNLP 2019)''. Association for Computational Linguistics. 6369-6376.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Available at:&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://www.aclweb.org/anthology/D19-1668/ ACL anthology]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[https://arxiv.org/abs/1910.06262 arXiv preprint]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
When using any of this project's source code, please cite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
@inproceedings{assael2019restoring,&lt;br /&gt;
  title={Restoring ancient text using deep learning: a case study on {Greek} epigraphy},&lt;br /&gt;
  author={Assael, Yannis and Sommerschield, Thea and Prag, Jonathan},&lt;br /&gt;
  booktitle={Empirical Methods in Natural Language Processing},&lt;br /&gt;
  pages={6369--6376},&lt;br /&gt;
  year={2019}&lt;br /&gt;
}&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Pythia-offline&amp;quot;&amp;gt;Pythia offline&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
The following snippets provide references for regenerating PHI-ML and training new models offline.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
'''Dependencies'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pip install -r requirements.txt &amp;amp;&amp;amp; \&lt;br /&gt;
python -m nltk.downloader punkt&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''PHI-ML dataset generation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# Download PHI (this will take a while)&lt;br /&gt;
python -c 'import pythia.data.phi_download; pythia.data.phi_download.main()'&lt;br /&gt;
&lt;br /&gt;
# Process and generate PHI-ML&lt;br /&gt;
python -c 'import pythia.data.phi_process; pythia.data.phi_process.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Preprocessed PHI-ML uploaded by @Holger.Danske800: [https://drive.google.com/drive/folders/1PQaWYmB02Sc2OC9yokajcsw65wIcLxGD link]&lt;br /&gt;
&lt;br /&gt;
'''Training'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Evaluation'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
python -c 'import pythia.test; pythia.test.main()' --load_checkpoint=&amp;quot;your_model_path/&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''Docker execution'''&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./build.sh&lt;br /&gt;
./run.sh &amp;lt;GPU_ID&amp;gt; python -c 'import pythia.train; pythia.train.main()'&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Media&amp;quot;&amp;gt;Media&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Our research has been featured in blog posts by [https://deepmind.com/research/publications/Restoring-ancient-text-using-deep-learning-a-case-study-on-Greek-epigraphy DeepMind] and the [http://www.ox.ac.uk/news/arts-blog/restoring-ancient-greek-inscriptions-using-ai-deep-learning University of Oxford], and has been published in articles on [https://www.thetimes.co.uk/article/oracle-of-ai-solves-classic-conundrums-v6pdtsps0 The Times], [https://www.newscientist.com/article/2220438-deepmind-ai-beats-humans-at-deciphering-damaged-ancient-greek-tablets/ NewScientist], [https://www.repubblica.it/tecnologia/2019/10/24/news/non_solo_go_l_ai_di_google_batte_anche_gli_archeologi-239393979/ La Repubblica], [https://www.kathimerini.gr/1047963/article/epikairothta/episthmh/py8ia-o-gonos-ths-google-kai-ellhna-ereynhth-poy-diavazei-misokatestrammenes-arxaies-epigrafes Ekathimerini], [https://techcrunch.com/2019/10/18/ai-is-helping-scholars-restore-ancient-greek-texts-on-stone-tablets/ TechCrunch] and others. &lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
It has also been presented at [https://www.emnlp-ijcnlp2019.org/ EMNLP - IJCNLP 2019] in Hong Kong, at the British School at Rome, the Oxford Epigraphy Workshop, the OIKOS National Research School in Classical Studies, during lectures at Venice Ca' Foscari and Rome La Sapienza, and most recently at the [https://www.youtube.com/watch?v=nKSfzHYmLtQ&amp;amp;t=26s Digital Classicist London Seminar].&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p&amp;gt;&lt;br /&gt;
Pythia is a partner project of [[epigraphy.info]].&lt;br /&gt;
&lt;br /&gt;
&amp;lt;h2&amp;gt;&lt;br /&gt;
&amp;lt;span class=&amp;quot;mw-headline&amp;quot; id=&amp;quot;Contact&amp;quot;&amp;gt;Contact&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/h2&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;ul&amp;gt;&lt;br /&gt;
&amp;lt;li&amp;gt;[http://thea.sommerschield@classics.ox.ac.uk thea.sommerschield@classics.ox.ac.uk]&amp;lt;/li&amp;gt;&lt;br /&gt;
&amp;lt;/ul&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;/div&gt;</summary>
		<author><name>TheaSommerschield</name></author>
	</entry>
</feed>