Preservation of Original Orthography in the Construction of an Old Irish Corpus

Main Authors: Adrian Doyle, John P. McCrae, Clodagh Downey
Format: Proceeding
Terbitan: , 2018
Online Access: https://zenodo.org/record/2599925
ctrlnum 2599925
fullrecord <?xml version="1.0"?> <dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><creator>Adrian Doyle</creator><creator>John P. McCrae</creator><creator>Clodagh Downey</creator><date>2018-05-12</date><description>This paper will examine the process of creating a digital corpus based on the W&#xFC;rzburg glosses, the earliest large collection of glosses written in the Irish language. Modern editorial standards applied in publications of these glosses can alter spelling, punctuation, and even the semantic meaning of a sentence where one word is used in place of another. Therefore, an understanding of the original orthography utilised by Old Irish scribes is important in determining the orthography which should be utilised in a modern digital corpus. This paper will outline why the text of the W&#xFC;rzburg glosses as it appears in Thesaurus Palaeohibernicus is the best candidate for digitisation. The automated digitisation and proofing process of the corpus will be outlined, and details will be given of a tag-set utilised within the digital corpus in order to preserve information present in Thesaurus Palaeohibernicus as metadata.</description><identifier>https://zenodo.org/record/2599925</identifier><identifier>10.5281/zenodo.2599925</identifier><identifier>oai:zenodo.org:2599925</identifier><relation>doi:10.5281/zenodo.2599924</relation><rights>info:eu-repo/semantics/openAccess</rights><rights>https://creativecommons.org/licenses/by/4.0/legalcode</rights><title>Preservation of Original Orthography in the Construction of an Old Irish Corpus</title><type>Journal:Proceeding</type><type>Journal:Proceeding</type><recordID>2599925</recordID></dc>
format Journal:Proceeding
Journal
author Adrian Doyle
John P. McCrae
Clodagh Downey
title Preservation of Original Orthography in the Construction of an Old Irish Corpus
publishDate 2018
url https://zenodo.org/record/2599925
contents This paper will examine the process of creating a digital corpus based on the Würzburg glosses, the earliest large collection of glosses written in the Irish language. Modern editorial standards applied in publications of these glosses can alter spelling, punctuation, and even the semantic meaning of a sentence where one word is used in place of another. Therefore, an understanding of the original orthography utilised by Old Irish scribes is important in determining the orthography which should be utilised in a modern digital corpus. This paper will outline why the text of the Würzburg glosses as it appears in Thesaurus Palaeohibernicus is the best candidate for digitisation. The automated digitisation and proofing process of the corpus will be outlined, and details will be given of a tag-set utilised within the digital corpus in order to preserve information present in Thesaurus Palaeohibernicus as metadata.
id IOS16997.2599925
institution DEFAULT
institution_type library:public
library
library DEFAULT
collection DEFAULT
city DEFAULT
province DEFAULT
repoId IOS16997
first_indexed 2022-06-06T03:41:31Z
last_indexed 2022-06-06T03:41:31Z
recordtype dc
merged_child_boolean 1
_version_ 1739478958284472320
score 17.204899