XML Isidore Guidelines

From XML
Revision as of 16:53, 1 February 2021 by PBO (Talk | contribs) (First version)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


For the encoding the glosses the Etymologiae, we use the following guidelines

Witnesses

For each of the manuscripts fow which the glosses have been transcribed, we describe its location in an <msDesc> element in the <sourceDesc>. We also describe the relevant hands in a <handNote> in a <handDesc> in the <physDescDesc>. For each hand we give a <date>.

For example:

<msDesc xml:id="Clm4541">
   <msIdentifier>
      <settlement>Munich</settlement>
      <institution>Bayerische Staatsbibliothek</institution>
      <idno>Clm 4541</idno>
   </msIdentifier>
   <physDesc>
      <handDesc>
         <handNote xml:id="Clm4541_main">main hand<date when-iso="800/900">9th
               century</date></handNote>
         <handNote xml:id="Clm4541_g1">glossing hand 1<date when-iso="1000/1100">11th
               century</date></handNote>
      </handDesc>
   </physDesc>
</msDesc>

Isidore's text

Isidore's text is given in the <text> <body>. Only Book I is present. Each chapter is a div, each paragraph an <ab>. <pb> elements provide the link to surfaces in the various manuscripts using the @corresp attribute. Glossed text fragments are identified using <seg> elements. Their @xml:id attributes are by the <glossGrp>s further on in the file.

For example:

<div n="II" type="chapter">
   <head rend="script(ruscaps)"><seg xml:id="L2.1.1">DE SEPTEM LIBERALIBVS
         DISCIPLINIS.</seg></head>
   <ab n="1">Disciplinae <seg xml:id="L2.1.2">liberalium artium</seg> septem sunt. Prima
         <seg xml:id="L2.1.3">grammatica</seg>, id est loquendi <seg xml:id="L2.1.4"
         >peritia</seg>. Secunda <seg xml:id="L2.1.5">rhetorica</seg>, quae propter <seg
         xml:id="L2.1.6">nitorem</seg> et <seg xml:id="L2.1.7">copiam</seg>
      <seg xml:id="L2.1.8">eloquentiae</seg> suae maxime in <seg xml:id="L2.1.9"
         >civilibus</seg> quaestionibus necessaria <seg xml:id="L2.1.10"
         >existimatur</seg>. Tertia <seg xml:id="L2.1.11">dialectica</seg>
   <pb corresp="#Harley3941_2v"/>
      ...
   </ab>
   ...
</div>

Quotations in Isidore's text are identified using the <quote>-element. The <quote>'s @source attribute points to a list of works (<listBibl>) in the <standoff>-element

For: example:

<quote source="#ApocalypsisI8">Ego sum <seg xml:id="L3.9.8">Α</seg> et Ω.</quote>

The glosses

The glosses are defined by <gloss>-elements within <hi:glossGrp>s referring to <seg>s in the text. All <hi:glossGrp>s are contained in a <hi:glossGrpList> in the file's <standoff>-element. They use the @corresp-attribute to point to the manuscript that contains them and the @hand attribute to point to the responsible hand. The <gloss>'s @weight-attribute gives a researcher-assigned weight for the non-triviality of the gloss.

Similarity between glosses is defined by <linkGrp>s. A <linkGrp> contains pointers (<ptr>) to similar glosses. <linkGrp>s are, by convention, placed in the <glossGrp> that contains the <gloss>s that most of its <ptr>-members point to. <linkGrp>s have an @ana-attribute that can be used to group related groups of similarities. It points to groups that can be defined in an <interpGrp> at the bottom of the file.

For example:

<hi:glossGrp target="#L37.35.17" witCount="5">
   <linkGrp ana="#E" type="simgroup">
      <ptr target="#L37.35.17_2"/>
      <ptr target="#L37.35.17_5"/>
   </linkGrp>
   <linkGrp ana="#F" type="simgroup">
      <ptr target="#L37.35.17_3"/>
      <ptr target="#L37.35.17_4"/>
   </linkGrp>
   <gloss xml:id="L37.35.17_1" corresp="#Bologna797">id est Eneas</gloss>
   <gloss xml:id="L37.35.17_2" corresp="#GothaI147" weight="4">id est Aenias de patre suo
      dicit</gloss>
   <gloss xml:id="L37.35.17_3" corresp="#Harley3941" weight="4">Aeneas dixit quando voluit
      visitare patrem suum in infer[no]</gloss>
   <gloss xml:id="L37.35.17_4" corresp="#Orleans296" weight="4">Aeneas dixit quando voluit
      visitare patrem suum in inferno</gloss>
   <gloss xml:id="L37.35.17_5" corresp="#Paris7585" weight="4">scilicet Aeneas de patre suo
      dicit</gloss>
</hi:glossGrp>

Textual details

Gaps in the text are encoded as <gap>, unclear passages as <unclear>, corrections as (see example):

<choice>
   <orig>partiones</orig>
   <corr>partitiones</corr>
</choice>