Difference between revisions of "XML Isidore Guidelines"
(First version) |
m |
||
(2 intermediate revisions by the same user not shown) | |||
Line 2: | Line 2: | ||
− | For the encoding the glosses the Etymologiae, we use | + | For the encoding the glosses in the Etymologiae, we use these guidelines. |
==Witnesses== | ==Witnesses== | ||
− | For each of the manuscripts | + | For each of the manuscripts for which the glosses have been transcribed, we describe its location in an <msDesc>-element |
− | element in the <sourceDesc>. We also describe the relevant hands in a <handNote> in a <handDesc> in the < | + | in the <sourceDesc>. We also describe the relevant hands in a <handNote> in a <handDesc> in the <physDesc>. |
For each hand we give a <date>. | For each hand we give a <date>. | ||
Line 30: | Line 30: | ||
==Isidore's text== | ==Isidore's text== | ||
Isidore's text is given in the <text> <body>. Only Book I is present. Each chapter is a div, each paragraph an <ab>. | Isidore's text is given in the <text> <body>. Only Book I is present. Each chapter is a div, each paragraph an <ab>. | ||
− | <pb> elements provide the link to | + | <pb> elements provide the link to <surface>s in the various manuscripts using the @corresp attribute. Glossed text fragments are |
− | identified using <seg> elements. Their @xml:id attributes are by the <glossGrp>s further on in the file. | + | identified using <seg> elements. Their @xml:id attributes are used by the <glossGrp>s further on in the file. |
For example: | For example: | ||
<pre> | <pre> | ||
<div n="II" type="chapter"> | <div n="II" type="chapter"> | ||
− | <head | + | <head><seg xml:id="L2.1.1">DE SEPTEM LIBERALIBVS |
DISCIPLINIS.</seg></head> | DISCIPLINIS.</seg></head> | ||
<ab n="1">Disciplinae <seg xml:id="L2.1.2">liberalium artium</seg> septem sunt. Prima | <ab n="1">Disciplinae <seg xml:id="L2.1.2">liberalium artium</seg> septem sunt. Prima | ||
Line 53: | Line 53: | ||
Quotations in Isidore's text are identified using the <quote>-element. The <quote>'s @source | Quotations in Isidore's text are identified using the <quote>-element. The <quote>'s @source | ||
− | attribute points to | + | attribute points to an element in the list of works (<listBibl>) in the <standoff>-element |
− | For | + | For example: |
<pre> | <pre> | ||
<quote source="#ApocalypsisI8">Ego sum <seg xml:id="L3.9.8">Α</seg> et Ω.</quote> | <quote source="#ApocalypsisI8">Ego sum <seg xml:id="L3.9.8">Α</seg> et Ω.</quote> | ||
Line 62: | Line 62: | ||
==The glosses== | ==The glosses== | ||
The glosses are defined by <gloss>-elements within <hi:glossGrp>s referring to <seg>s in the text. All <hi:glossGrp>s | The glosses are defined by <gloss>-elements within <hi:glossGrp>s referring to <seg>s in the text. All <hi:glossGrp>s | ||
− | are contained in a <hi:glossGrpList> in the file's <standoff>-element. | + | are contained in a <hi:glossGrpList> in the file's <standoff>-element. The glosses use the @corresp-attribute to point to |
the manuscript that contains them and the @hand attribute to point to the responsible hand. The <gloss>'s @weight-attribute | the manuscript that contains them and the @hand attribute to point to the responsible hand. The <gloss>'s @weight-attribute | ||
gives a researcher-assigned weight for the non-triviality of the gloss. | gives a researcher-assigned weight for the non-triviality of the gloss. | ||
Line 69: | Line 69: | ||
<linkGrp>s are, by convention, placed in the <glossGrp> that contains the <gloss>s that most of its <ptr>-members point to. | <linkGrp>s are, by convention, placed in the <glossGrp> that contains the <gloss>s that most of its <ptr>-members point to. | ||
<linkGrp>s have an @ana-attribute that can be used to group related groups of similarities. It points to groups that | <linkGrp>s have an @ana-attribute that can be used to group related groups of similarities. It points to groups that | ||
− | can be | + | can be described (but as yet are not) in an <interpGrp> at the bottom of the file. |
For example: | For example: |
Latest revision as of 10:53, 25 March 2021
For the encoding the glosses in the Etymologiae, we use these guidelines.
Witnesses
For each of the manuscripts for which the glosses have been transcribed, we describe its location in an <msDesc>-element in the <sourceDesc>. We also describe the relevant hands in a <handNote> in a <handDesc> in the <physDesc>. For each hand we give a <date>.
For example:
<msDesc xml:id="Clm4541"> <msIdentifier> <settlement>Munich</settlement> <institution>Bayerische Staatsbibliothek</institution> <idno>Clm 4541</idno> </msIdentifier> <physDesc> <handDesc> <handNote xml:id="Clm4541_main">main hand<date when-iso="800/900">9th century</date></handNote> <handNote xml:id="Clm4541_g1">glossing hand 1<date when-iso="1000/1100">11th century</date></handNote> </handDesc> </physDesc> </msDesc>
Isidore's text
Isidore's text is given in the <text> <body>. Only Book I is present. Each chapter is a div, each paragraph an <ab>. <pb> elements provide the link to <surface>s in the various manuscripts using the @corresp attribute. Glossed text fragments are identified using <seg> elements. Their @xml:id attributes are used by the <glossGrp>s further on in the file.
For example:
<div n="II" type="chapter"> <head><seg xml:id="L2.1.1">DE SEPTEM LIBERALIBVS DISCIPLINIS.</seg></head> <ab n="1">Disciplinae <seg xml:id="L2.1.2">liberalium artium</seg> septem sunt. Prima <seg xml:id="L2.1.3">grammatica</seg>, id est loquendi <seg xml:id="L2.1.4" >peritia</seg>. Secunda <seg xml:id="L2.1.5">rhetorica</seg>, quae propter <seg xml:id="L2.1.6">nitorem</seg> et <seg xml:id="L2.1.7">copiam</seg> <seg xml:id="L2.1.8">eloquentiae</seg> suae maxime in <seg xml:id="L2.1.9" >civilibus</seg> quaestionibus necessaria <seg xml:id="L2.1.10" >existimatur</seg>. Tertia <seg xml:id="L2.1.11">dialectica</seg> <pb corresp="#Harley3941_2v"/> ... </ab> ... </div>
Quotations in Isidore's text are identified using the <quote>-element. The <quote>'s @source attribute points to an element in the list of works (<listBibl>) in the <standoff>-element
For example:
<quote source="#ApocalypsisI8">Ego sum <seg xml:id="L3.9.8">Α</seg> et Ω.</quote>
The glosses
The glosses are defined by <gloss>-elements within <hi:glossGrp>s referring to <seg>s in the text. All <hi:glossGrp>s are contained in a <hi:glossGrpList> in the file's <standoff>-element. The glosses use the @corresp-attribute to point to the manuscript that contains them and the @hand attribute to point to the responsible hand. The <gloss>'s @weight-attribute gives a researcher-assigned weight for the non-triviality of the gloss.
Similarity between glosses is defined by <linkGrp>s. A <linkGrp> contains pointers (<ptr>) to similar glosses. <linkGrp>s are, by convention, placed in the <glossGrp> that contains the <gloss>s that most of its <ptr>-members point to. <linkGrp>s have an @ana-attribute that can be used to group related groups of similarities. It points to groups that can be described (but as yet are not) in an <interpGrp> at the bottom of the file.
For example:
<hi:glossGrp target="#L37.35.17" witCount="5"> <linkGrp ana="#E" type="simgroup"> <ptr target="#L37.35.17_2"/> <ptr target="#L37.35.17_5"/> </linkGrp> <linkGrp ana="#F" type="simgroup"> <ptr target="#L37.35.17_3"/> <ptr target="#L37.35.17_4"/> </linkGrp> <gloss xml:id="L37.35.17_1" corresp="#Bologna797">id est Eneas</gloss> <gloss xml:id="L37.35.17_2" corresp="#GothaI147" weight="4">id est Aenias de patre suo dicit</gloss> <gloss xml:id="L37.35.17_3" corresp="#Harley3941" weight="4">Aeneas dixit quando voluit visitare patrem suum in infer[no]</gloss> <gloss xml:id="L37.35.17_4" corresp="#Orleans296" weight="4">Aeneas dixit quando voluit visitare patrem suum in inferno</gloss> <gloss xml:id="L37.35.17_5" corresp="#Paris7585" weight="4">scilicet Aeneas de patre suo dicit</gloss> </hi:glossGrp>
Textual details
Gaps in the text are encoded as <gap>, unclear passages as <unclear>, corrections as (see example):
<choice> <orig>partiones</orig> <corr>partitiones</corr> </choice>