Text structure (ePistolarium)

Revision as of 12:01, 9 January 2018 by 194.171.144.181 (Talk)

The documents in the ePistolarium have the following basic structure:

<text>
<body>
<div xml:id="div-1" type="artifact" subtype="letter">
...
</div>
<div xml:id="div-2" type="para">
...
</div>
<div xml:id="div-3" type="notes">
...
</div>
</body>
</text>

The text and body elements serve as containers for the actual text. The TEI Guidelines allow front and back elements as siblings of body, but the ePistolarium does not use them.

The type attribute of the div element distinguishes the various parts of the text. The most important values are artifact, for the 'real' letter text, and notes, for the text of (editorial) notes.
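As an illustration of how the type attribute can be used, the following minimal sketch picks out the top-level div elements of a given type with Python's standard xml.etree.ElementTree module. The sample document, its paragraph contents and the helper name divs_by_type are assumptions made for this example. The TEI namespace is left out for brevity, so real ePistolarium files would probably need namespace-qualified tag names.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample that follows the basic structure shown above.
# The <p> contents are invented; the TEI namespace is omitted for brevity.
SAMPLE = """<text>
  <body>
    <div xml:id="div-1" type="artifact" subtype="letter"><p>Letter text</p></div>
    <div xml:id="div-2" type="para"><p>Other text</p></div>
    <div xml:id="div-3" type="notes"><p>Note text</p></div>
  </body>
</text>"""

# ElementTree expands the predefined xml: prefix to this namespace URI.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def divs_by_type(root, div_type):
    """Return the div children of body whose type attribute equals div_type."""
    return [d for d in root.findall("./body/div") if d.get("type") == div_type]


root = ET.fromstring(SAMPLE)
letter_ids = [d.get(XML_ID) for d in divs_by_type(root, "artifact")]
note_ids = [d.get(XML_ID) for d in divs_by_type(root, "notes")]
print(letter_ids, note_ids)
```

Filtering on the type attribute in this way keeps the letter text and the editorial notes apart without relying on the order of the div elements.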


Subsections

A div element may contain other div elements. Such nested divisions often start with a head element that holds the heading of the subsection.

Example: https://correspondence.huygens.knaw.nl/documents/920e5785-62d4-4108-b126-8caa1df370b8 (subdivisions)

<div type="artifact" subtype="letter">
<div type="section">
<head>Propositio 5.</head>
<p><figure><graphic url="huyg003oeuv01ill19.gif"/></figure></p>
<p>Si il y a tant de gravitez qu'on veut ...</p>
</div>
<div type="section">
<head>Propositio 6.</head>
<p><figure><graphic url="huyg003oeuv01ill20.gif"/></figure></p>
<p>Eadem methodo probatur si ...</p>
</div>
...
</div>
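A nested structure like the one above can be walked recursively. The sketch below is one possible way to list the headings of all nested div elements together with their depth. The function name section_heads is hypothetical, the sample is an abridged copy of the example (figures left out), and the TEI namespace is again omitted.

```python
import xml.etree.ElementTree as ET

# Abridged copy of the example above (figures left out, no TEI namespace).
SAMPLE = """<div type="artifact" subtype="letter">
  <div type="section">
    <head>Propositio 5.</head>
    <p>Si il y a tant de gravitez qu'on veut ...</p>
  </div>
  <div type="section">
    <head>Propositio 6.</head>
    <p>Eadem methodo probatur si ...</p>
  </div>
</div>"""


def section_heads(div):
    """Return (depth, heading) pairs for all nested div elements, depth-first.

    A div without a head element yields an empty heading.
    """
    def walk(node, depth):
        for child in node.findall("div"):
            yield depth, child.findtext("head", default="")
            yield from walk(child, depth + 1)

    return list(walk(div, 1))


heads = section_heads(ET.fromstring(SAMPLE))
for depth, heading in heads:
    print("  " * (depth - 1) + heading)
```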


Paragraphs

Paragraphs are encoded with the p element. In the ePistolarium the p element only occurs in div elements, and p elements may not be nested: a p element inside another p element is not allowed. Note that this is much stricter than what the TEI Guidelines require. The reason for the restriction is that some analysis methods used by the ePistolarium take the paragraph as their unit of analysis.
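The two restrictions on paragraphs can be checked mechanically. The following sketch is not the validation used by the ePistolarium; it is a minimal illustration under two assumptions: "occurs in div elements" is read as "is a direct child of a div", and the TEI namespace is omitted. The function name paragraph_violations and the test fragments are hypothetical.

```python
import xml.etree.ElementTree as ET


def paragraph_violations(root):
    """Return one message per broken rule for every p element under root.

    Assumed rules: a p must be a direct child of a div, and a p must not
    have another p among its ancestors.
    """
    # ElementTree has no parent pointers, so build a child -> parent map.
    parent_of = {child: parent for parent in root.iter() for child in parent}
    problems = []
    for p in root.iter("p"):
        parent = parent_of.get(p)
        if parent is None or parent.tag != "div":
            problems.append("p is not a direct child of div")
        ancestor = parent
        while ancestor is not None:
            if ancestor.tag == "p":
                problems.append("p is nested inside another p")
                break
            ancestor = parent_of.get(ancestor)
    return problems


good = ET.fromstring("<div><p>a</p><p>b</p></div>")
bad = ET.fromstring("<div><p>a<p>b</p></p></div>")
print(paragraph_violations(good))
print(paragraph_violations(bad))
```

A document that passes both checks can be split into analysis units simply by iterating over its p elements, since no paragraph then contains another one.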