<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.tei-c.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Gremid</id>
	<title>TEIWiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.tei-c.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Gremid"/>
	<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Special:Contributions/Gremid"/>
	<updated>2026-04-18T21:43:26Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.32.0</generator>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10148</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10148"/>
		<updated>2011-10-14T12:19:08Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Malte Rehbein (MR)&lt;br /&gt;
* Oliver Gasperlin (OG)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
* ''(many more, please add your name here)''&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;br /&gt;
* EP: SIG &amp;quot;Text-Bearing Objects&amp;quot; might be a good place to address this&lt;br /&gt;
* idea about further modularizing the TEI, e.g. via use of different namespaces, address different use cases this way&lt;br /&gt;
* MR: problem of SIG's scope, lots of interest in manuscripts, but how to derive concrete tasks from that, maybe should come up with a new definition of what the SIG should do&lt;br /&gt;
* OG: supports idea of asking for micropapers before SIG meetings&lt;br /&gt;
&lt;br /&gt;
[[Category:SIG|MSS]]&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10144</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10144"/>
		<updated>2011-10-14T08:30:58Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Malte Rehbein (MR)&lt;br /&gt;
* Oliver Gasperlin (OG)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
* ''(many more, please add your name here)''&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;br /&gt;
* EP: SIG &amp;quot;Text-Bearing Objects&amp;quot; might be a good place to address this&lt;br /&gt;
* idea about further modularizing the TEI, e.g. via use of different namespaces, address different use cases this way&lt;br /&gt;
* MR: problem of SIG's scope, lots of interest in manuscripts, but how to derive concrete tasks from that, maybe should come up with a new definition of what the SIG should do&lt;br /&gt;
* OG: supports idea of asking for micropapers before SIG meetings&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10143</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10143"/>
		<updated>2011-10-14T08:28:53Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Malte Rehbein (MR)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;br /&gt;
* EP: SIG &amp;quot;Text-Bearing Objects&amp;quot; might be a good place to address this&lt;br /&gt;
* idea about further modularizing the TEI, e.g. via use of different namespaces, address different use cases this way&lt;br /&gt;
* MR: problem of SIG's scope, lots of interest in manuscripts, but how to derive concrete tasks from that, maybe should come up with a new definition of what the SIG should do&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10142</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10142"/>
		<updated>2011-10-14T08:25:25Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;br /&gt;
* EP: SIG &amp;quot;Text-Bearing Objects&amp;quot; might be a good place to address this&lt;br /&gt;
* idea about further modularizing the TEI, e.g. via use of different namespaces, address different use cases this way&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10141</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10141"/>
		<updated>2011-10-14T08:24:45Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;br /&gt;
* EP: SIG &amp;quot;Text-Bearing Objects&amp;quot; might be a good place to address this&lt;br /&gt;
* idea about further modularizing the TEI, e.g. via use of different namespaces&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10140</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10140"/>
		<updated>2011-10-14T08:19:50Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;br /&gt;
* EP: SIG &amp;quot;Text-Bearing Objects&amp;quot; might be a good place to address this&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10139</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10139"/>
		<updated>2011-10-14T08:17:07Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, not a problem of manuscripts alone, maybe some conceptual overlap with manuscript description&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10138</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10138"/>
		<updated>2011-10-14T08:16:18Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Joachim Veit (JV)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;br /&gt;
* JV: structure of text is sometimes hard to describe with existing elements, maybe some conceptual overlap with manuscript description&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10137</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10137"/>
		<updated>2011-10-14T08:07:16Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;br /&gt;
* TS: refering to ENRICH again, there are existing resources and documentation for manuscript description, including conversion scenarios (TEI, EAD, ...)&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10136</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10136"/>
		<updated>2011-10-14T08:04:46Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Ron Van den Branden (RB)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;br /&gt;
* RB: we could extend [http://tbe.kantl.be/TBE/ TEI by Example] with manuscript-related resources&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10135</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10135"/>
		<updated>2011-10-14T07:59:32Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Christian Wittern (CW)&lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;br /&gt;
* CW: there are – apart from the SIG – means to address such issues, for example the [http://sourceforge.net/tracker/?atid=644065&amp;amp;group_id=106328&amp;amp;func=browse Sourceforge Issue Tracker]&lt;br /&gt;
* EP: SIG can act as a proxy though&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10134</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10134"/>
		<updated>2011-10-14T07:57:43Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;br /&gt;
* support for marginal notes in the Guidelines/ SIG proposal&lt;br /&gt;
* manuscript description: &amp;amp;lt;summary/&amp;gt; content model is not flexible enough for more complex descriptions (e.g. &amp;amp;lt;p/&amp;gt;)&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10133</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10133"/>
		<updated>2011-10-14T07:54:25Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
...&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10132</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10132"/>
		<updated>2011-10-14T07:53:38Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
* Gregor Middell (GM) – minutes&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10131</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10131"/>
		<updated>2011-10-14T07:53:18Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== What should we do, what could you do? ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10130</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10130"/>
		<updated>2011-10-14T07:51:53Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* Torsten Schaßan (TS) &lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;br /&gt;
&lt;br /&gt;
== Feedback, possible contributions ==&lt;br /&gt;
&lt;br /&gt;
* TS: ODDs from [http://enrich.manuscriptorium.com/ ENRICH project] available&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10129</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10129"/>
		<updated>2011-10-14T07:47:57Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* ... &lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;br /&gt;
* call for participation&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10128</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10128"/>
		<updated>2011-10-14T07:46:49Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* ... &lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via [http://listserv.brown.edu/tei-ms-sig.html mailing list], [[SIG:MSS|wiki]] and [http://www.tei-c.org/SIG/Manuscripts/ web-site]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10127</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10127"/>
		<updated>2011-10-14T07:43:34Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
* ... &lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via mailing list and Wiki&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10126</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10126"/>
		<updated>2011-10-14T07:43:20Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available, communication via mailing list and Wiki&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10125</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10125"/>
		<updated>2011-10-14T07:42:39Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
&lt;br /&gt;
== Introduction (EP) ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;br /&gt;
* organizational matters, some financial support by the TEI available&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10124</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10124"/>
		<updated>2011-10-14T07:40:27Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Participants ==&lt;br /&gt;
&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Marjorie Burghardt (MB)&lt;br /&gt;
&lt;br /&gt;
== Introduction ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* EP: explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs&lt;br /&gt;
* EP: outline of current activities&lt;br /&gt;
** '''Genetic editing''': Council adopts recommendations of the SIG WG, will be circulated for the first time shortly; likely to be included in the next release of the Guidelines&lt;br /&gt;
** '''Critical Apparatus''': chapter in the Guidelines has not been revised since P4, automatic collation vs. manual construction of a CA, question of relationship between the two, MB appointed as leader of a WG to improve on this situation&lt;br /&gt;
** '''Manuscript description''': extension of existing means with dimensions like time, geospatial information etc., problem of scope (to what extent are we talking about manuscripts or about cultural artifacts in general)&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10123</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10123"/>
		<updated>2011-10-14T07:32:28Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Introduction ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* EP: explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...), willingness to contribute vs. ability to do so on a persistent basis, problem of time constraints, financial and other resources as well as attribution of work in the SIGs, outline of current activities&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10122</id>
		<title>SIGMS Minutes 20111014</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIGMS_Minutes_20111014&amp;diff=10122"/>
		<updated>2011-10-14T07:30:22Z</updated>

		<summary type="html">&lt;p&gt;Gremid: New page: == Introduction ==  * introduction of participants * EP: explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...) * EP: willingness to contribute vs. ability t...&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Introduction ==&lt;br /&gt;
&lt;br /&gt;
* introduction of participants&lt;br /&gt;
* EP: explains relationship and interdependencies to other SIGs (Facsimile, Libraries, ...)&lt;br /&gt;
* EP: willingness to contribute vs. ability to do so on a persistent basis, attribution of work in the SIGs&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=SIG:MSS&amp;diff=10121</id>
		<title>SIG:MSS</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=SIG:MSS&amp;diff=10121"/>
		<updated>2011-10-14T07:27:46Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== TEI Special Interest Group on Manuscripts (TEI MS SIG) ==&lt;br /&gt;
&lt;br /&gt;
=== Agenda of this SIG ===&lt;br /&gt;
*revision of the chapter on critical apparatus&lt;br /&gt;
** see working group on [[Critical_Apparatus_Workgroup|Critical Apparatus]]/ [[Textual Variance]]&lt;br /&gt;
*revision of the chapter on manuscript description&lt;br /&gt;
**see working group on [[msDesc enhancement|MSS Description]]&lt;br /&gt;
*drafting a chapter (or section) on markup of genetic editions&lt;br /&gt;
**see working group on [[Genetic Editions]]&lt;br /&gt;
&lt;br /&gt;
=== News ===&lt;br /&gt;
&lt;br /&gt;
* [[SIGMS_Minutes_20111014|Minutes SIG meeting, Würzburg 14 October 2011]]&lt;br /&gt;
* [[SIGMS_Minutes_20101112|Minutes SIG meeting, Zadar 12 November 2010]]&lt;br /&gt;
* [[SIGMS_Minutes_20091114|Minutes SIG meeting, Ann Arbor 14 November 2009]]&lt;br /&gt;
* [[SIGMS_Agenda_20081108|Agenda SIG meeting, London 8 November 2008]]&lt;br /&gt;
&lt;br /&gt;
=== Introduction ===&lt;br /&gt;
&lt;br /&gt;
The goal of the TEI Special Interest Group on Manuscripts is to bring together users of the TEI who wish to improve the encoding strategies for marking up transcriptions and editions of manuscript materials.&lt;br /&gt;
&lt;br /&gt;
This SIG will explore a range of issues common to editing manuscripts, including: &lt;br /&gt;
# how to handle time based encoding&lt;br /&gt;
# how to record place based encoding&lt;br /&gt;
# how to encode fragments&lt;br /&gt;
# how to record codicology (the substance of the medium, ink stints, etc); Robinson noted that the TEI already has mechanisms to record this, but it needs to be better documented&lt;br /&gt;
# issues of substitutions&lt;br /&gt;
# issues of variation&lt;br /&gt;
# to clarify the role of using the critical apparatus tagset in manuscript transcription (which is dependent on whether one is encoding an edition or encoding a manuscript transcription)&lt;br /&gt;
&lt;br /&gt;
It will do this by: &lt;br /&gt;
* running a mailing list on this topic&lt;br /&gt;
* assess the TEI and suggest improvements/alterations to the TEI-Council&lt;br /&gt;
&lt;br /&gt;
The SIG is currently convened by Elena Pierazzo who also manages the TEI-MS-SIG List, Amanda Gailey, and Malte Rehbein. It was originally setup by Elena Pierazzo, Susan Schreibman, and Edward Vanhoutte.&lt;br /&gt;
&lt;br /&gt;
The SIG runs a mailing list on this topic. To join visit http://listserv.brown.edu/tei-ms-sig.html&lt;br /&gt;
&lt;br /&gt;
=== Activities ===&lt;br /&gt;
&lt;br /&gt;
The SIG met at the Third Annual TEI Members' Meeting in Nancy, France on 08 November 2003. The report can be found [http://www.tei-c.org.uk/Activities/SIG/Manuscript/mssigr01.html here].&lt;br /&gt;
&lt;br /&gt;
The second meeting of the SIG was held at the Fourth Annual TEI Members' Meeting at Johns Hopkins University, Baltimore, USA, 23 October 2004. The report can be found [http://www.tei-c.org.uk/Activities/SIG/Manuscript/mssigr02.html here].&lt;br /&gt;
&lt;br /&gt;
The third meeting of the SIG was held at Seventh Annual TEI Member's Meeting at University of Maryland, College Park, 3nd of November 2007. The agenda of the meeting can be found [[MS SIG Agenda 2007|here]]; the report can be found [[MS SIG Report 07|here]]. During the meeting several task forces were created to deal with specific problems: the task forces main page can be found [[MS task forces|here]].&lt;br /&gt;
&lt;br /&gt;
=== Useful links ===&lt;br /&gt;
&lt;br /&gt;
==== Manuscript Description ====&lt;br /&gt;
&lt;br /&gt;
Projects, resources, guidelines concerning MS description.&lt;br /&gt;
&lt;br /&gt;
* [http://www.tei-c.org/release/doc/tei-p5-doc/html/MS.html TEI P5: 13. Manuscript Description]&lt;br /&gt;
* [http://enrich.manuscriptorium.com ENRICH project] - standardising and aggregating medieval manuscript descriptions. See also [http://tei.oucs.ox.ac.uk/ENRICH/ OUCS ENRICH Website] containing ENRICH Schemas, ODD, Documentation and Training Materials.&lt;br /&gt;
* [http://sunsite3.berkeley.edu/Scriptorium/ Digital Scriptorium] (see especially the [http://sunsite3.berkeley.edu/Scriptorium/technical/description_dtd.html DESCRIPTION DTD] page).&lt;br /&gt;
* [http://www.hab.de/forschung/projekte/master-e.htm MASTER - Interface format for the description of medieval manuscripts]&lt;br /&gt;
* [http://www.tei-c.org/Activities/MS/msw05.xml Reviews of the manuscript description chapter: A summary]&lt;br /&gt;
&lt;br /&gt;
==== Manuscript Transcription ====&lt;br /&gt;
&lt;br /&gt;
Projects, resources, guidelines concerning MS transcription.&lt;br /&gt;
&lt;br /&gt;
* [http://www.cdlib.org/inside/diglib/stwg/ms/ CDL Encoding Guidelines for Manuscripts and Rare Books]&lt;br /&gt;
* [http://www.kantl.be/ctb/project/dalf/ DALF: Digital Archive of Letters in Flanders]&lt;br /&gt;
* [http://www.scriptorium.columbia.edu/ Digital Scriptorium] (see especially the [https://www1.columbia.edu/sec/cu/libraries/bts/digital_scriptorium/technical/ds-xml/transcription_dtd/index.html TRANSCRIPTION DTD] and [https://www1.columbia.edu/sec/cu/libraries/bts/digital_scriptorium/technical/ds-xml/description_dtd/index.html DESCRIPTION DTD] pages).&lt;br /&gt;
* [[EpiDoc]], guidelines for transcribing ancient texts&lt;br /&gt;
* [http://skaldic.arts.usyd.edu.au/docs/guidelines/guide.html Guidelines for the electronic edition of the skaldic corpus]&lt;br /&gt;
* [http://web.uvic.ca/hrd/lydgate/pilot.htm Partial Transcription of John Lydgate's &amp;quot;Fall of Princes&amp;quot;]&lt;br /&gt;
* [http://webapp1.dlib.indiana.edu/newton/index.jsp The Chymistry of Isaac Newton]&lt;br /&gt;
* [http://gandalf.aksis.uib.no/menota/guidelines/ The Menota handbook]&lt;br /&gt;
* [http://www.newtonproject.ic.ac.uk/prism.php?id=1 The Newton Project] (see also [http://ahds.ac.uk/creating/case-studies/newton/index.htm The Newton Project: Implementing and Exploiting XML]).&lt;br /&gt;
* [http://www.whitmanarchive.org/guidelines/ The Walt Whitman Archive Encoding Guidelines for Poetry Manuscripts]&lt;br /&gt;
* [http://jefferson.village.virginia.edu/seenet/piers/protocoltran.html Transcriptional Protocols: Piers Plowman Electronic Archive and SEENET]&lt;br /&gt;
* [http://elec.enc.sorbonne.fr/chroniqueslatines/ Chroniques latines de Saint-Denis] : critical edition using TEI P4 (but we hope to migrate in P5 this year) : [http://elec.enc.sorbonne.fr/chroniqueslatines/xml/chroniquesstdenis.xml XML Files]&lt;br /&gt;
* [http://lemo.irht.cnrs.fr/43/43-12.htm Réflexions sur l'utilisation de la TEI pour encoder les sources diplomatiques] and [http://elec.enc.sorbonne.fr/cartulaireblanc/guidebalisage/ Guide du balisage du Cartulaire blanc] : two resources in French&lt;br /&gt;
* [http://ccfm.ens-lsh.fr/IMG/pdf/BFM-Mss_Encodage-XML.pdf Base de Français Médiéval - MSS Encoding Guidelines (in French)]&lt;br /&gt;
* [http://marjorie.burghart.online.fr/?q=en/content/tei-critical-apparatus-cheatsheet TEI: Critical Apparatus Cheatsheet]&lt;br /&gt;
&lt;br /&gt;
[[Category:SIG|MSS]]&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9473</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9473"/>
		<updated>2011-05-31T08:30:35Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* An encoding proposal from the perspective of computer-aided collation tools */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Scaling is generally a problem with methods of indicating textual variance, but in parallel segmentation this is exacerbated because as the number of witnesses increase, the likelihood of needing to reformulate the reading boundaries, never mind the difficulty in reading or understanding such encodings. This may be a problem not only when looking at a single text with many witnesses, where variation in structure may be extremely difficult to represent where conflicts occur which disrupt this very basic structure (for example, imagine a set of witnesses where some have lines in linegroups, some just lines, some paragraphs, some paragraphs in divisions, but all with the same underlying text). But also where parallel segmentation is being used to record divergent interpretations of these individual witnesses by many editors (for distributed co-operative editions generated from many editorial views of a text). A plausible recommendation is to use a form of stand-off apparatus for such editions rather than parallel segmentation. And while some of the current methods can be used in a stand-off method, they should be updated to reflect current P5 usage of URI-based pointers.&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when you want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghart as well as following the [[Textual_Variance|“Gothenburg model” of textual variance]], a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot; xml:base=&amp;quot;http://edition.org/&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[1]) witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[2]) witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[3]) witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[4]) witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[5]) witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[6]) witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;br /&gt;
&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
[[Category:Critical Apparatus]]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9187</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9187"/>
		<updated>2011-04-09T13:12:12Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Modelling collated data: Encode the alignment/linking between tokens */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot; xml:base=&amp;quot;http://edition.org/&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[1]) witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[2]) witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[3]) witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[4]) witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[5]) witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;witness_1#xpath1(/p[1]/w[6]) witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;br /&gt;
&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
[[Category:Critical Apparatus]]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Textual_Variance&amp;diff=9179</id>
		<title>Textual Variance</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Textual_Variance&amp;diff=9179"/>
		<updated>2011-04-08T18:53:18Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The working group on [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter] is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
== The “Gothenburg model”: A modular architecture for computer-aided collation ==&lt;br /&gt;
&lt;br /&gt;
Developers of [http://collatex.sourceforge.net/ CollateX] and [http://www.juxtasoftware.org/ Juxta] met in 2009 at a joint workshop of the EU-funded research projects [http://www.cost-a32.eu/ COST Action 32] and [http://www.interedition.eu/ Interedition] in Gothenburg. They wanted to agree on a modular software architecture, so these two as well as similar projects interested in collation software would have a common base for collaborating in the development of needed tools. As a first result the participants identified the following 4 modules/tasks, which were found to be essential to computer-aided collation. The underlying ideas might consequently need to be discussed in the context of encoding the in- and output of these modules as part of – or pre-stage to – a critical apparatus.&lt;br /&gt;
&lt;br /&gt;
=== Tokenizer ===&lt;br /&gt;
&lt;br /&gt;
[[Image:Collation_Tokenizer.png|frame|right|A tokenized text]]&lt;br /&gt;
&lt;br /&gt;
While collators can compare witnesses on a character-by-character basis, in the more common use case each comparand is split up into tokens/ segments before collation and compared on the token level. This preprocessing step called ''tokenization'' and performed by a ''tokenizer'' can happen on any level of granularity, e.g. on the level of syllables, words, lines, phrases, verses, paragraphes, text nodes in a normalized DOM etc.&lt;br /&gt;
&lt;br /&gt;
Another service provided by tokenizers and of special value to text-oriented collators relates to marked-up comparands: As these collators primarily compare witnesses based on their textual content, embedded markup would usually get in the way and therefore needs to be filtered out and/or “pushed in the background”, so the collator can operate on tokens of textual content. At the same time it might be valueable to have the markup context of every token available, e.g. in case one wants to make use of it in complex token comparator functions.&lt;br /&gt;
&lt;br /&gt;
The figure to the right depicts this process: Think of the upper line as a witness, its characters a, b, c, d as arbitrary tokens and e1, e2 as examples of embedded markup elements. A tokenizer would transform this marked-up text into a sequence of tokens, each referring to their respective markup/tagging context. From now on a collator can compare this witness to others based on its tokenized content and does not have to deal with it on a syntactic level anymore, that is rather specific to a particular markup language or dialect.&lt;br /&gt;
&lt;br /&gt;
==== Encoding challenges ====&lt;br /&gt;
&lt;br /&gt;
* Tokenization can rely on a [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SASE given segmentation expressed by existing markup] in the witness, but it might also introduce a new layer of markup representing its outcome, which can [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/NH.html overlap] with the existing markup hierarchy. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;br style=&amp;quot;clear: both&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Aligner ===&lt;br /&gt;
&lt;br /&gt;
[[Image:Collation_Aligner.png|frame|right|An alignment table]]&lt;br /&gt;
&lt;br /&gt;
After the witnesses have been tokenized, collators try to align all witnesses involved. Simply put, aligning the witnesses means in this case: Find matching tokens and insert empty tokens (''gap tokens'') such that the token sequences of all witness line up properly. Interestingly this problem is computationally similar to the problem of [http://en.wikipedia.org/wiki/Sequence_alignment sequence alignment] encountered in bioinformatics.&lt;br /&gt;
&lt;br /&gt;
Looking at an example, assume that we have three witnesses: the first is comprised of the token sequence (a, b, c, d), the second reads (a, c, d, b) and the third (b, c, d). A collator might align these three witnesses as depicted in a tabular fashion on the right. Each witness occupies a column, matching tokens are aligned in a row, necessary gap tokens as inserted during the alignment process are denoted via a hyphen. Depending from which perspective one interprets this ''alignment table'', one can say for example that the (b) in the second row was ommitted in the second witness or it has been added in the first and the third. A similar statement can be made about (b) in the last row by just inverting the relationship of being added/ommitted.&lt;br /&gt;
&lt;br /&gt;
Alignment tables like the one shown can be encoded losslessly with an existing apparatus encoding scheme, in parallel segmentation mode, as long as only the textual content of token needs to be represented. Each row is represented by a segment with empty readings for gap tokens. Optionally consecutive rows with identical readings for each witness can be compressed into a single segment, e.g.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;#w1 #w2&amp;quot;&amp;gt;a&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;#w3&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;#w1 #w3&amp;quot;&amp;gt;b&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;w2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;#w1 #w2 #w3&amp;quot;&amp;gt;cd&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;#w2&amp;quot;&amp;gt;b&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;#w1 #w3&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Encoding challenges ====&lt;br /&gt;
&lt;br /&gt;
* In case the aligned tokens shall be embedded into the apparatus encoding, only their textual content is guaranteed to be embeddable without causing markup validation problems. Otherwise there needs to be a defined way of referencing tokens in their respective markup context, because this context cannot be replicated in the apparatus without loosening the XML schema constraints substantially.&lt;br /&gt;
&lt;br /&gt;
=== Analyzer ===&lt;br /&gt;
&lt;br /&gt;
[[Image:Collation_Analyzer.png|frame|left|An analyzed alignment table]]&lt;br /&gt;
&lt;br /&gt;
On top of the results delivered by the alignment process, a further analysis can yield additional findings. Echoing the example from the above section, one might want to think of the token (b) in row 2 and 5 as being transposed instead of as being added/omitted separately. Some collators try to detect transpositions as part of the alignment process, some do it as a post-processing step and others do not handle transpositions at all and/or leave it to the user to declare those beforehand. Part of the reason for algorithmic differences in transpostion handling is the fact, that the question which tokens are actually transposed is much more a matter of interpretation than the question of matching and aligning them. While alignment results can still be judged in terms of their quality to some extent, transposition detection can only be done heuristically as one can easily think of cases, where it is impossible for a computer “to get it right”.&lt;br /&gt;
&lt;br /&gt;
Apart from the specific problem of transpositions, it seems generally necessary to incorporate a step in the collation process, in which the user can examine the preliminary collation result, edit and augment it according to her knowledge and possibly feed it back into the collator for another run yielding enhanced results.&lt;br /&gt;
&lt;br /&gt;
==== Encoding challenges ====&lt;br /&gt;
&lt;br /&gt;
* If we model transpositions as links between tokens/segments in an alignment/apparatus, we will also need a way to encode these links. Transpositions in general have not been supported in the TEI until recently when the [[Genetic_Editions|WG “Genetic editions”]] proposed an encoding scheme in the context of document-oriented markup.&lt;br /&gt;
* The proposed encoding scheme for transpositions should be reviewed, whether it is suitable as input for collation tools. An encoding scheme for user-declared alignments/transpositions would help in supporting roundtrip collation.&lt;br /&gt;
&lt;br /&gt;
=== Visualization ===&lt;br /&gt;
&lt;br /&gt;
The last module of the Gothenburg model deals with visualizing collation results. As we are concerned with modelling and encoding textual variance properly, the question of how to visualize it is of technical importance and should not be disregarded, but is essentially out of scope with regard to this discussion.&lt;br /&gt;
&lt;br /&gt;
== The “variant graph”: Schmidt’s model of textual variance ==&lt;br /&gt;
&lt;br /&gt;
In a [http://dx.doi.org/10.1016/j.ijhcs.2009.02.001 recent paper] D. Schmidt and R. Colomb proposed a data model of textual variance (or “multi-version texts” as they call it in the title), which they call a ''variant graph'':&lt;br /&gt;
&lt;br /&gt;
[[Image:Schmidt_Variant_Graph.png|frame|none|A variant graph]]&lt;br /&gt;
&lt;br /&gt;
In this model, varying texts/ a collation are/ is expressed in a [http://en.wikipedia.org/wiki/Directed_acyclic_graph directed acyclic graph] with each path through the graph representing one version/ witness. The textual data is annotated on the edges, each edge carrying a (common) segment of text(s) and a set of identifiers, that denotes the versions/witnesses, in which the segment appears. Transpositions can be superimposed on the graph by linking edges of transposed segments.&lt;br /&gt;
&lt;br /&gt;
The tabular model described above and the given graph-based model can be converted into each other, with Schmidt’s model having the advantage, that it is&lt;br /&gt;
&lt;br /&gt;
* more space-efficient as it combines matching segments into a single edge instead of duplicating them per row/column,&lt;br /&gt;
* more natural in expressing transpostions as matching segments are linked and not pairs of tokens.&lt;br /&gt;
&lt;br /&gt;
The tabular model on the other hand might be advantageous, if one wanted to keep collation results in a relational datastore.&lt;br /&gt;
&lt;br /&gt;
== Resources/ Bibliography ==&lt;br /&gt;
&lt;br /&gt;
=== Modelling textual variance ===&lt;br /&gt;
&lt;br /&gt;
*  Multi-Version Document Format. [http://dx.doi.org/10.1016/j.ijhcs.2009.02.001 Schmidt, D. and Colomb, R, 2009. A data structure for representing multi-version texts online, International Journal of Human-Computer Studies, 67.6, 497-514.] (See the related [http://multiversiondocs.blogspot.com/ blog] and [http://multiversiondocs.blogspot.com/2008/03/whats-multi-version-document.html the post  “What's a Multi-Version Document?”].&lt;br /&gt;
* Multi-Version Texts and Collation. [http://www.balisage.net/Proceedings/vol3/html/Schmidt01/BalisageVol3-Schmidt01.html Schmidt, Desmond. “Merging Multi-Version Texts: a Generic Solution to the Overlap Problem.”] Presented at Balisage: The Markup Conference 2009, Montréal, Canada, August 11 - 14, 2009. In Proceedings of Balisage: The Markup Conference 2009. Balisage Series on Markup Technologies, vol. 3 (2009). doi:10.4242/BalisageVol3.Schmidt01.] &lt;br /&gt;
&lt;br /&gt;
=== Computer-aided collation: Concepts and algorithms ===&lt;br /&gt;
&lt;br /&gt;
* Matthew Spencer, Christopher J. Howe. Collating Texts Using Progressive Multiple Alignment. Computers and the Humanities. 38/2004. S. 253–270.&lt;br /&gt;
* [http://edoc.bbaw.de/volltexte/2007/516/ Michael Stolz, Friedrich Michael Dimpel. Computergestützte Kollationierung und ihre Integration in den editorischen Arbeitsfluss. 2006.]&lt;br /&gt;
&lt;br /&gt;
=== Computer-aided collation: Software ===&lt;br /&gt;
&lt;br /&gt;
* [http://www.sd-editions.com/about/index.html Collate]&lt;br /&gt;
* [http://collatex.sourceforge.net/ CollateX]&lt;br /&gt;
* [http://www.juxtasoftware.org/ Juxta]&lt;br /&gt;
* [http://code.google.com/p/multiversiondocs/wiki/NMerge NMerge]&lt;br /&gt;
&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
[[Category:Critical Apparatus]]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Category:Critical_Apparatus&amp;diff=9178</id>
		<title>Category:Critical Apparatus</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Category:Critical_Apparatus&amp;diff=9178"/>
		<updated>2011-04-08T18:52:16Z</updated>

		<summary type="html">&lt;p&gt;Gremid: New page: http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9177</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9177"/>
		<updated>2011-04-08T18:49:16Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;br /&gt;
&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
[[Category:Critical Apparatus]]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9176</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9176"/>
		<updated>2011-04-08T17:19:09Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Inclusion of structural markup in the apparatus */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9175</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9175"/>
		<updated>2011-04-08T17:18:48Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Encoding variants in structural markup */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9174</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9174"/>
		<updated>2011-04-08T17:17:19Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9173</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9173"/>
		<updated>2011-04-08T17:16:39Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* “Critical Apparatus” vs. “Textual Variance” */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” vs. “Textual Variants”== &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9172</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9172"/>
		<updated>2011-04-08T17:12:49Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup:&lt;br /&gt;
&lt;br /&gt;
* Marjorie Burghart (MB)&lt;br /&gt;
* James Cummings (JC)&lt;br /&gt;
* Fotis Jannidis (FJ)&lt;br /&gt;
* Gregor Middell (GM)&lt;br /&gt;
* Dan O'Donnell (DOD)&lt;br /&gt;
* Espen Ore (EO)&lt;br /&gt;
* Elena Pierazzo (EP)&lt;br /&gt;
* Roberto Rosselli del Turco (RDT)&lt;br /&gt;
* Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== “Critical Apparatus” vs. “Textual Variance” == &lt;br /&gt;
&lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the '''critical apparatus''' is just inherited from the printed world and one of the possible physical embodiment of '''textual variance'''. EP therefore proposes to use this new name, moving from &amp;quot;critical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use '''textual variants''' instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart of the current critical apparatus recommendations.  That of encoding an apparatus at the site of textual variance and encoding a structured view of a note entirely separate from the edited version of texts. (In mass digitization of critical editions, for example, one might have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are not encoded at the site of variance, or indeed necessarily connected with it.)  It is this striving to both be able to encode all sorts of various legacy forms of apparatus as well as simultaneously catering for those who are recording the structure by which they will generate an apparatus in producing some output.  So JC would argue that the first of these are apparatus and the second of these is a site/locus of textual variance.&lt;br /&gt;
&lt;br /&gt;
== Issues == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Specific phenomena ===&lt;br /&gt;
&lt;br /&gt;
==== Transpositions ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
==== Handling of punctuation ====&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
==== Representing omissions in an apparatus ====&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Markup-related ===&lt;br /&gt;
&lt;br /&gt;
==== Inclusion of structural markup in the apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
==== Encoding variants in structural markup ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
==== Conflicts between individual readings and the semantics of structural markup that surrounds it ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Workflow-related ===&lt;br /&gt;
&lt;br /&gt;
==== Scalability ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
==== Refactoring ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn.&lt;br /&gt;
&lt;br /&gt;
==== Complexity ====&lt;br /&gt;
&lt;br /&gt;
Manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
==== Feasibility of double-endpoint-attached method ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Model vs. Representation ===&lt;br /&gt;
&lt;br /&gt;
==== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Representing &amp;quot;verbose&amp;quot; apparatus ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
==== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ====&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
==== Collations of differing granularity ====&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9171</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9171"/>
		<updated>2011-04-08T16:46:26Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Encoding variants in structure */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Collations of differing granularity ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
=== Encoding variants in structure ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
* http://tei.markmail.org/thread/hbmnsn3v4aqjabt3&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9170</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9170"/>
		<updated>2011-04-08T16:44:25Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Encoding variants in structure */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Collations of differing granularity ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
=== Encoding variants in structure ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
* http://tei.markmail.org/thread/x5agpwzn4hiwwwcx&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9169</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9169"/>
		<updated>2011-04-08T16:36:27Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Collations of differing granularity */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Collations of differing granularity ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
* http://tei.markmail.org/thread/gqyymzd4a4xvhch7&lt;br /&gt;
&lt;br /&gt;
=== Encoding variants in structure ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9168</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9168"/>
		<updated>2011-04-08T16:33:36Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Transpositions */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fuszgtpnn2ywf6bh&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Collations of differing granularity ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
&lt;br /&gt;
=== Encoding variants in structure ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9167</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9167"/>
		<updated>2011-04-08T16:30:39Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Issues with the current Critical Apparatus chapter/module */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Collations of differing granularity ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
&lt;br /&gt;
=== Encoding variants in structure ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ap62n37uf6rbfds4&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9166</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9166"/>
		<updated>2011-04-08T16:27:44Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Issues with the current Critical Apparatus chapter/module */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
=== Collations of differing granularity ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/bonflsyb2d3ebtp2&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9165</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9165"/>
		<updated>2011-04-08T16:23:12Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Representing omissions in an apparatus */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
* http://tei.markmail.org/thread/4sheu6nji3dvnf64&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9164</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9164"/>
		<updated>2011-04-08T16:20:49Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Issues with the current Critical Apparatus chapter/module */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
&lt;br /&gt;
=== Feasibility of double-endpoint-attached method ===&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/fsj7gvojds4mwcm5&lt;br /&gt;
* http://tei.markmail.org/thread/flwcnf4fxm4u7ebj&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9163</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9163"/>
		<updated>2011-04-08T16:14:37Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Representing &amp;quot;verbose&amp;quot; apparatus */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
* http://tei.markmail.org/thread/diubpw5adw6ntcas&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9162</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9162"/>
		<updated>2011-04-08T16:07:24Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Issues with the current Critical Apparatus chapter/module */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
=== Representing omissions in an apparatus ===&lt;br /&gt;
&lt;br /&gt;
What's the proper way to represent missing lines/ paragraphs/ verses?&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/parztmwmlx5mqsof&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9161</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9161"/>
		<updated>2011-04-08T16:02:03Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Representing &amp;quot;verbose&amp;quot; apparatus */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
See also:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/ib3bsrpirepp4ibc&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9160</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9160"/>
		<updated>2011-04-08T15:49:18Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Representation of suggestions by the editor: ''lege'' ''dele'' etc. */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus.&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/vfw25psb5vgdiftw&lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9159</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9159"/>
		<updated>2011-04-08T15:45:57Z</updated>

		<summary type="html">&lt;p&gt;Gremid: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus. &lt;br /&gt;
&lt;br /&gt;
=== Handling of punctuation ===&lt;br /&gt;
&lt;br /&gt;
Seems to be a common problem in textual criticism/ apparatus creation, but lacks guidelines/ encoding examples:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/es6byhxpsbgkrxzo&lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9158</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9158"/>
		<updated>2011-04-08T15:34:12Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Scalability */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses.&lt;br /&gt;
&lt;br /&gt;
Also manually crafting an apparatus is error-prone:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/yuxqotf5aynxznq5&lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus. &lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
	<entry>
		<id>https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9157</id>
		<title>Critical Apparatus Workgroup</title>
		<link rel="alternate" type="text/html" href="https://wiki.tei-c.org/index.php?title=Critical_Apparatus_Workgroup&amp;diff=9157"/>
		<updated>2011-04-08T15:24:01Z</updated>

		<summary type="html">&lt;p&gt;Gremid: /* Inclusion of structural markup in the apparatus */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html Critical Apparatus] workgroup is part of the TEI special interest group on manuscript [[SIG:MSS]].&lt;br /&gt;
This page provides a summary of the preliminary discussions regarding the current issues with [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TC.html the critical apparatus chapter]. &lt;br /&gt;
&lt;br /&gt;
Participants to the preliminary workgroup: Marjorie Burghart (MB), James Cummings (JC), Fotis Jannidis (FJ), Gregor Middell (GM), Dan O'Donnell (DOD), Espen Ore (EO), Elena Pierazzo (EP), Roberto Rosselli del Turco (RDT), Chris Wittern (CW)&lt;br /&gt;
&lt;br /&gt;
== A preliminary vocabulary question == &lt;br /&gt;
The very name of the chapter, &amp;quot;Critical apparatus&amp;quot;, is felt by some to be be a problem: the critical apparatus is just inherited from the printed world and one of the possible physical embodiment of TEXTUAL VARIANCE. EP therefore proposes to use this new name, moving from &amp;quot;citical apparatus&amp;quot; to textual variance.&lt;br /&gt;
&lt;br /&gt;
MB argues that, oddly, &amp;quot;textual variance&amp;quot; feels more restrictive to her than &amp;quot;critical apparatus&amp;quot;: it is a notion linked with Cerquiglini's work, which does not correspond to '''every''' branch of textual criticism. On the other hand, strictly speaking, the &amp;quot;critical apparatus&amp;quot; is not limited to registering the variants of the several witnesses of a text. It also includes various kinds of notes (identification of the sources of the text, historical notes, etc.). Even texts with a single witness may have a critical apparatus. Maybe the problem with the name has its origins in the choice of giving the name &amp;quot;critical apparatus&amp;quot; to a part of the guidelines dedicated solely to the registration of textual variants. &lt;br /&gt;
&lt;br /&gt;
FJ argues that for German ears the concept of textual variance is not closely connected to a specific scholar.&lt;br /&gt;
&lt;br /&gt;
MB proposes to use &amp;quot;TEXTUAL VARIANTS&amp;quot; instead, since it focuses more on actual elements in the edition, when &amp;quot;variance&amp;quot; is nothing concrete but a phenomenon. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Side remarks by MB: this vocabulary queston might prove sticky in the end. The &amp;lt;app&amp;gt; elements is named &amp;lt;app&amp;gt; because it is considered &amp;quot;an apparatus entry&amp;quot;, so unless we end up recommending to change the elements names, the phrase &amp;quot;critical apparatus&amp;quot; will still be used in the module, at least to explain the tag names?&lt;br /&gt;
&lt;br /&gt;
RDT argues that while backward compatibility is clearly a bonus, as MB states &amp;lt;app&amp;gt; stands for 'apparatus entry': we shouldn't be afraid to change its function, for instance making it a container instead of a phrase level element. RDT stresses that he is proposing this by way of example, and to stress that our focus is on variants: these might then be organised in &amp;lt;app&amp;gt;s for traditional CA display, and/or in other, new ways for electronic display. Note that this might mean no traditional critical apparatus in a digital edition.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
MB: It is characteristic of a print-based approach to encoding that the &amp;lt;app&amp;gt; element was considered as encoding an apparatus entry (hence the &amp;lt;app&amp;gt; name), when what it really encodes is a locus where different witnesses have variant readings (whch would probably have justified a name along the lines of &amp;lt;locus&amp;gt; or whatnot).   &lt;br /&gt;
&lt;br /&gt;
JC: Thinks this points to a slight divergent nature at the heart&lt;br /&gt;
of the current critical apparatus recommendations.  That of encoding&lt;br /&gt;
an apparatus at the site of textual variance and encoding a structured&lt;br /&gt;
view of a note entirely separate from the edited version of texts.&lt;br /&gt;
(In mass digitization of critical editions, for example, one might&lt;br /&gt;
have an &amp;lt;app&amp;gt; in a set of notes at the bottom of the page which are&lt;br /&gt;
not encoded at the site of variance, or indeed necessarily connected&lt;br /&gt;
with it.)  It is this striving to both be able to encode all sorts of&lt;br /&gt;
various legacy forms of apparatus as well as simultaneously catering&lt;br /&gt;
for those who are recording the structure by which they will generate&lt;br /&gt;
an apparatus in producing some output.  So JC would argue that the first of&lt;br /&gt;
these are apparatus and the second of these is a site/locus of textual&lt;br /&gt;
variance.&lt;br /&gt;
&lt;br /&gt;
== Issues with the current Critical Apparatus chapter/module == &lt;br /&gt;
&lt;br /&gt;
Preliminary notice: most of the issues raised here are connected with the parallel segmentation method, not because it is the more flawed, but because it is the more used by the members of this group. While location-referenced and double-end-point-attachment might be useful for mass conversion of printed material (for the former) and/or when using a piece of software handling the encoding (for the latter), the parallel segmentation method seems to be the easiest and more powerful way to encode the critical apparatus &amp;quot;by hand&amp;quot;. &lt;br /&gt;
&lt;br /&gt;
Also, one might point out that most of the issues raised here might be solved with standoff encoding. But this is extremely cumbersome to handle without the aid of a software, and it does not correspond to the way most people work. &lt;br /&gt;
&lt;br /&gt;
=== Inclusion of structural markup in the apparatus ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the &amp;lt;app&amp;gt; element is phrase-level, when it really should be allowed to include paragraphs, and even &amp;amp;lt;div&amp;amp;gt;s.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;I'm encoding a 19th c. edition of a medieval text, and one of the &lt;br /&gt;
witness has omissions of several paragraphs. Of course, the TEI schema &lt;br /&gt;
won't let me put &amp;amp;lt;p&amp;amp;gt; elements inside an &amp;lt;app&amp;gt;/&amp;lt;lem&amp;gt; element...&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
- I use the parallel segmentation method&amp;lt;br/&amp;gt;&lt;br /&gt;
- It is important to me to keep a methodical link between the encoded &lt;br /&gt;
apparatus and the notes numbers in the original edition (the &lt;br /&gt;
@n of each &amp;lt;app&amp;gt; tag bears the number of the footnote in the original &lt;br /&gt;
edition)&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/tbzi2yj5xd4dto34&lt;br /&gt;
&lt;br /&gt;
More use cases from TEI-L:&lt;br /&gt;
&lt;br /&gt;
* http://tei.markmail.org/thread/jyezaqfycaldtdcv&lt;br /&gt;
* http://tei.markmail.org/thread/fbyuxyabbxq4rwbr&lt;br /&gt;
* http://tei.markmail.org/thread/vrwkl7kkruulyjzh&lt;br /&gt;
&lt;br /&gt;
=== Transpositions ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: with the parallel segmentation method, it is often cumbersome to render transpositions.&lt;br /&gt;
&lt;br /&gt;
Additionally it is not possible to mark them up explicitly. [http://juxtasoftware.org/ Juxta] for example works around that by storing transposition data in a custom XML format:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;moves&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10462&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;9872&amp;quot; end2=&amp;quot;10467&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10483&amp;quot; doc2=&amp;quot;1870 2nd Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;7781&amp;quot; end2=&amp;quot;8376&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9679&amp;quot; end1=&amp;quot;10504&amp;quot; doc2=&amp;quot;1870 Proof&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8458&amp;quot; end2=&amp;quot;9056&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1855 MS&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;9886&amp;quot; end1=&amp;quot;10525&amp;quot; doc2=&amp;quot;1870 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;8546&amp;quot; end2=&amp;quot;9141&amp;quot; /&amp;gt;&lt;br /&gt;
        &amp;lt;move doc1=&amp;quot;1870 Proof&amp;quot; space1=&amp;quot;original&amp;quot; start1=&amp;quot;1640&amp;quot; end1=&amp;quot;1850&amp;quot; doc2=&amp;quot;1881 1st Ed.&amp;quot; space2=&amp;quot;original&amp;quot; start2=&amp;quot;2961&amp;quot; end2=&amp;quot;3070&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/moves&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Neither is this TEI-compliant, nor is the offset/range-based addressing (@start1/@start2 and @end1/@end2) proper XML markup. A standardized encoding would be helpful.&lt;br /&gt;
&lt;br /&gt;
=== Scalability ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: the parallel segmentation method is difficult to handle when adding hundreds of conflicting witnesses. &lt;br /&gt;
&lt;br /&gt;
=== Refactoring ===&lt;br /&gt;
In a nutshell: with the the parallel segmentation method, it is cumbersome to add a new reading that necessitates changing where the borders of readings are drawn. &lt;br /&gt;
&lt;br /&gt;
=== conflicts between individual readings and the semantics of structural markup that surrounds it ===&lt;br /&gt;
In a nutshell: with the parallel segmentation method, witnesses with different forms of lineation pose a problem.&lt;br /&gt;
&lt;br /&gt;
=== Showing a lemma different from the content of the &amp;lt;lem&amp;gt; or chosen reading in an apparatus note ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: depending on the desired output of your digital edition, you may need to show in the apparatus entry a lemma text different from the content of the &amp;lt;lem&amp;gt; or desired &amp;lt;rdg&amp;gt;. This is typically the case for long omissions, when one does not display the full text that is omitted by one or more witnesses, but only the beginning and end of the omitted span of text.&lt;br /&gt;
&lt;br /&gt;
Use case: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;Let's consider again the example used in a previous use case: &lt;br /&gt;
Here is the [http://baluze.univ-avignon.fr/scan/t1/%285%29.jpg scan of a page from this edition], please consider footnote number 9. &lt;br /&gt;
The note contains: &amp;quot;9. Eodem anno, rex Francie… dampnificati, paragraphes omis par Bal.&amp;quot;, meaning that the ''Bal.'' witness has an omission where other witnesses have two long paragraphs, the first one beginning on the previous page (see the [http://baluze.univ-avignon.fr/scan/t1/%284%29.jpg previous page scanned]). &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
You certainly do not want to generate a footnote with these two full paragraphs to tell the reader that one witness omits them, but on the other hand you want to be able to represent the source according to its various witnesses, so location-referenced is not in order. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representing &amp;quot;verbose&amp;quot; apparatus ===&lt;br /&gt;
In a nutshell: when ou want to represent an apparatus entry written in a rather verbose way (in a print-to-digital edition). The same is true if you want to be able to generate a verbose apparatus note in a &amp;quot;born digital&amp;quot; edition. &lt;br /&gt;
&lt;br /&gt;
Use cases: &lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;You're encoding an existing edition, and want to represent the source it edits, while keeping intact the text / apparatus of the existing edition. Some apparatus entries are easy to represent with the &amp;lt;app&amp;gt; / &amp;lt;lem&amp;gt; / &amp;lt;rdg&amp;gt; elements, some others add editorial comments to the listing of the variants, and are quite difficult to represent. BTW, the same goes when you are encoding a born-digital edition for which you want to be able to generate an alternative print output corresponding to the traditional standards of a collection. &lt;br /&gt;
&amp;lt;br/&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
A - When I have a footnote giving two &amp;lt;i&amp;gt;lectiones&amp;lt;/i&amp;gt; from the same manuscrip, one before correction and the other after: &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Text&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: ad lectorem Venetum (b) .&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;b class=&amp;quot;moz-txt-star&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;Note&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;: b) &amp;lt;i&amp;gt;ms.&amp;lt;/i&amp;gt;, lectionem venerum &amp;lt;i class=&amp;quot;moz-txt-slash&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;corrigé postérieurement en&amp;lt;span class=&amp;quot;moz-txt-tag&amp;quot;&amp;gt;&amp;lt;/span&amp;gt;&amp;lt;/i&amp;gt; lectorem Venetum&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
If I encode it like this, with two seprate rdg for the same&lt;br /&gt;
witness, each with a different @type (for instance, &amp;quot;anteCorr&amp;quot; and&lt;br /&gt;
&amp;quot;postCorr&amp;quot;), it gives an accurate account of the state of the witness, BUT it is an&lt;br /&gt;
interpretation of the original note in the critical apparatus, i.e. if&lt;br /&gt;
I do this I delete some text added by the original editor. &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;app n=&amp;quot;b&amp;quot;&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;lem&amp;amp;gt;lectorem Venetum&amp;amp;lt;/lem&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;anteCorr&amp;quot;&amp;amp;gt;lectionem venerum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;rdg wit=&amp;quot;#ms.2&amp;quot; type=&amp;quot;postCorr&amp;quot;&amp;amp;gt;lectorem Venetum&amp;amp;lt;/rdg&amp;amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
Let's consider this other note. There is some text added verbosely within the apparatus note by the editor. &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Text&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: Hiis diebus civitas&lt;br /&gt;
Pergamensis(b) tenebat exersitum&amp;lt;/span&amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;b style=&amp;quot;&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Note&amp;lt;/span&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: b) se, &amp;lt;i&amp;gt;mis indûment avant&amp;lt;/i&amp;gt; tenebat &amp;lt;i&amp;gt;par le ms&amp;lt;/i&amp;gt;.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot;&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;Should I encode it as: &amp;lt;br&amp;gt;&lt;br /&gt;
... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;... &amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
I one represents this note strictly with the &amp;lt;app&amp;gt; / &amp;lt;rdg&amp;gt;, it leads to suppress remarks by the original editor. Adding a note in the rdg to preserve the editor's comments could work here, ut it's not always the case&amp;lt;br/&amp;gt;&lt;br /&gt;
Like: &amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;... Pergamensis &amp;amp;lt;app&lt;br /&gt;
n=&amp;quot;b&amp;quot;&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;lem/&amp;amp;gt;&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp; &amp;amp;lt;rdg&lt;br /&gt;
type=&amp;quot;addition&amp;quot; wit=&amp;quot;#ms&amp;quot;&amp;amp;gt;&amp;amp;lt;sic&amp;amp;gt;se&amp;amp;lt;/sic&amp;amp;gt; &amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;note&amp;amp;gt;&amp;amp;lt;hi&lt;br /&gt;
rend=&amp;quot;italics&amp;quot;&amp;amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;mis&lt;br /&gt;
indûment avant&amp;lt;/span&amp;gt;&amp;amp;lt;/hi&amp;amp;gt; tenebat.&amp;lt;/span&amp;gt;&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;&amp;amp;lt;/note&amp;amp;gt;&amp;amp;lt;/rdg&amp;amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;amp;lt;/app&amp;amp;gt;&amp;lt;/span&amp;gt;&lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;blockquote style=&amp;quot;background:#FFEAEA&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;MsoNormal&amp;quot; style=&amp;quot;text-align: justify;&amp;quot;&amp;gt;'''Text'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: …reliqui demum meos socios (d)&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
'''Note'''&amp;lt;span style=&amp;quot;&amp;quot;&amp;gt;: d) domum&lt;br /&gt;
meam solito, &amp;lt;i&amp;gt;Bal.;&amp;lt;/i&amp;gt; dni &amp;lt;i&amp;gt;ou&amp;lt;/i&amp;gt; dm, &amp;lt;i&amp;gt;ms.; en note&amp;lt;/i&amp;gt; meam solita.&amp;lt;/span&amp;gt;&amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here we have 2 witnesses (Bal. et ms.), the latter with a) an uncertain&lt;br /&gt;
lectio (&amp;quot;dni&amp;quot; or &amp;quot;dm&amp;quot;) and b) a part of the lectio which is written as&lt;br /&gt;
a note (&amp;quot;meam solita&amp;quot;). This is tricky to encode. &lt;br /&gt;
&amp;lt;/blockquote&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Representation of suggestions by the editor: ''lege'' ''dele'' etc. ===&lt;br /&gt;
&lt;br /&gt;
In a nutshell: Sometimes, the editor provides working suggestions through apparatus notes such as ''lege(ndum)'' (&amp;quot;read&amp;quot;), ''dele(ndum)'' (&amp;quot;delete)&amp;quot; etc. They do not belong in the textual variants ''per se'', and are not attached to witnesses, although they do belong in the critical apparatus. &lt;br /&gt;
&lt;br /&gt;
== An encoding proposal from the perspective of computer-aided collation tools == &lt;br /&gt;
&lt;br /&gt;
Gregor Middell gave an overview of textual variance from a software developer's perspective for the workgroup on a [[Textual_Variance|separate page]]. The models described there are used in tools like [http://collatex.sourceforge.net/ CollateX], [http://www.juxtasoftware.org/ Juxta] and [http://code.google.com/p/multiversiondocs/ nmerge].&lt;br /&gt;
&lt;br /&gt;
Collecting ideas from the mailinglist by James Cummings, Dan O'Donnell and Marjorie Burghardt as well as following the “Gothenburg model” of textual variance, a first take at separating the [http://en.wikipedia.org/wiki/Model%E2%80%93view%E2%80%93controller model from the representation] of textual variance could be structured as follows.&lt;br /&gt;
&lt;br /&gt;
=== Modelling input data: Make the units of a collation addressable in the witnesses ===&lt;br /&gt;
&lt;br /&gt;
The Gothenburg model assumes a [[Textual_Variance#Tokenizer|preprocessing step]] by which the witnesses get split up into '''tokens''' of desired granularity. This granularity becomes the minimal unit of collation and can defined as pages, paragraphs, verses, lines, words, characters or any other unit that makes sense in the context of a particular tradition under investigation. To model collation results on top of tokenized witnesses, those tokens have to be addressable.&lt;br /&gt;
&lt;br /&gt;
The TEI defines an [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SAXP array of pointing mechanisms], which can be used to address anything from a whole XML document via URIs down to arbitrary content of those documents via sophisticated XPointer schemes. Projects would be free to choose among those mechanisms as long as each token is made available for later reference.&lt;br /&gt;
&lt;br /&gt;
Examples:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;The&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;quickly&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;p xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;w&amp;gt;Quickly&amp;lt;/w&amp;gt;, &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;cat&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;ate&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;the&amp;lt;/w&amp;gt; &amp;lt;w&amp;gt;food&amp;lt;/w&amp;gt;.&lt;br /&gt;
&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Here tokens on the word-level could be addressed via the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSXP xpath1() XPointer scheme]:&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[1])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;http://edition.org/witness_1#xpath1(/p[1]/w[2])&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
A less verbose scheme would rely on each container element of a token being identified via a (possibly autogenerated) &amp;lt;code&amp;gt;xml:id&amp;lt;/code&amp;gt; attribute, like in the following verse-level tokenization.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;lg xml:base=&amp;quot;urn:goethe:faust2&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_1&amp;quot;&amp;gt;Die Sonne sinkt, die letzten Schiffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_2&amp;quot;&amp;gt;Sie ziehen munter hafenein.&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_3&amp;quot;&amp;gt;Ein großer Kahn ist im Begriffe&amp;lt;/l&amp;gt;&lt;br /&gt;
  &amp;lt;l xml:id=&amp;quot;l_4&amp;quot;&amp;gt;Auf dem Canale hier zu sein.&amp;lt;/l&amp;gt;&lt;br /&gt;
&amp;lt;/lg&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_1&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# &amp;lt;nowiki&amp;gt;urn:goethe:faust2#l_2&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
# ...&lt;br /&gt;
&lt;br /&gt;
One can even think of reference schemes, which are as independent of existing markup as possible. By introducing &amp;amp;lt;anchor/&amp;gt; milestone elements at token boundaries and using the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATSRN range() XPointer scheme] the tokenization of arbitrary TEI documents can be accomplished, because &amp;amp;lt;anchor/&amp;gt; is part of [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-model.global.html model.global].&lt;br /&gt;
&lt;br /&gt;
=== Modelling collated data: Encode the alignment/linking between tokens ===&lt;br /&gt;
&lt;br /&gt;
After tokens in the different witnesses have been made addressable, collation data can be modelled on top of that as [[Textual_Variance#Aligner|alignments of tokens]]. An '''alignment''' can be expressed as a set of tokens from different witnesses or, in accordance with the [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html corresponding guidelines chapter] as a link between two or more tokens.&lt;br /&gt;
&lt;br /&gt;
Taking the first example from above, a collation of the two given witnesses could be expressed as&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;linkGrp type=&amp;quot;collation&amp;quot;&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[1]) http://edition.org/witness_2#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[2]) http://edition.org/witness_2#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[3]) http://edition.org/witness_2#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[4]) http://edition.org/witness_2#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[5]) http://edition.org/witness_2#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;link target=&amp;quot;http://edition.org/witness_1#xpath1(/p[1]/w[6]) http://edition.org/witness_2#xpath1(/p[1]/w[1])&amp;quot; type=&amp;quot;transposition&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/linkGrp&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Each link in this example corresponds to a row in an alignment table as depicted in the Gothenburg model description. Omitted/ added tokens are expressed implictly by not linking to tokens in other witnesses, this is to say: Whether a set of tokens has been added to a witness or has been omitted from it, is a matter of interpreting collation data as expressed above from the perspective of one witness or another and with regard to the way, this witness aligns with others.&lt;br /&gt;
&lt;br /&gt;
One advantage of encoding collation data in such a set-oriented way is its '''scalability''':&lt;br /&gt;
&lt;br /&gt;
# Gradually adding witnesses to the collation may amount to adding alignments to the existing ones or modifying/augmenting the latter, depending on whether the collation is done pairwise (e. g. in relation to a base text) or via multiple alignment (e. g. without a prechosen base).&lt;br /&gt;
# Guiding a collation tool in producing ever more precise aligments in consecutive runs can be achieved by [[Textual_Variance#Analyzer|declaring alignments]] (for example transpositions), feeding those into the collator, adjusting the resulting alignment set, feeding it back into the collator for another run and so forth. Being able to encode the initial/preliminary results of such an iterative process in a standardized way, makes it possible to run different collation tools on the same text tradition, ideally each being able to make use of former results by other tools and to contribute to the overall result.&lt;br /&gt;
&lt;br /&gt;
The major disadvantage of encoding collation data this way is its apparant lack of human readability and that it is hardly possible to edit it by hand, especially when the collated text tradition grows larger. This problem can only be solved via tool support.&lt;br /&gt;
&lt;br /&gt;
=== Encoding the interpretation/ representation: Derive an apparatus from the collation ===&lt;br /&gt;
&lt;br /&gt;
A TEI-encoded critical apparatus is one possible rendition of collation data, possibly enhanced with information yielded from interpreting the alignments. There are a couple of ways how we could encode the above collation as an apparatus.&lt;br /&gt;
&lt;br /&gt;
==== Apparatus pointing to the collated tokens (for easier post-processing) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[1])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[2])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[3])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[4])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[5])&amp;quot; /&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; xml:base=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;&lt;br /&gt;
    &amp;lt;ptr target=&amp;quot;#xpath1(/p[1]/w[6])&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:base=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==== Apparatus with embedded textual content (for readability) ====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; /&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; xml:id=&amp;quot;w2_1&amp;quot;&amp;gt;Quickly,&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot;&amp;gt;The cat ate the food&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot;&amp;gt;the cat ate the food.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;app&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_1&amp;quot; corresp=&amp;quot;#w2_1&amp;quot;&amp;gt;quickly.&amp;lt;/rdg&amp;gt;&lt;br /&gt;
  &amp;lt;rdg wit=&amp;quot;http://edition.org/witness_2&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/app&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Some problems here:&lt;br /&gt;
&lt;br /&gt;
* @corresp vs. &amp;lt;link/&amp;gt; for transpositions over more than two witnesses&lt;br /&gt;
* How to derive the segment content from the original witness automatically, if the token content does not add up to it (e. g. because of punctuation being excluded from the tokens from the start)?&lt;br /&gt;
&lt;br /&gt;
== Bibliography ==&lt;br /&gt;
&lt;br /&gt;
* O'Donnell, Daniel Paul. [http://etjanst.hb.se/bhs/ith/1-8/dpo.pdf “The Ghost in the Machine: Revisiting an Old Model for the Dynamic Generation of Digital Editions.”] HumanIT 8.1 (2005): 51­71.&lt;br /&gt;
[[Category:SIG:Manuscripts]]&lt;br /&gt;
* Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.&lt;br /&gt;
* [http://eprints.qut.edu.au/38436/ Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.]&lt;/div&gt;</summary>
		<author><name>Gremid</name></author>
		
	</entry>
</feed>