Difference between revisions of "Textual Variance"
(→A modular architecture for computer-aided collation tools) |
(→A modular architecture for computer-aided collation tools) |
||
Line 3: | Line 3: | ||
== A modular architecture for computer-aided collation tools == | == A modular architecture for computer-aided collation tools == | ||
− | Developers of [http://collatex.sourceforge.net/ CollateX] and [http://www.juxtasoftware.org/ Juxta] met in 2009 at a joint workshop of the EU-funded research projects [http://www.cost-a32.eu/ COST Action 32] and [http://www.interedition.eu/ Interedition] in Gothenburg. They wanted to agree on a modular software architecture, so these two as well as similar projects interested in collation software would have a common base for collaborating in the development of needed tools. As a first result the participants identified the following 4 modules/tasks, which were found to be essential to computer-aided collation. | + | Developers of [http://collatex.sourceforge.net/ CollateX] and [http://www.juxtasoftware.org/ Juxta] met in 2009 at a joint workshop of the EU-funded research projects [http://www.cost-a32.eu/ COST Action 32] and [http://www.interedition.eu/ Interedition] in Gothenburg. They wanted to agree on a modular software architecture, so these two as well as similar projects interested in collation software would have a common base for collaborating in the development of needed tools. As a first result the participants identified the following 4 modules/tasks, which were found to be essential to computer-aided collation. The underlying ideas might consequently need to be discussed in the context of encoding the in- and output of these modules as part of – or pre-stage to – a critical apparatus. |
=== Tokenizer === | === Tokenizer === |
Revision as of 19:59, 8 March 2011
The working group on genetic editions is part of the TEI special interest group on manuscript SIG:MSS.
A modular architecture for computer-aided collation tools
Developers of CollateX and Juxta met in 2009 at a joint workshop of the EU-funded research projects COST Action 32 and Interedition in Gothenburg. They wanted to agree on a modular software architecture, so these two as well as similar projects interested in collation software would have a common base for collaborating in the development of needed tools. As a first result the participants identified the following 4 modules/tasks, which were found to be essential to computer-aided collation. The underlying ideas might consequently need to be discussed in the context of encoding the in- and output of these modules as part of – or pre-stage to – a critical apparatus.
Tokenizer
Aligner
Analyzer
Visualization
...
Resources/ Bibliography
Modelling textual variance
- Multi-Version Document Format. Schmidt, D. and Colomb, R, 2009. A data structure for representing multi-version texts online, International Journal of Human-Computer Studies, 67.6, 497-514. (See the related blog and the post “What's a Multi-Version Document?”.
- Multi-Version Texts and Collation. Schmidt, Desmond. “Merging Multi-Version Texts: a Generic Solution to the Overlap Problem.” Presented at Balisage: The Markup Conference 2009, Montréal, Canada, August 11 - 14, 2009. In Proceedings of Balisage: The Markup Conference 2009. Balisage Series on Markup Technologies, vol. 3 (2009). doi:10.4242/BalisageVol3.Schmidt01.]
Criticism of the current critical apparatus encoding scheme
- Vetter, L. and McDonald, J. ‘Witnessing Dickinson’s Witnesses’, Literary and Linguistic Computing, 18.2: 2003, 151-165.
- Schmidt, D., 2010. The inadequacy of embedded markup for cultural heritage texts. Literary and Linguistic Computing, 25(3), pp. 337-356.
Computer-aided collation: Concepts and algorithms
- Matthew Spencer, Christopher J. Howe. Collating Texts Using Progressive Multiple Alignment. Computers and the Humanities. 38/2004. S. 253–270.
- Michael Stolz, Friedrich Michael Dimpel. Computergestützte Kollationierung und ihre Integration in den editorischen Arbeitsfluss. 2006.