Oddbyexample
Revision as of 12:49, 20 September 2012
Contents
- 1 Synopsis
- 2 Features
- 3 User commentary
- 4 System requirements
- 5 Source code and licensing
- 6 Support for TEI
- 7 Language(s)
- 8 Documentation
- 9 Tech support
- 10 User community
- 11 Sample implementations
- 12 Current version number and date of release
- 13 History of versions
- 14 How to download or buy
- 15 Additional notes
Synopsis
This utility attempts to work out the minimal TEI customization needed to validate a collection of files. It is an XSLT (version 2) stylesheet which traverses a nominated directory tree looking for *.xml files which have <TEI> or <teiCorpus> root elements. It analyzes the collection of elements and attributes in the resulting corpus, and compares that to the whole of TEI P5. An ODD file is generated which:
- loads the required modules
- deletes any elements which are not used
- for each element, deletes any attributes (including class attributes) which that element does not use
- for every attribute which has a TEI "data.enumerated" datatype, constructs a closed <valList> enumerating the values actually used.
From this you can construct a target schema.
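As a rough illustration of the four kinds of change listed above, an ODD along these lines could result. This is a hand-written sketch, not actual output from the stylesheet: the schema identifier, the choice of modules, the deleted element and attribute, and the attribute values are all assumptions made for the example, and the tool may express the same choices differently.

```xml
<schemaSpec xmlns="http://www.tei-c.org/ns/1.0"
            ident="oddbyexample-sketch" start="TEI">
  <!-- Load only the modules the corpus needs (illustrative selection) -->
  <moduleRef key="tei"/>
  <moduleRef key="core"/>
  <moduleRef key="header"/>
  <moduleRef key="textstructure"/>

  <!-- Delete an element which never occurs in the corpus -->
  <elementSpec ident="cit" module="core" mode="delete"/>

  <!-- Keep <div>, but trim its attributes -->
  <elementSpec ident="div" module="textstructure" mode="change">
    <attList>
      <!-- Delete an attribute which is never used on this element -->
      <attDef ident="subtype" mode="delete"/>
      <!-- Close the value list to the values actually found -->
      <attDef ident="type" mode="change">
        <valList type="closed" mode="replace">
          <valItem ident="chapter"/>
          <valItem ident="section"/>
        </valList>
      </attDef>
    </attList>
  </elementSpec>
</schemaSpec>
```

In a complete ODD the <schemaSpec> sits inside the body of an ordinary TEI document, which is then processed (for example with Roma) to produce the schema.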
Features
User commentary
Please sign all comments.
System requirements
Memory capacity is an issue. It will not be able to read a very large corpus unless you have a large amount of memory to assign to Java.
Source code and licensing
open source
Support for TEI
Limitations: the utility does not attempt any of the following:
- deriving simplified content models (beyond what Roma already does)
- adding new elements and deriving a content model
- dealing with non-TEI namespaces
- generating attribute datatypes with complex regexps
- working out Schematron constraints etc
Language(s)
XSLT
Documentation
The script assumes that you have the TEI package installed, which provides a file called "/usr/share/xml/tei/odd/p5subset.xml". If you don't have that, grab http://www.tei-c.org/release/xml/tei/odd/p5subset.xml, put the file somewhere, and add a "tei" parameter to point at it.
Here's a sample command to run it:
saxon -o my.odd oddbyexample.xsl oddbyexample.xsl corpus=/wherever/you/have/yourfiles/
Tech support
User community
Sample implementations
Current version number and date of release
History of versions
How to download or buy
Grab getfiles.xsl and oddbyexample.xsl from Sourceforge (http://tei.svn.sourceforge.net/viewvc/tei/trunk/Stylesheets2/tools2/)