Difference between revisions of "GutenbergToTei.py"

From TEIWiki
Jump to navigation Jump to search
(Created page with "Category:Tools Category:Conversion and preprocessing tools == Synopsis == Converts a Gutenberg text into a minimally encoded and TEI-compliant XML file. The script buil...")
 
(comment)
 
Line 11: Line 11:
 
== User commentary ==
 
== User commentary ==
 
'''Please sign all comments.'''
 
'''Please sign all comments.'''
 +
 +
Creates an essentially blank header, but should be fairly trivially to improve to include at least some details for those people interested. [[User:Stuartyeates|Stuartyeates]] 17:45, 6 June 2012 (EDT)
  
 
== System requirements ==
 
== System requirements ==

Latest revision as of 23:45, 6 June 2012


Synopsis

Converts a Gutenberg text into a minimally encoded and TEI-compliant XML file. The script builds a teiHeader that includes the author and title of the work and then adds “text”, “body”, div, and all the p tags. The final result is a document that meets basic TEI requirements.

Features

User commentary

Please sign all comments.

Creates an essentially blank header, but should be fairly trivially to improve to include at least some details for those people interested. Stuartyeates 17:45, 6 June 2012 (EDT)

System requirements

Source code and licensing

Support for TEI

Language(s)

Written in Python

Documentation

Tech support

User community

Sample implementations

Current version number and date of release

History of versions

How to download or buy

http://www.matthewjockers.net/2010/08/26/auto-converting-project-gutenberg-text-to-tei/

Additional notes