GutenbergToTei.py
Jump to navigation
Jump to search
Contents
- 1 Synopsis
- 2 Features
- 3 User commentary
- 4 System requirements
- 5 Source code and licensing
- 6 Support for TEI
- 7 Language(s)
- 8 Documentation
- 9 Tech support
- 10 User community
- 11 Sample implementations
- 12 Current version number and date of release
- 13 History of versions
- 14 How to download or buy
- 15 Additional notes
Synopsis
Converts a Gutenberg text into a minimally encoded and TEI-compliant XML file. The script builds a teiHeader that includes the author and title of the work and then adds “text”, “body”, div, and all the p tags. The final result is a document that meets basic TEI requirements.
Features
User commentary
Please sign all comments.
Creates an essentially blank header, but should be fairly trivially to improve to include at least some details for those people interested. Stuartyeates 17:45, 6 June 2012 (EDT)
System requirements
Source code and licensing
Support for TEI
Language(s)
Written in Python
Documentation
Tech support
User community
Sample implementations
Current version number and date of release
History of versions
How to download or buy
http://www.matthewjockers.net/2010/08/26/auto-converting-project-gutenberg-text-to-tei/