GutenbergToTei.py

Synopsis
Converts a Gutenberg text into a minimally encoded and TEI-compliant XML file. The script builds a teiHeader that includes the author and title of the work and then adds “text”, “body”, div, and all the p tags. The final result is a document that meets basic TEI requirements.

User commentary
Please sign all comments.

Language(s)
Written in Python

How to download or buy
http://www.matthewjockers.net/2010/08/26/auto-converting-project-gutenberg-text-to-tei/