Converts a Project Gutenberg text into a minimally encoded, TEI-compliant XML file. The script builds a teiHeader containing the author and title of the work, then wraps the content in text, body, and div elements and adds the p tags. The result is a document that meets basic TEI requirements.
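
The general shape of such a conversion can be sketched in Python as follows. This is only an illustration under stated assumptions, not the script's actual code: the function name text_to_tei, the rule that paragraphs are separated by blank lines, and the placeholder wording in publicationStmt and sourceDesc are all hypothetical.

```python
import re
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"


def text_to_tei(raw_text, title, author):
    """Wrap plain text in a minimal TEI skeleton (hypothetical helper)."""
    # Serialize the TEI namespace as the default namespace (no prefix).
    ET.register_namespace("", TEI_NS)

    def el(parent, tag, text=None):
        # Create a namespaced child element, optionally with text content.
        child = ET.SubElement(parent, f"{{{TEI_NS}}}{tag}")
        if text is not None:
            child.text = text
        return child

    tei = ET.Element(f"{{{TEI_NS}}}TEI")

    # Header: fileDesc with titleStmt, publicationStmt, and sourceDesc.
    header = el(tei, "teiHeader")
    file_desc = el(header, "fileDesc")
    title_stmt = el(file_desc, "titleStmt")
    el(title_stmt, "title", title)
    el(title_stmt, "author", author)
    # Placeholder statements; the wording here is an assumption.
    el(el(file_desc, "publicationStmt"), "p", "Unpublished conversion.")
    el(el(file_desc, "sourceDesc"), "p",
       "Converted from a Project Gutenberg plain-text file.")

    # Body: text > body > div, with one p per paragraph.
    div = el(el(el(tei, "text"), "body"), "div")
    # Assumption: paragraphs are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", raw_text.strip()):
        # Collapse hard line wraps inside a paragraph into single spaces.
        paragraph = " ".join(block.split())
        if paragraph:
            el(div, "p", paragraph)

    return ET.tostring(tei, encoding="unicode")


if __name__ == "__main__":
    sample = "First line\nwrapped.\n\nSecond paragraph.\n"
    print(text_to_tei(sample, "A Title", "An Author"))
```

A real Gutenberg file also carries licence boilerplate at the start and end, which this sketch does not strip.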


User commentary

Creates an essentially blank header, but it should be fairly trivial to improve so that it includes at least some details for those who are interested. Stuartyeates 17:45, 6 June 2012 (EDT)

