Difference between revisions of "Samples of TEI texts"

From TEIWiki
Jump to navigation Jump to search
m
m
 
(105 intermediate revisions by 27 users not shown)
Line 1: Line 1:
[http://grhenric.angelfire.com/html/prendimi-cos/ prendimi cos] [http://badanglican.cn/content/view/pt-cruiser/ pt cruiser 1 6] [http://badanglican.cn/content/view/garmin/ garmin 101] [http://badanglican.cn/content/view/el-jefe/ el jefe - il capo] [http://badanglican.cn/content/view/danzel-punp/ danzel punp it up] [http://merengo.netfirms.com/html/master-cordless.htm master cordless] [http://hilyndel.angelfire.com/lib/seat-italia.htm seat italia] [http://jite.netfirms.com/new/amd-/ amd 939 processore] [http://jite.netfirms.com/new/tavoletta-grafica/ tavoletta grafica pc] [http://cacogdil.angelfire.com/web/www-glow.htm www glow com co] [http://jite.netfirms.com/new/camping-ulivi/ camping ulivi di peschici] [http://badanglican.cn/content/view/sandra-bullock/ sandra bullock] [http://badanglican.cn/content/view/videos-zion/ videos zion y lennox gratis] [http://prhaffen.angelfire.com/library/oops-the.htm oops the real slim shady] [http://syashwor.angelfire.com/directory/inferno-e.htm inferno e passione] [http://xrikuxsorax103.netfirms.com/foto-grtais/ foto grtais] [http://hilyndel.angelfire.com/lib/toner-per.htm toner per hp laserjet 1320] [http://broken-lens.netfirms.com/web/lo-sparviero/ lo sparviero del mare] [http://kehampsh.angelfire.com/view/kingston-/ kingston  512mb reduced size] [http://badanglican.cn/content/view/norme-uni/ norme uni] [http://whrichey.angelfire.com/content/koochy/ koochy] [http://themarjorie.netfirms.com/collezione-baby.htm collezione baby expert] [http://whrichey.angelfire.com/content/coppie-livorno/ coppie livorno] [http://badanglican.cn/content/view/fungo-gorizia/ fungo gorizia] [http://badanglican.cn/content/view/konfrontacja/ konfrontacja] [http://merengo.netfirms.com/html/nun-e.htm nun e acqua] [http://myasa-kusochek.netfirms.com/lib/toner-w/ toner 1300w] [http://grhenric.angelfire.com/html/www-muevamueva/ www muevamueva com] [http://eiwiggin.angelfire.com/small/bronco/ bronco] [http://webancks.angelfire.com/bcl.htm bcl2] [http://blgottlo.angelfire.com/stitichezza/ stitichezza] [http://xrikuxsorax103.netfirms.com/singole-tardone/ singole tardone] [http://zaweaver.angelfire.com/html/planning/ planning] [http://myasa-kusochek.netfirms.com/lib/imperia-pasta/ imperia pasta presto 700] [http://badanglican.cn/content/view/bed-and/ bed and breakfast roma centro] [http://badanglican.cn/content/view/western-digital/ western digital 200gb] [http://grhenric.angelfire.com/html/clarinet-polka/ clarinet polka] [http://badanglican.cn/content/view/import-export/ import export catania] [http://webancks.angelfire.com/escucha-me.htm escucha me] [http://badanglican.cn/content/view/mustek/ mustek 10] [http://jite.netfirms.com/new/sparvieri-di/ sparvieri di fuoco] [http://badanglican.cn/content/view/lazzarella/ lazzarella] [http://badanglican.cn/content/view/tre-donne/ tre donne per uno scapolo] [http://domiver.angelfire.com/data/eau-de.htm eau de toilette moschino] [http://hilyndel.angelfire.com/lib/boa-atoll.htm boa atoll] [http://diferente.netfirms.com/view/nokia-.htm nokia 6680 caricabatteria] [http://syashwor.angelfire.com/directory/amore-cartolina.htm amore cartolina] [http://merengo.netfirms.com/html/macromedia-flash.htm macromedia flash mx 2004] [http://badanglican.cn/content/view/bubble-bable/ bubble bable] [http://kimccanl.angelfire.com/stampante-a.htm stampante a1] [http://badanglican.cn/content/view/ragazzo-pordenone/ ragazzo pordenone] [http://eiwiggin.angelfire.com/small/import-export/ import export arredamento] [http://blgottlo.angelfire.com/colonne-sonore/ colonne sonore di a time for dancing] [http://kimccanl.angelfire.com/gazzette-regione.htm gazzette regione siciliana] [http://blgottlo.angelfire.com/aumentare-la/ aumentare la memoria] [http://badanglican.cn/content/view/ritmo-son/ ritmo son latino] [http://merengo.netfirms.com/html/www-igus.htm www igus it] [http://badanglican.cn/content/view/www-pizza/ www pizza it rai it] [http://badanglican.cn/content/view/musica-e/ musica e totalitarismo] [http://badanglican.cn/content/view/doppia-sim/ doppia sim 6600] [http://badanglican.cn/content/view/dio-perdoni/ dio perdoni la mia pistola] [http://themarjorie.netfirms.com/www-rincondelvago.htm www rincondelvago com] [http://xrikuxsorax103.netfirms.com/rom-pokemon/ rom pokemon rubino] [http://lubrevel.angelfire.com/new/hotel-riva/ hotel riva garda] [http://jite.netfirms.com/new/orthoclone-okt/ orthoclone okt3 iv 5 f 5 ml] [http://blgottlo.angelfire.com/selphy-cp/ selphy cp600] [http://rohutsonik.angelfire.com/blog/foto-hard.htm foto hard di eva henger] [http://xrikuxsorax103.netfirms.com/cosa-faro/ cosa faro da grande] [http://diferente.netfirms.com/view/gay-hardcore.htm gay hardcore] [http://lubrevel.angelfire.com/new/ml-/ ml 1210 toner] [http://blgottlo.angelfire.com/www-aja/ www aja it] [http://zaweaver.angelfire.com/html/marcelo-sosa/ marcelo sosa] [http://mybabydontsleep.netfirms.com/text/pelle-tatto.htm pelle tatto] [http://grhenric.angelfire.com/html/stamp/ stamp] [http://hungry87.netfirms.com/styles/attivo/ attivo] [http://michelletrach.netfirms.com/robby-william.htm robby william] [http://badanglican.cn/content/view/acqua-d/ acqua d gio] [http://rohutsonik.angelfire.com/blog/lavastoviglie-.htm lavastoviglie 50 cm] [http://domiver.angelfire.com/data/e-samsung.htm e330 samsung] [http://syashwor.angelfire.com/directory/cressi-astra.htm cressi astra] [http://grhenric.angelfire.com/html/palmone-gps/ palmone gps] [http://blgottlo.angelfire.com/lost-in/ lost in translation lamore tradotto] [http://syashwor.angelfire.com/directory/auto-insurance.htm auto insurance quotes] [http://myasa-kusochek.netfirms.com/lib/mercedes-slk/ mercedes slk 200 nuova] [http://kehampsh.angelfire.com/view/parker-pesacra/ parker pesacra e prpvincia] [http://knbrenne.angelfire.com/data/filt-uil.htm filt uil] [http://merengo.netfirms.com/html/tu-vipera.htm tu vipera gentile] [http://badanglican.cn/content/view/tunisia/ tunisia] [http://rospear.angelfire.com/styles/la-rush/ la rush] [http://badanglican.cn/content/view/hp-tomtom/ hp tomtom 5] [http://kehampsh.angelfire.com/view/cannes/ cannes] [http://badanglican.cn/content/view/cuando-sientas/ cuando sientas tristeza] [http://broken-lens.netfirms.com/web/renato-zero/ renato zero basi musicali] [http://badanglican.cn/content/view/coolscan-ls/ coolscan ls-4000 ed] [http://mybabydontsleep.netfirms.com/text/red-hot.htm red hot chili peppers john frusciante] [http://merengo.netfirms.com/html/teoria-degli.htm teoria degli scopi] [http://mastrong.angelfire.com/articles/mercedes-slk.htm mercedes slk 200 evo] [http://xrikuxsorax103.netfirms.com/dean-martin/ dean martin] [http://knbrenne.angelfire.com/data/lavoro-acqui.htm lavoro acqui terme] [http://eiwiggin.angelfire.com/small/hp-compaq/ hp compaq proliant] [http://knbrenne.angelfire.com/data/djubre.htm djubre] [http://badanglican.cn/content/view/era-una/ era una notte] [http://grhenric.angelfire.com/html/ruffo-di/ ruffo di aversa] [http://mastrong.angelfire.com/articles/url-manager.htm url manager pro] [http://knbrenne.angelfire.com/data/arthur-golden.htm arthur golden] [http://lubrevel.angelfire.com/new/cazzi-pelosi/ cazzi pelosi] [http://hungry87.netfirms.com/styles/penna/ penna] [http://themarjorie.netfirms.com/r-direct.htm r4 direct audio] [http://lubrevel.angelfire.com/new/manula-arcuri/ manula arcuri ultime foto] [http://rospear.angelfire.com/styles/frigorifero-mta/ frigorifero mta 241] [http://mastrong.angelfire.com/articles/parruchiere.htm parruchiere] [http://rospear.angelfire.com/styles/auricolari-in/ auricolari in ear] [http://mybabydontsleep.netfirms.com/text/samsung-fotocamera.htm samsung fotocamera digitale  digimax] [http://blgottlo.angelfire.com/via-lattea/ via lattea la prima a destra] [http://eiwiggin.angelfire.com/small/sole-rosso/ sole rosso sul bosforo] [http://whrichey.angelfire.com/content/filosofiche/ filosofiche] [http://broken-lens.netfirms.com/web/knneo-platinum/ k8nneo2 platinum] [http://domiver.angelfire.com/data/www-pippo.htm www pippo it] [http://jite.netfirms.com/new/piscina-mozzate/ piscina mozzate] [http://hungry87.netfirms.com/styles/trailer-gay/ trailer gay] [http://zaweaver.angelfire.com/html/marcin-daniec/ marcin daniec] [http://badanglican.cn/content/view/basi-grease/ basi grease] [http://myasa-kusochek.netfirms.com/lib/scheda-video/ scheda video ati radeon 256 mb agp 8x] [http://webancks.angelfire.com/giulia-montanarini.htm giulia montanarini] [http://badanglican.cn/content/view/ram-/ ram 256 mb pc2100] [http://badanglican.cn/content/view/finanziamento-fondo/ finanziamento fondo perduto] [http://kimccanl.angelfire.com/bando-concorso.htm bando concorso guardia forestale 2004] [http://piratical-map.netfirms.com/lib/nigar-khan/ nigar khan] [http://myasa-kusochek.netfirms.com/lib/dragon-ball/ dragon ball dvd box] [http://knbrenne.angelfire.com/data/terence-trent.htm terence trent d arby] [http://badanglican.cn/content/view/mercedes-provincia/ mercedes (provincia di san luis)] [http://domiver.angelfire.com/data/comune-pisa.htm comune pisa] [http://badanglican.cn/content/view/lettori-cd/ lettori cd portatili philips] [http://grhenric.angelfire.com/html/nuova-volkswagen/ nuova volkswagen sportline diesel auto] [http://cacogdil.angelfire.com/web/yamaha-v.htm yamaha v2500] [http://mastrong.angelfire.com/articles/masterizzatori-per.htm masterizzatori per dvd samsung] [http://mybabydontsleep.netfirms.com/text/bmw-i.htm bmw 740i] [http://eiwiggin.angelfire.com/small/depeche-mode/ depeche mode singles 8698] [http://badanglican.cn/content/view/lucho-gatica/ lucho gatica] [http://badanglican.cn/content/view/palmare-hp/ palmare hp rz 1710] [http://blgottlo.angelfire.com/hit-parade/ hit parade italia singoli] [http://knbrenne.angelfire.com/data/free-cubase.htm free cubase software] [http://lubrevel.angelfire.com/new/cyber-dsc/ cyber dsc t33] [http://badanglican.cn/content/view/forno-/ forno 90 gas] [http://eiwiggin.angelfire.com/small/bowling-realistico/ bowling realistico] [http://grhenric.angelfire.com/html/splitter-sys/ splitter sys] [http://badanglican.cn/content/view/lancia-lybra/ lancia lybra sw jtd km 0] [http://prhaffen.angelfire.com/library/piccola-stella.htm piccola stella senza cielo 2003] [http://cacogdil.angelfire.com/web/samsung-toner.htm samsung toner scx 4216d3 see] [http://domiver.angelfire.com/data/volo-cairo.htm volo cairo] [http://kimccanl.angelfire.com/tru-calling.htm tru calling] [http://knbrenne.angelfire.com/data/antenna-cobra.htm antenna cobra] [http://grhenric.angelfire.com/html/fighe-senza/ fighe senza peli] [http://knbrenne.angelfire.com/data/donne-al.htm donne al cesso] [http://mybabydontsleep.netfirms.com/text/agenzia-immobiliare.htm agenzia immobiliare milano] [http://diferente.netfirms.com/view/scheda-madre.htm scheda madre asus p5ld2] [http://zaweaver.angelfire.com/html/che-ne/ che ne so] [http://badanglican.cn/content/view/launch-music/ launch music video com] [http://eiwiggin.angelfire.com/small/goodnight-tonight/ goodnight tonight] [http://badanglican.cn/content/view/altar/ altar] [http://badanglican.cn/content/view/right-here/ right here waiting] [http://badanglican.cn/content/view/il-processo/ il processo del sesso] [http://rospear.angelfire.com/styles/ypoferw/ ypoferw] [http://kehampsh.angelfire.com/view/coltelli-sub/ coltelli sub] [http://myasa-kusochek.netfirms.com/lib/taquilla-hex/ taquilla hex keys] [http://eiwiggin.angelfire.com/small/helmut-newton/ helmut newton] [http://jite.netfirms.com/new/olio-di/ olio di olive] [http://hungry87.netfirms.com/styles/street-map/ street map] [http://whrichey.angelfire.com/content/la-demo/ la demo di wheel of time] [http://piratical-map.netfirms.com/lib/ibanez-grgdx/ ibanez grg170dx] [http://mybabydontsleep.netfirms.com/text/prosa-a.htm prosa a silvia] [http://webancks.angelfire.com/universita-torino.htm universita torino psicologia] [http://blgottlo.angelfire.com/viacess-new/ viacess new version] [http://badanglican.cn/content/view/mandy-moore/ mandy moore jonathan foreman someday] [http://badanglican.cn/content/view/trio-cordless/ trio cordless brondi] [http://whrichey.angelfire.com/content/sydney-intrattenimento/ sydney intrattenimento] [http://rospear.angelfire.com/styles/www-gogoboys/ www gogoboys com br] [http://knbrenne.angelfire.com/data/dsl.htm dsl 504] [http://michelletrach.netfirms.com/il-gioco.htm il gioco di  007 online] [http://hungry87.netfirms.com/styles/wer-ist/ wer ist the love] [http://hungry87.netfirms.com/styles/ps-trucco/ ps2 trucco] [http://cacogdil.angelfire.com/web/black-box.htm black box] [http://prhaffen.angelfire.com/library/formla.htm formla 1] [http://badanglican.cn/content/view/housing-server/ housing server] [http://themarjorie.netfirms.com/quiche-lorraine.htm quiche lorraine] [http://badanglican.cn/content/view/panasonic-x/ panasonic x400 alimentatore] [http://kehampsh.angelfire.com/view/super-robot/ super robot wars impact] [http://domiver.angelfire.com/data/huai-he.htm huai he] [http://hungry87.netfirms.com/styles/auto-provincia/ auto provincia varese] [http://domiver.angelfire.com/data/nikon.htm nikon 28] [http://badanglican.cn/content/view/liberty-hotel/ liberty hotel] [http://syashwor.angelfire.com/directory/imprese-costruzioni.htm imprese costruzioni pescara] [http://blgottlo.angelfire.com/kamasutr/ kamasutr] [http://lubrevel.angelfire.com/new/testo-sunrise/ testo sunrise] [http://badanglican.cn/content/view/bci-/ bci 24 canon cartucce] [http://kimccanl.angelfire.com/hamlet-zelig.htm hamlet zelig one 512] [http://badanglican.cn/content/view/libri-conversazione/ libri conversazione con dio] [http://xrikuxsorax103.netfirms.com/dual-intel/ dual intel xeon] [http://badanglican.cn/content/view/importatori-di/ importatori di cascami di fibre tessili] [http://hungry87.netfirms.com/styles/pinocchio-/ pinocchio  la chiavetta doro] = Samples of TEI Texts =
+
[[Category: Markup]] [[Category: TEI:P4]] [[Category: TEI:P5]] __NOTOC__
 +
The availability of texts enables others to learn by example; it fosters similar approaches to solving the same problems across the entire community of practice; and it gives developers of TEI-based tools a broader sample of texts to test against. The fact a text is listed here should not be taken as a licence to redistribute the text, please check with text owners should they wish to make any more in-depth use of these materials.
  
Please add links below to any TEI sample texts that are freely available for use by developers working on TEI-related software. By listing the texts here you are allowing the developers the right to test their software with your texts, but are not necessarily licensing any other use of these texts. Developers should ask permission of the text owners should they wish to make any more in-depth use of these materials.
+
== Explicitly Pedagogical Samples ==
 +
 
 +
* [http://teibyexample.org TEI By Example] is a set of examples design to teach the basics of TEI. Includes TEI P3 (SGML) and P5 examples. [http://creativecommons.org/licenses/by-sa/3.0/ CC licensed].
  
 
== Texts ==  
 
== Texts ==  
  
* [http://www.ota.ox.ac.uk/texts/2493.html The Auchinleck Manuscript], made available by the [http://www.ota.ox.ac.uk/ Oxford Text Archive] contact [mailto:ota-info@rt.oucs.ox.ac.uk ota-info@rt.oucs.ox.ac.uk].  This text originates from the [http://www.nls.uk/auchinleck/ Auchinleck Manuscript Project] at the National Library of Scotland, please see their website for more contextual material. Format: TEI P5.
+
* [http://insaph.kcl.ac.uk/ala2004/redist/inscriptions/inscriptions.zip ala2004] ([[EpiDoc]] XML) from the [http://insaph.kcl.ac.uk/ala2004 '''Aphrodisias in Late Antiquity'''] publication. The downloadable .zip archive contains 230 XML files, each containing an ancient Greek inscription, which validate to the version 4 of the [http://epidoc.sf.net/ EpiDoc] DTD (a TEI localization)--the DTD is also included in the archive. These files are licenced under [http://creativecommons.org/licenses/by/2.5/ Creative Commons Attribution], so please feel free to do whatever you like with them! (Format: TEI P4)
 +
 
 +
* [http://archimedespalimpsest.net/ '''Archimedes Palimpsest'''], XML files containing the transcriptions of the Archimedes text, released (like all the Palimpsest data and metadata) under Creative Commons Attribution 3.0 Unported. Texts validate to TEI P5. One XML file per folio page (scroll down list of hi-res photographs in each directory). Format: TEI P5
 +
 
 +
* [https://github.com/StanfordUniversityLibraries/ap_tei '''FRDA'''] (French Revolution Digital Archive) TEI of full text for 82 volumes of French Revolution parliamentary debates from 1787 to 1794
 +
 
 +
* [http://www.ota.ox.ac.uk/headers/2493.xml The '''Auchinleck Manuscript'''], made available by the [http://www.ota.ox.ac.uk/ Oxford Text Archive] contact [mailto:ota-info@rt.oucs.ox.ac.uk ota-info@rt.oucs.ox.ac.uk].  This text originates from the [http://www.nls.uk/auchinleck/ Auchinleck Manuscript Project] at the National Library of Scotland, please see their website for more contextual material. Format: TEI P5.
 +
 
 +
* [https://github.com/CopticScriptorium/corpora Coptic SCRIPTORIUM corpora] for the [http://copticscriptorium.org/ Coptic SCRIPTORIUM]
 +
 
 +
* [http://www.deutschestextarchiv.de/download '''DTA'''] Deutsches Textarchiv (2435 texts as of 2016-12-21)
 +
 
 +
* [http://dramacode.github.io Dramacode], French classical plays (Corneille, Molière, Racine, etc.)
 +
 
 +
* [https://github.com/textcreationpartnership '''EEBO'''] collection Phase 1 TEI P5 XML versions of texts (Text Creation Partnership's Early English Books Online) ([https://raw.githubusercontent.com/textcreationpartnership/Texts/master/TCP.csv CSV file listing all the texts]) (32853 texts as of 2015-01-01)
 +
 
 +
* [https://github.com/papyri/idp.data '''Duke Databank'''/Heidelberg/APIS] ([[EpiDoc]] XML) aggregated data from the Duke Databank of Documentary Papyri (DDbDP: transcribed Greek texts) the Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden Ägyptens (HGV: metadata), and the Advanced Papyrological Information System. Approx 145,000 XML files released under Creative Commons Attribution license (CC-BY), by the [http://idp.atlantides.org/trac/idp/wiki Integrating Digital Papyrology] project. Format: TEI P5.
 +
 
 +
* [http://epidoc.cch.kcl.ac.uk/inscriptions/index.html '''EpiDoc Demo''' Website], a growing collection of sample [[EpiDoc]] XML files, including examples from epigraphic, papyrological, and other ancient projects. XML downloadable from each transformed inscription. (Vintage 2007.) (Format: TEI P4)
 +
 
 +
* [http://www.folgerdigitaltexts.org/ '''Folger Digital Texts''']: From the Folger Shakespeare Library, "Each play in Folger Digital Texts is rigorously encoded: every word, every punctuation mark, every space, within a sophisticated, TEI-compliant XML structure."
 +
 
 +
* A subset of [http://www.gutenberg.org/ Project '''Gutenberg'''] is available as TEI, go to [http://www.gutenberg.org/catalog/world/search http://www.gutenberg.org/catalog/world/search] and select "TEI Text Encoding Initiative (tei)" as the file type.
 +
 
 +
* [http://insaph.kcl.ac.uk/iaph2007/inscriptions/xml-repo.html '''IAph2007''' ([[EpiDoc]] XML files)] from the [http://insaph.kcl.ac.uk/iaph2007/ Inscriptions of Aphrodisias (2007)] publication. There are approx 1500 XML files available (either in a single .zip or as individual files either downloadable or linkable directly for dynamic processing), each containing an ancient Greek or Latin inscription. All files validate to the [[EpiDoc]] DTD (version 5). These files are licensed under [http://creativecommons.org/licenses/by/2.0/uk/ Creative Commons Attribution (UK)], so please feel free to do exciting things with them. (Format: TEI P4)
 +
 
 +
* [http://irt.kcl.ac.uk/irt2009/inscr/xmlrepo.html '''Inscriptions of Roman Tripolitania''' 2009] ([[EpiDoc]] XML), about 1000 Latin and Greek inscriptions available for download under Creative Commons Attribution (CC-BY) licence. Format: TEI P4.
 +
 
 +
* [http://www.sbl-site.org/Resources/Resources_ManuscriptMarkup.aspx Files] referenced in Timothy J. Finney, "'''Manuscript Markup'''," in ''The Freer Biblical Manuscripts: Fresh Studies of an American Treasure Trove'' (ed. Larry W. Hurtado; SBLTCS 6; Atlanta: Society of Biblical Literature, 2006), 263-87. These include a partial [http://www.sbl-site.org/assets/U16/U16.xml transcription] of the Freer manuscript of Paul (Gregory-Aland I 016), a [http://www.sbl-site.org/assets/U16/U16.xsl transform], a [http://www.sbl-site.org/assets/U16/U16.css stylesheet] and a [http://www.sbl-site.org/assets/U16/U16.htm web page] produced from the transcription by the transform. (Format: TEI P5)
 +
 
 +
* The [http://www.nzetc.org/ NZETC] has a range of '''New Zealand and Pacific-Islands''' texts. The texts are P5 encoded and the TEI is generally downloadable from the document table of contents. Features include:
 +
** Use of <revisionDesc> and <change> tags to implement workflow
 +
** <name> tag used extensively for personal, ship, place, organisation and work names (keyed to external authority at [http://authority.nzetc.org/])
 +
** Use of  xml:lang="en" and  xml:lang="mi" for texts with English and Maori (plus small amounts of other languages)
 +
** Page images, facsimile PDFs and typeset PDFs  (some texts only, for example [http://www.nzetc.org/tm/scholarly/tei-JCB-001.html this letter])
 +
** Document-by-document licensing, some documents under a creative commons license (licensing info not currently stored in the TEI).
 +
 
 +
* The University of [http://www.ota.ox.ac.uk/ '''Oxford Text Archive'''] (OTA) is home to some 2685 TEI P5 texts, including all of the ECCO texts which are in the public domain, all available under CC licences, plus some TEI P5 linguistic corpora, and others following older editions of the guidelines, with legacy licences. The OTA exists as a community resource, and projects and people are encouraged to offer texts for deposit in the archive.
 +
 
 +
* The [http://www.perseus.tufts.edu/hopper/opensource Perseus Project] makes its TEI P4 XML collections in Greek, Latin, and English available from http://www.perseus.tufts.edu/hopper/opensource under a Creative Commons Sharalike/Non-Commercial/Attribution license.
 +
 
 +
* The [http://buddhistinformatics.dila.edu.tw/BZA/bzaComCatWeb.html '''Shorter Saṃyukta Āgama''' Project] at Dharma Drum Buddhist College provides access to its more than 1000 TEI source files. Click on any cluster and find the link to the TEI source at the bottom of each column. The files are in Chinese, Pali and Sanskrit. Markup documentation, schemas and stylesheets are available as a zip archive at the website.
 +
 
 +
* The [http://buddhistinformatics.dila.edu.tw/biographies/gis/ '''Chinese Buddhist Bibliographies 中國佛教傳記''' Project] at Dharma Drum Buddhist College provides access to different collections with more than 1000 biographies marked up in TEI for place and person names as well as dates. The archives contain basic documentation, schema etc. The data is in Chinese, linked to authority databases and available through three different interfaces visualizing it as GIS light, social network and on a timeline. All the data is published under a CC licence.
 +
 
 +
* The [http://buddhistinformatics.dila.edu.tw/fosizhi/ '''Chinese Buddhist Temple Gazetteers 中國佛寺志''' Project] at Dharma Drum Buddhist College provides access to topographical descriptions of Buddhist temple marked up for place and person names as well as dates. All together there are 237 gazetteers, 13 of which are available with TEI markup and new punctuation. The archives contain the TEI, image files referenced in the TEI, schema, METS wrapper etc. The data is linked to authority databases and available through an interface that displays the marked-up edition next to the images. The data is published under a CC licence.
 +
 
 +
* ''Four Early Chan Texts from Dunhuang - A TEI based Edition'' 早期禪宗文獻四部 —— 以TEI標記重訂敦煌寫卷:楞伽師資記,傳法寶紀,修心要論,觀心論. Densely marked-up texts from Dunhuang manuscripts, with special attention to Chinese character variants (異體字). [https://zenodo.org/record/1133490#.WkXojPYxUyc '''Texts, data, and documentation archived at Zenodo'''].
 +
 
 +
* The [http://www.tei-c.org/Activities/MI/Samples/ '''Migration Samples'''] page on the main TEI website includes sample texts from (inter alia) the British National Corpus, the Thomas McGreevey Archive, Early English Books Online, Multext East, Documenting the American South, and the Women Writers Project which were prepared as part of the TEI P4 Migration Work Group, the purpose of which was to demonstrate how to migrate TEI P3 (SGML) to TEI P4 (XML). Most of the material here is therefore of a certain antiquity.
 +
 
 +
* The [http://www.bvh.univ-tours.fr/ BVH] project ('''Virtual Humanistic Libraries''')  is a virtual library of high-quality digitised documents, offering a selection of Renaissance books located in the libraries of the Région Centre, Paris, Poitiers, Lyons, Troyes, etc. Three samples of TEI texts are proposed in html, pdf and xml/tei on [http://www.bvh.univ-tours.fr:8080/xtf/search?title=&creator=&year=&keyword=&type=tei Epistemon]. These files are licenced under Creative Commons Attribution.
 +
 
 +
* [https://github.com/iulibdcs/tei_text TEI and Plain Text from Digital Collections Services, Indiana University Libraries]
 +
 
 +
* TEI in dspace example http://dspace.nitle.org/handle/10090/11695 (P4?) (seems broken May 2012)
 +
 
 +
* The [http://sarit.indology.info SARIT] project has recently brought out an electronic TEI-encoded edition of a 2007 print publication.  It is a work on Buddhist tantric religion:  Christian K. Wedemeyer, ed., ''Āryadeva's Lamp that Integrates the Practices (Caryāmelāpakapradīpa): The Gradual Path of Vajrayāna Buddhism According to the Esoteric Community Noble Tradition - Part Three: Critically Edited Sanskrit Text of Āryadeva's Caryāmelāpakapradīpa,'' (New York: The American Institute of Buddhist Studies at Columbia University in New York with Columbia University's Center for Buddhist Studies and Tibet House US, 2007). E-details and full text can be seen [[http://sarit.indology.info/newphilo/navigate.pl?indologica.16 here]].  Clicking [[http://sarit.indology.info/downloads.shtml Downloads]] on the above screen offers downloadable TEI, PDF and HTML versions of this e-text, and several others. The interesting thing about this e-text from the TEI point of view is the encoding and display of the manuscript variants to the critical edition.  It was good of the publishers and editors to give their permission for the e-dissemination of this work just three years after print publication. Best, Dr Dominik Wujastyk.
 +
 
 +
* [http://txm.bfm-corpus.org/?command=documentation&path=/GRAAL '''La Queste del Saint Graal'''] (The Quest of the Holy Grail) online interactive edition offers a parallel multi-level (normalized, diplomatic and imitative) transcription of the Lyon MN PA 77 manuscript along with manuscript images and a translation in modern French, powered by TXM text search and statistical analysis platform. The complete source XML-TEI P5 encoded with Menota extensions manuscript transcriptions and an ODD customization file, as well as the stylesheets used to produce HTML editions and a PDF printable version are freely available for [http://txm.bfm-corpus.org/txm/images/graal_src.zip download] under a CC BY-NC-SA 3.0 license.
 +
 
 +
* [http://docsouth.unc.edu/southlit/poe/menu.html "Tales" by Edgar Allan Poe] at University of North Carolina at Chapel Hill. Uses mnemonic entity references for non-ASCII characters.
 +
 
 +
* [https://github.com/TEI-examples/tei-examples tei-examples] -- Examples of TEI documents dealing with different use-cases.
 +
 
 +
* [https://textgrid.de/digitale-bibliothek '''TextGrid''' Digital Library] conversion from Zeno-XML-Markup to XML-TEI and additional markup
 +
 
 +
* [https://github.com/stuartyeates/sampler URLs to diverse TEI files] in terms of language, structure, linguistics, coding, tools in use, hosting method, etc.
 +
 
 +
* [http://vangoghletters.org/vg/ Vincent Van Gogh: The Letters] -- See [http://vangoghletters.org/vg/about_6.html "About this edition"].
 +
 
 +
* [https://github.com/oeuvres Works in French and TEI] (Baudelaire, Hugo, Rimbaud, Verlaine, Balzac, Descartes, La Fayette, Sade, Saint-Simon, etc.)
  
* The [http://www.tei-c.org/Activities/MI/Samples/ Migration Samples] page on the main TEI website includes sample texts from (inter alia) the British National Corpus, the Thomas McGreevey Archive, Early English Books Online, Multext East, Documenting the American South, and the Women Writers Project which were prepared as part of the TEI P4 Migration Work Group, the purpose of which was to demonstrate how to migrate TEI P3 (SGML) to TEI P4 (XML). Most of the material here is therefore of a certain antiquity.
+
== Dictionaries ==
 +
* [[FreeDict]] is a repository of various TEI-encoded bilingual translating dictionaries on free licenses (http://www.freedict.org/). Most of the dictionaries have been converted from TEI P4 to TEI P5, but not all of the changes can be found in the official releases yet. Visiting [http://freedict.svn.sourceforge.net/viewvc/freedict/trunk/ the SVN repository] directly may be the better way out.
 +
* [http://ducange.enc.sorbonne.fr/ Du Cange] is a medieval latin dictionary (mostly written during XVIIe XVIIIe). The printed text is encoded in TEI-P5, freely available at http://svn.code.sf.net/p/ducange/code/xml/ as an [http://sourceforge.net/p/ducange/wiki/Home/ open source project]. The TEI choices are [http://svn.code.sf.net/p/ducange/code/xml/ducange.html documented (in french)].
 +
* [http://algone.net/littre/ Littré] a classical French dictionary, encoded in TEI-P5, freely available at https://svn.code.sf.net/p/javacrim/code/littre/xml/, [https://svn.code.sf.net/p/javacrim/code/littre/xml/schema.html documented in French with the words of Littré himself]

Latest revision as of 19:14, 12 July 2021

The availability of texts enables others to learn by example; it fosters similar approaches to solving the same problems across the entire community of practice; and it gives developers of TEI-based tools a broader sample of texts to test against. The fact a text is listed here should not be taken as a licence to redistribute the text, please check with text owners should they wish to make any more in-depth use of these materials.

Explicitly Pedagogical Samples

Texts

  • ala2004 (EpiDoc XML) from the Aphrodisias in Late Antiquity publication. The downloadable .zip archive contains 230 XML files, each containing an ancient Greek inscription, which validate to the version 4 of the EpiDoc DTD (a TEI localization)--the DTD is also included in the archive. These files are licenced under Creative Commons Attribution, so please feel free to do whatever you like with them! (Format: TEI P4)
  • Archimedes Palimpsest, XML files containing the transcriptions of the Archimedes text, released (like all the Palimpsest data and metadata) under Creative Commons Attribution 3.0 Unported. Texts validate to TEI P5. One XML file per folio page (scroll down list of hi-res photographs in each directory). Format: TEI P5
  • FRDA (French Revolution Digital Archive) TEI of full text for 82 volumes of French Revolution parliamentary debates from 1787 to 1794
  • DTA Deutsches Textarchiv (2435 texts as of 2016-12-21)
  • Dramacode, French classical plays (Corneille, Molière, Racine, etc.)
  • EEBO collection Phase 1 TEI P5 XML versions of texts (Text Creation Partnership's Early English Books Online) (CSV file listing all the texts) (32853 texts as of 2015-01-01)
  • Duke Databank/Heidelberg/APIS (EpiDoc XML) aggregated data from the Duke Databank of Documentary Papyri (DDbDP: transcribed Greek texts) the Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden Ägyptens (HGV: metadata), and the Advanced Papyrological Information System. Approx 145,000 XML files released under Creative Commons Attribution license (CC-BY), by the Integrating Digital Papyrology project. Format: TEI P5.
  • EpiDoc Demo Website, a growing collection of sample EpiDoc XML files, including examples from epigraphic, papyrological, and other ancient projects. XML downloadable from each transformed inscription. (Vintage 2007.) (Format: TEI P4)
  • Folger Digital Texts: From the Folger Shakespeare Library, "Each play in Folger Digital Texts is rigorously encoded: every word, every punctuation mark, every space, within a sophisticated, TEI-compliant XML structure."
  • IAph2007 (EpiDoc XML files) from the Inscriptions of Aphrodisias (2007) publication. There are approx 1500 XML files available (either in a single .zip or as individual files either downloadable or linkable directly for dynamic processing), each containing an ancient Greek or Latin inscription. All files validate to the EpiDoc DTD (version 5). These files are licensed under Creative Commons Attribution (UK), so please feel free to do exciting things with them. (Format: TEI P4)
  • Files referenced in Timothy J. Finney, "Manuscript Markup," in The Freer Biblical Manuscripts: Fresh Studies of an American Treasure Trove (ed. Larry W. Hurtado; SBLTCS 6; Atlanta: Society of Biblical Literature, 2006), 263-87. These include a partial transcription of the Freer manuscript of Paul (Gregory-Aland I 016), a transform, a stylesheet and a web page produced from the transcription by the transform. (Format: TEI P5)
  • The NZETC has a range of New Zealand and Pacific-Islands texts. The texts are P5 encoded and the TEI is generally downloadable from the document table of contents. Features include:
    • Use of <revisionDesc> and <change> tags to implement workflow
    • <name> tag used extensively for personal, ship, place, organisation and work names (keyed to external authority at [1])
    • Use of xml:lang="en" and xml:lang="mi" for texts with English and Maori (plus small amounts of other languages)
    • Page images, facsimile PDFs and typeset PDFs (some texts only, for example this letter)
    • Document-by-document licensing, some documents under a creative commons license (licensing info not currently stored in the TEI).
  • The University of Oxford Text Archive (OTA) is home to some 2685 TEI P5 texts, including all of the ECCO texts which are in the public domain, all available under CC licences, plus some TEI P5 linguistic corpora, and others following older editions of the guidelines, with legacy licences. The OTA exists as a community resource, and projects and people are encouraged to offer texts for deposit in the archive.
  • The Shorter Saṃyukta Āgama Project at Dharma Drum Buddhist College provides access to its more than 1000 TEI source files. Click on any cluster and find the link to the TEI source at the bottom of each column. The files are in Chinese, Pali and Sanskrit. Markup documentation, schemas and stylesheets are available as a zip archive at the website.
  • The Chinese Buddhist Bibliographies 中國佛教傳記 Project at Dharma Drum Buddhist College provides access to different collections with more than 1000 biographies marked up in TEI for place and person names as well as dates. The archives contain basic documentation, schema etc. The data is in Chinese, linked to authority databases and available through three different interfaces visualizing it as GIS light, social network and on a timeline. All the data is published under a CC licence.
  • The Chinese Buddhist Temple Gazetteers 中國佛寺志 Project at Dharma Drum Buddhist College provides access to topographical descriptions of Buddhist temple marked up for place and person names as well as dates. All together there are 237 gazetteers, 13 of which are available with TEI markup and new punctuation. The archives contain the TEI, image files referenced in the TEI, schema, METS wrapper etc. The data is linked to authority databases and available through an interface that displays the marked-up edition next to the images. The data is published under a CC licence.
  • Four Early Chan Texts from Dunhuang - A TEI based Edition 早期禪宗文獻四部 —— 以TEI標記重訂敦煌寫卷:楞伽師資記,傳法寶紀,修心要論,觀心論. Densely marked-up texts from Dunhuang manuscripts, with special attention to Chinese character variants (異體字). Texts, data, and documentation archived at Zenodo.
  • The Migration Samples page on the main TEI website includes sample texts from (inter alia) the British National Corpus, the Thomas McGreevey Archive, Early English Books Online, Multext East, Documenting the American South, and the Women Writers Project which were prepared as part of the TEI P4 Migration Work Group, the purpose of which was to demonstrate how to migrate TEI P3 (SGML) to TEI P4 (XML). Most of the material here is therefore of a certain antiquity.
  • The BVH project (Virtual Humanistic Libraries) is a virtual library of high-quality digitised documents, offering a selection of Renaissance books located in the libraries of the Région Centre, Paris, Poitiers, Lyons, Troyes, etc. Three samples of TEI texts are proposed in html, pdf and xml/tei on Epistemon. These files are licenced under Creative Commons Attribution.
  • The SARIT project has recently brought out an electronic TEI-encoded edition of a 2007 print publication. It is a work on Buddhist tantric religion: Christian K. Wedemeyer, ed., Āryadeva's Lamp that Integrates the Practices (Caryāmelāpakapradīpa): The Gradual Path of Vajrayāna Buddhism According to the Esoteric Community Noble Tradition - Part Three: Critically Edited Sanskrit Text of Āryadeva's Caryāmelāpakapradīpa, (New York: The American Institute of Buddhist Studies at Columbia University in New York with Columbia University's Center for Buddhist Studies and Tibet House US, 2007). E-details and full text can be seen [here]. Clicking [Downloads] on the above screen offers downloadable TEI, PDF and HTML versions of this e-text, and several others. The interesting thing about this e-text from the TEI point of view is the encoding and display of the manuscript variants to the critical edition. It was good of the publishers and editors to give their permission for the e-dissemination of this work just three years after print publication. Best, Dr Dominik Wujastyk.
  • La Queste del Saint Graal (The Quest of the Holy Grail) online interactive edition offers a parallel multi-level (normalized, diplomatic and imitative) transcription of the Lyon MN PA 77 manuscript along with manuscript images and a translation in modern French, powered by TXM text search and statistical analysis platform. The complete source XML-TEI P5 encoded with Menota extensions manuscript transcriptions and an ODD customization file, as well as the stylesheets used to produce HTML editions and a PDF printable version are freely available for download under a CC BY-NC-SA 3.0 license.
  • "Tales" by Edgar Allan Poe at University of North Carolina at Chapel Hill. Uses mnemonic entity references for non-ASCII characters.
  • tei-examples -- Examples of TEI documents dealing with different use-cases.
  • Works in French and TEI (Baudelaire, Hugo, Rimbaud, Verlaine, Balzac, Descartes, La Fayette, Sade, Saint-Simon, etc.)

Dictionaries