SIG:TEI for Linguists

This is an informal meeting point for those interested in linguistics, in the TEI, and in putting the two together.

Mailing list:

History

 * Here's how it began
 * The TEI Guidelines have their apocrypha as well; here is one on corpus annotation. Note that it is strictly non-normative. It is included here to give credit to the original Working Group and to provide a platform to either elaborate on or diverge from.

The most relevant chapters of the Guidelines

 * 8. Transcriptions of Speech
 * 9. Dictionaries: we need a plan for getting the NLP community to consider this chapter a default vocabulary for representing NLP lexica (e.g. full-form lexica)
 * 15. Language Corpora
 * 17. Simple Analytic Mechanisms
 * 18. Feature Structures
 * 20. Non-hierarchical Structures
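
To make the note on chapter 9 more concrete, here is a rough sketch of what a full-form lexicon entry might look like when encoded with elements from the Dictionaries chapter. It is built in Python purely for illustration. The helper names (`tei`, `full_form_entry`), the `type` values "lemma" and "inflected", and the sample word are assumptions of this sketch, not an agreed SIG recommendation.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)  # serialize TEI as the default namespace


def tei(tag):
    """Return the namespace-qualified name of a TEI element (hypothetical helper)."""
    return f"{{{TEI_NS}}}{tag}"


def full_form_entry(lemma, pos, forms):
    """Build one <entry>: a lemma <form>, a <gramGrp> with the part of speech,
    and one inflected <form> per (orthography, number) pair.
    The type values "lemma" and "inflected" are assumptions of this sketch."""
    entry = ET.Element(tei("entry"))
    lemma_form = ET.SubElement(entry, tei("form"), {"type": "lemma"})
    ET.SubElement(lemma_form, tei("orth")).text = lemma
    gram_grp = ET.SubElement(entry, tei("gramGrp"))
    ET.SubElement(gram_grp, tei("pos")).text = pos
    for orth, number in forms:
        form = ET.SubElement(entry, tei("form"), {"type": "inflected"})
        ET.SubElement(form, tei("orth")).text = orth
        form_gram = ET.SubElement(form, tei("gramGrp"))
        ET.SubElement(form_gram, tei("number")).text = number
    return entry


entry = full_form_entry("mouse", "noun", [("mouse", "singular"), ("mice", "plural")])
xml_text = ET.tostring(entry, encoding="unicode")
print(xml_text)
```

Whether inflected forms should sit directly inside the entry, as here, or be nested under the lemma form is exactly the kind of question the SIG could settle.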

Related SIGs

 * Ontologies
 * Tools
 * Overlap

Papers/presentations?
On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), see "Standardization of the formal representation of lexical information for NLP": http://hal.archives-ouvertes.fr/hal-00436328/fr/

Projects
TEI projects with a linguistic focus
 * FreeDict http://freedict.org/en/

Tools - reports of non-TEI linguistic tools working / not working with TEI

 * GATE doesn't do XML; see "XML parsing issue: consecutive empty elements mishandled"
 * See also the TEI-influenced or TEI-based tools: Xaira, Textometrie, Poliqarp and Anotatornia

LLiZ (Linguistic Lunch in Zadar)
The idea is to meet at an informal lunch during the TEI-MM in Zadar to see what common goals we may have and what we want to do about them.

Date: (let's decide around October)

Place: (let's decide in November)

List of participants (add your name):
 * Elena Pierazzo (who lit the spark, inspired by Piotr and Adam's talk -- or so they want to think)
 * Piotr Bański (who dropped the last drop and suggested the meeting)
 * Espen Ore
 * Eleonora Litta Modignani Picozzi
 * Sabine Bartsch
 * Andreas Witt
 * Laurent Romary
 * Lou Burnard