SIG:TEI for Linguists

Aims
This Special Interest Group is meant for those interested in linguistics, in the TEI, and in putting the two together.

Contact details, information

 * Mailing list subscription page
 * Official TEI SIG page

Activities
The SIG's activities (will) include official meetings at TEI-MMs, conference reports, and e-mail exchange on the mailing list. As part of its activity, the SIG will attempt to [[TEI for Linguists - bibliography|track and record]] papers that deal with using various markup standards to encode linguistic analyses and language resources.

Meetings

 * First official meeting: Zadar, 13 November 2010, Agenda. (See the preliminary minutes, to be moved to the official space when ready.) The meeting was preceded by the LLiZ (Linguistic Lunch in Zadar) and a poster presentation.

History

 * Here's how it began
 * The TEI Guidelines have their apocrypha as well; here's one on corpus annotation. Note that it is absolutely non-normative. It is included here to give credit to the original Working Group and to provide a platform to either elaborate on or diverge from.

The first official meeting of the SIG took place on 13 November 2010 at the TEI-MM in Zadar. This meeting was preceded by a reconnaissance lunch (we liked both the intel and the food) and a Poster Slam presentation (buyakasha...).

The most relevant chapters of the Guidelines

 * 8. Transcriptions of Speech
 * 9. Dictionaries: we need a plan so that the NLP community considers this chapter a default vocabulary for representing NLP lexica (e.g. full-form lexica)
 * 15. Language Corpora
 * 17. Simple Analytic Mechanisms
 * 18. Feature Structures
 * 20. Non-hierarchical Structures

Related SIGs

 * Ontologies
 * Tools
 * Overlap

Papers/presentations?
On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), see http://hal.archives-ouvertes.fr/hal-00436328/fr/ ("Standardization of the formal representation of lexical information for NLP").

A presentation by the SIG conveners is scheduled during the TEI-MM-2010 Poster Slam and the poster session on Friday the 12th at 4 p.m. in Aula Magna.

Projects
TEI projects with a linguistic focus
 * FreeDict http://freedict.org/en/

Tools - reports of non-TEI linguistic tools working / not working with TEI

 * GATE doesn't do XML; see "XML parsing issue: consecutive empty elements mishandled"
 * See also the TEI-influenced or TEI-based tools: Xaira, Textometrie, Poliqarp and Anotatornia
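
For reports like the GATE item above, it can help to have a small baseline showing what a conformant XML parser does with consecutive empty elements. The following is a minimal sketch using Python's standard library, not a reproduction of the GATE issue itself: the sample fragment and the helper names `empty_children` and `round_trip` are made up for illustration, although `pb` and `lb` are ordinary TEI milestone elements.

```python
import xml.etree.ElementTree as ET

# Hypothetical TEI-like fragment (assumption, not taken from any real corpus):
# three consecutive empty elements sit between two stretches of text.
SAMPLE = '<p>end of page<pb n="2"/><lb/><lb/>start of next page</p>'


def empty_children(xml_text):
    """Return (tag, tail) pairs for each empty child element, in document order."""
    root = ET.fromstring(xml_text)
    return [
        (child.tag, child.tail)
        for child in root
        if child.text is None and len(child) == 0
    ]


def round_trip(xml_text):
    """Parse and re-serialize; no empty element should be lost on the way."""
    return ET.tostring(ET.fromstring(xml_text), encoding="unicode")


if __name__ == "__main__":
    # Each empty element is kept as its own node; only the last one
    # carries the text that follows the run of empty elements.
    for tag, tail in empty_children(SAMPLE):
        print(tag, repr(tail))
    print(round_trip(SAMPLE))
```

Running the same fragment through the tool under test and comparing the number and order of empty elements with this baseline is one way to make a "working / not working with TEI" report concrete.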