SIG:TEI for Linguists
Aims
This Special Interest Group is meant for those interested in linguistics, in the TEI, and in putting the two together.
Contact details, information
Activities
The first official meeting of the SIG is scheduled for Saturday, 13 November 2010 at the TEI-MM in Zadar, in room PDS-1.
See also the section on LLiZ below.
History
- Here's how it began
- The TEI Guidelines have their apocrypha as well; here's one on corpus annotation. Note that it is absolutely non-normative. It is included here to give credit to the original Working Group and to provide a platform to either elaborate on or diverge from.
The most relevant chapters of the Guidelines
- 8. Transcriptions of Speech
- 9. Dictionaries: we need a plan for getting the NLP community to consider this chapter a default vocabulary for representing NLP lexica (e.g. full-form lexica)
- 15. Language Corpora
- 17. Simple Analytic Mechanisms
- 18. Feature Structures
- 20. Non-hierarchical Structures
Related SIGs
Papers/presentations?
On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), let me quote http://hal.archives-ouvertes.fr/hal-00436328/fr/ ("Standardization of the formal representation of lexical information for NLP").
A presentation by the SIG conveners is scheduled during the TEI-MM-2010 Poster Slam and the poster session on Friday the 12th at 4 p.m. in Aula Magna.
Projects
TEI projects with a linguistic focus
- FreeDict http://freedict.org/en/
Tools - reports of non-TEI linguistic tools working / not working with TEI
- GATE doesn't do XML; see "XML parsing issue: consecutive empty elements mishandled"
- See also the TEI-influenced or TEI-based tools: Xaira, Textometrie, Poliqarp and Anotatornia
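For anyone who wants to add a report to the list above, here is a minimal sketch of one way to check whether a tool keeps consecutive empty elements intact. It uses only Python's standard xml.etree.ElementTree and calls no GATE API. The sample markup and the function names (element_profile, survives_round_trip) are made up for illustration, and the "processed" string stands in for whatever the tool under test exports.

```python
import xml.etree.ElementTree as ET

# Hypothetical TEI-like fragment with three consecutive empty elements.
SAMPLE = '<p>first line<lb/><pb n="2"/><lb/>second line</p>'


def element_profile(xml_text):
    """List (tag, text, tail) for every element, in document order."""
    root = ET.fromstring(xml_text)
    return [
        (el.tag, (el.text or "").strip(), (el.tail or "").strip())
        for el in root.iter()
    ]


def survives_round_trip(original, processed):
    """True if both documents have the same elements and text placement."""
    return element_profile(original) == element_profile(processed)


if __name__ == "__main__":
    # Stand-in for a tool's export in which one empty element was lost.
    damaged = '<p>first line<lb/><pb n="2"/>second line</p>'
    print(survives_round_trip(SAMPLE, SAMPLE))
    print(survives_round_trip(SAMPLE, damaged))
```

The comparison ignores attributes and whitespace-only differences, so it only flags elements that were dropped, reordered, or had their surrounding text moved.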
LLiZ (Linguistic Lunch in Zadar)
The idea is to meet at an informal lunch during the TEI-MM in Zadar to see what common goals we may have and what we want to do about them.
Date: Thursday the 11th, slightly after 12.30
Place: We will gather at Aula Magna and then proceed to "Pet bunara".
List of participants (add your name):
- Elena Pierazzo (who lit the spark, inspired by Piotr and Adam's talk -- or so they want to think)
- Piotr Bański (who dropped the last drop and suggested the meeting)
- Espen Ore
- Eleonora Litta (sadly not present at LLiZ, but she'll be there for the SIG meeting, via skype, if it works)
- Sabine Bartsch
- Andreas Witt
- Laurent Romary
- Lou Burnard
- (quite possibly Tomaž Erjavec and Damir Ćavar, says Piotr)
- Beata Wójtowicz
- Serge Heiden
- ...
SIG Meetings
- First meeting: Zadar, 13 November 2010, Agenda.