Difference between revisions of "SIG:TEI for Linguists"
Piotr Banski (talk | contribs) m (+ category) |
Piotr Banski (talk | contribs) (a few changes for the beginning) |
||
Line 13: | Line 13: | ||
== Activities == | == Activities == | ||
− | The | + | The SIG activities (will) include official meetings at TEI-MMs, conference reports and e-mail exchange on the mailing list. As part of its activity, the SIG will attempt to [[TEI for Linguists - bibliography|track and record] papers that deal with using various markup standards for the purpose of encoding linguistic analyses and language resources. |
+ | |||
+ | === Meetings === | ||
+ | * First official meeting: [[Zadar, 13 November 2010, Agenda]]. (See the [[TEI for Linguists - minutes - 13nov10|preliminary minutes]], to be moved to the official space when ready.) The meeting was preceded by the LLiZ (Linguistic Lunch in Zadar) and a poster presentation. | ||
− | |||
== History == | == History == | ||
* Here's [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind1007&L=TEI-L&T=0&F=&S=&P=1668 how it began] | * Here's [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind1007&L=TEI-L&T=0&F=&S=&P=1668 how it began] | ||
* TEI Guidelines have their apocrypha as well, here's one on [http://www.tei-c.org/Activities/Workgroups/SO/sow05.xml corpus annotation]. Note that it is absolutely non-normative, included here to give credit to the original Working Group and to provide a platform to either elaborate on or to diverge from. | * TEI Guidelines have their apocrypha as well, here's one on [http://www.tei-c.org/Activities/Workgroups/SO/sow05.xml corpus annotation]. Note that it is absolutely non-normative, included here to give credit to the original Working Group and to provide a platform to either elaborate on or to diverge from. | ||
+ | |||
+ | The first official meeting of the SIG took place on 13 November 2010 at the [http://ling.unizd.hr/~tei2010/index.en.html TEI-MM in Zadar]. This meeting was preceded by a reconnaissance lunch (we liked both the intel and the food) and a Poster Slam presentation (buyakasha...). | ||
== The most relevant chapters of the Guidelines == | == The most relevant chapters of the Guidelines == | ||
Line 46: | Line 50: | ||
* [http://gate.ac.uk/ GATE] doesn't do XML see [http://thread.gmane.org/gmane.comp.ai.gate.general/5257/focus=5301 XML parsing issue: consecutive empty elements mishandled] | * [http://gate.ac.uk/ GATE] doesn't do XML see [http://thread.gmane.org/gmane.comp.ai.gate.general/5257/focus=5301 XML parsing issue: consecutive empty elements mishandled] | ||
* See also the TEI-influenced or TEI-based tools: [[Xaira]], [[Textometrie]], [[Poliqarp]] and [[Anotatornia]] | * See also the TEI-influenced or TEI-based tools: [[Xaira]], [[Textometrie]], [[Poliqarp]] and [[Anotatornia]] | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− |
Revision as of 03:55, 17 November 2010
Contents
Aims
This Special Interest Grooup is meant for those interested in linguistics, in the TEI, and in putting the two together.
Contact details, information
Activities
The SIG activities (will) include official meetings at TEI-MMs, conference reports and e-mail exchange on the mailing list. As part of its activity, the SIG will attempt to [[TEI for Linguists - bibliography|track and record] papers that deal with using various markup standards for the purpose of encoding linguistic analyses and language resources.
Meetings
- First official meeting: Zadar, 13 November 2010, Agenda. (See the preliminary minutes, to be moved to the official space when ready.) The meeting was preceded by the LLiZ (Linguistic Lunch in Zadar) and a poster presentation.
History
- Here's how it began
- TEI Guidelines have their apocrypha as well, here's one on corpus annotation. Note that it is absolutely non-normative, included here to give credit to the original Working Group and to provide a platform to either elaborate on or to diverge from.
The first official meeting of the SIG took place on 13 November 2010 at the TEI-MM in Zadar. This meeting was preceded by a reconnaissance lunch (we liked both the intel and the food) and a Poster Slam presentation (buyakasha...).
The most relevant chapters of the Guidelines
- 8. Transcriptions of Speech
- 9. Dictionaries: we need to have a plan so that the NLP community does consider this as a default vocabulary for representing NLP lexica (e.g. full form lexica)
- 15. Language Corpora
- 17. Simple Analytic Mechanisms
- 18. Feature Structures
- 20. Non-hierarchical Structures
Related SIGs
Papers/presentations?
On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), let me quote http://hal.archives-ouvertes.fr/hal-00436328/fr/ ("Standardization of the formal representation of lexical information for NLP").
A presentation by the SIG conveners is scheduled during the TEI-MM-2010 Poster Slam and the poster session on Friday the 12th at 4 p.m. in Aula Magna.
Projects
TEI projects with a linguistic focus
- FreeDict http://freedict.org/en/
Tools - reports of non-TEI linguistic tools working / not working with TEI
- GATE doesn't do XML see XML parsing issue: consecutive empty elements mishandled
- See also the TEI-influenced or TEI-based tools: Xaira, Textometrie, Poliqarp and Anotatornia