Difference between revisions of "SIG:TEI for Linguists"
Piotr Banski (talk | contribs) m (TEI for linguists moved to SIG: TEI for Linguists: to keep the historical records... :-)) |
Piotr Banski (talk | contribs) (quick make-up) |
||
Line 1: | Line 1: | ||
[[Category:Community]] | [[Category:Community]] | ||
− | + | [[Category:SIG|Linguistics]] | |
− | + | == Aims == | |
− | == | + | This Special Interest Grooup is meant for those interested in linguistics, in the TEI, and in putting the two together. |
− | ==== | + | |
+ | == Contact details, information == | ||
+ | |||
+ | * [http://listserv.brown.edu/archives/cgi-bin/wa?A0=TEI-LINGUISTICS Mailing list subscription page] | ||
+ | * [http://www.tei-c.org/Activities/SIG/TEI_for_Linguists/ Official TEI SIG page] (forthcoming, expect 404/500 error) | ||
+ | |||
+ | == Activities == | ||
+ | The first official meeting of the SIG is scheduler for Saturday, 13 November 2010 at the [http://www.tei-c.org/conftool/index.php?page=browseSessions TEI-MM in Zadar], in room PDS-1. | ||
+ | |||
+ | See also the section on [[#LLiZ (Linguistic Lunch in Zadar)|LLiZ]] below. | ||
+ | |||
+ | == History == | ||
* Here's [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind1007&L=TEI-L&T=0&F=&S=&P=1668 how it began] | * Here's [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind1007&L=TEI-L&T=0&F=&S=&P=1668 how it began] | ||
* TEI Guidelines have their apocrypha as well, here's one on [http://www.tei-c.org/Activities/Workgroups/SO/sow05.xml corpus annotation]. Note that it is absolutely non-normative, included here to give credit to the original Working Group and to provide a platform to either elaborate on or to diverge from. | * TEI Guidelines have their apocrypha as well, here's one on [http://www.tei-c.org/Activities/Workgroups/SO/sow05.xml corpus annotation]. Note that it is absolutely non-normative, included here to give credit to the original Working Group and to provide a platform to either elaborate on or to diverge from. | ||
− | + | ||
+ | == The most relevant chapters of the Guidelines == | ||
* [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TS.html 8. Transcriptions of Speech] | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TS.html 8. Transcriptions of Speech] | ||
* [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/DI.html 9. Dictionaries]: we need to have a plan so that the NLP community does consider this as a default vocabulary for representing NLP lexica (e.g. full form lexica) | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/DI.html 9. Dictionaries]: we need to have a plan so that the NLP community does consider this as a default vocabulary for representing NLP lexica (e.g. full form lexica) | ||
Line 16: | Line 28: | ||
* [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/NH.html 20. Non-hierarchical Structures] | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/NH.html 20. Non-hierarchical Structures] | ||
− | + | == Related SIGs == | |
* [[SIG:Ontologies|Ontologies]] | * [[SIG:Ontologies|Ontologies]] | ||
* [[SIG:Tools|Tools]] | * [[SIG:Tools|Tools]] | ||
* [[SIG:Overlap|Overlap]] | * [[SIG:Overlap|Overlap]] | ||
− | + | ||
+ | == Papers/presentations? == | ||
On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), let me quote http://hal.archives-ouvertes.fr/hal-00436328/fr/ ("Standardization of the formal representation of lexical information for NLP"). | On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), let me quote http://hal.archives-ouvertes.fr/hal-00436328/fr/ ("Standardization of the formal representation of lexical information for NLP"). | ||
+ | |||
+ | A presentation by the SIG conveners is scheduled during the TEI-MM-2010 Poster Slam and the poster session on Friday the 12th at 4 p.m. in Aula Magna. | ||
==== Projects ==== | ==== Projects ==== | ||
Line 27: | Line 42: | ||
* FreeDict http://freedict.org/en/ | * FreeDict http://freedict.org/en/ | ||
− | + | == Tools - reports of non-TEI linguistic tools working / not working with TEI == | |
* [http://gate.ac.uk/ GATE] doesn't do XML see [http://thread.gmane.org/gmane.comp.ai.gate.general/5257/focus=5301 XML parsing issue: consecutive empty elements mishandled] | * [http://gate.ac.uk/ GATE] doesn't do XML see [http://thread.gmane.org/gmane.comp.ai.gate.general/5257/focus=5301 XML parsing issue: consecutive empty elements mishandled] | ||
* See also the TEI-influenced or TEI-based tools: [[Xaira]], [[Textometrie]], [[Poliqarp]] and [[Anotatornia]] | * See also the TEI-influenced or TEI-based tools: [[Xaira]], [[Textometrie]], [[Poliqarp]] and [[Anotatornia]] | ||
− | + | == LLiZ (Linguistic Lunch in Zadar) == | |
The idea is to meet at an informal lunch during the [http://ling.unizd.hr/~tei2010/index.en.html TEI-MM in Zadar] to see what common goals we may have and what we want to do about them. | The idea is to meet at an informal lunch during the [http://ling.unizd.hr/~tei2010/index.en.html TEI-MM in Zadar] to see what common goals we may have and what we want to do about them. | ||
− | '''Date''': (let's decide | + | '''Date''': <span style="color:red">(let's decide whether we want a separate lunch, e.g. on Wednesday? or whether we turn it into a SIG breakfast on [http://www.tei-c.org/conftool/index.php?page=browseSessions Sunday at 9.00 in room PDS-1])</span> |
− | '''Place''': (let's decide in November) | + | '''Place''': (let's decide in November; see above) |
'''List of participants''' (add your name): | '''List of participants''' (add your name): |
Revision as of 15:58, 2 November 2010
Contents
Aims
This Special Interest Grooup is meant for those interested in linguistics, in the TEI, and in putting the two together.
Contact details, information
- Mailing list subscription page
- Official TEI SIG page (forthcoming, expect 404/500 error)
Activities
The first official meeting of the SIG is scheduler for Saturday, 13 November 2010 at the TEI-MM in Zadar, in room PDS-1.
See also the section on LLiZ below.
History
- Here's how it began
- TEI Guidelines have their apocrypha as well, here's one on corpus annotation. Note that it is absolutely non-normative, included here to give credit to the original Working Group and to provide a platform to either elaborate on or to diverge from.
The most relevant chapters of the Guidelines
- 8. Transcriptions of Speech
- 9. Dictionaries: we need to have a plan so that the NLP community does consider this as a default vocabulary for representing NLP lexica (e.g. full form lexica)
- 15. Language Corpora
- 17. Simple Analytic Mechanisms
- 18. Feature Structures
- 20. Non-hierarchical Structures
Related SIGs
Papers/presentations?
On using the TEI dictionary chapter as a default implementation of ISO 24613 (Lexical Markup Framework), let me quote http://hal.archives-ouvertes.fr/hal-00436328/fr/ ("Standardization of the formal representation of lexical information for NLP").
A presentation by the SIG conveners is scheduled during the TEI-MM-2010 Poster Slam and the poster session on Friday the 12th at 4 p.m. in Aula Magna.
Projects
TEI projects with a linguistic focus
- FreeDict http://freedict.org/en/
Tools - reports of non-TEI linguistic tools working / not working with TEI
- GATE doesn't do XML see XML parsing issue: consecutive empty elements mishandled
- See also the TEI-influenced or TEI-based tools: Xaira, Textometrie, Poliqarp and Anotatornia
LLiZ (Linguistic Lunch in Zadar)
The idea is to meet at an informal lunch during the TEI-MM in Zadar to see what common goals we may have and what we want to do about them.
Date: (let's decide whether we want a separate lunch, e.g. on Wednesday? or whether we turn it into a SIG breakfast on Sunday at 9.00 in room PDS-1)
Place: (let's decide in November; see above)
List of participants (add your name):
- Elena Pierazzo (who lit the spark, inspired by Piotr and Adam's talk -- or so they want to think)
- Piotr Bański (who dropped the last drop and suggested the meeting)
- Espen Ore
- Eleonora Litta
- Sabine Bartsch
- Andreas Witt
- Laurent Romary
- Lou Burnard
- ...