Difference between revisions of "SIG:TEI for Linguists"
Piotr Banski (talk | contribs) (update) |
Piotr Banski (talk | contribs) m (Reverted edits by Piotr Banski (talk) to last revision by Sheiden) |
||
Line 1: | Line 1: | ||
+ | [[Category:Community]] | ||
+ | [[Category:SIG|Linguistics]] | ||
+ | [[Category:SIG:Linguistics| ]] | ||
+ | |||
+ | == Aims == | ||
+ | |||
+ | This Special Interest Group is meant for those interested in linguistics, in the TEI, and in putting the two together. | ||
+ | |||
+ | == Contact details, information == | ||
+ | |||
+ | * [http://listserv.brown.edu/archives/cgi-bin/wa?A0=TEI-LINGUISTICS Mailing list subscription page] | ||
+ | * [http://www.tei-c.org/Activities/SIG/TEI_for_Linguists/ Official TEI SIG page] | ||
+ | |||
+ | == Activities == | ||
+ | The SIG activities (will) include official meetings at TEI-MMs, conference reports and e-mail exchange on the mailing list. As part of its activity, the SIG will attempt to [[SIG:Linguistics - bibliography|track and record]] papers that deal with using various markup standards for the purpose of encoding linguistic analyses and language resources. | ||
+ | |||
+ | === SF Tickets === | ||
+ | This is an informal list of tickets in the scope of interest of our SIG. '''Edit as appropriate.''' | ||
+ | |||
+ | * Bugs #3174515 [https://sourceforge.net/tracker/?func=detail&atid=644062&aid=3174515&group_id=106328 suggested values of form@type] | ||
+ | * FR #3266021 [https://sourceforge.net/tracker/?func=detail&atid=644065&aid=3266021&group_id=106328 dictionary entries with a single sense] | ||
+ | * Bugs #3080658 [https://sourceforge.net/tracker/?func=detail&atid=644062&aid=3080658&group_id=106328 definition of 'entry'] | ||
+ | * Bugs #3285020 [https://sourceforge.net/tracker/?func=detail&aid=3285020&group_id=106328&atid=644062 irregularities in <gram> syntactic sugar variants] | ||
+ | * FR #3288293 [https://sourceforge.net/tracker/?func=detail&atid=644065&aid=3288293&group_id=106328 create @refLang, add it to att.pointing] | ||
+ | |||
=== Meetings === | === Meetings === | ||
* The first official meeting took place in Zadar, on 13 November 2010; see the [[Zadar, 13 November 2010, Agenda|agenda]] and the [[TEI for Linguists - minutes - 13nov10|minutes]]. The meeting was preceded by the LLiZ (Linguistic Lunch in Zadar) and a poster presentation. | * The first official meeting took place in Zadar, on 13 November 2010; see the [[Zadar, 13 November 2010, Agenda|agenda]] and the [[TEI for Linguists - minutes - 13nov10|minutes]]. The meeting was preceded by the LLiZ (Linguistic Lunch in Zadar) and a poster presentation. | ||
− | * The second meeting took place in [[LingSIG in Würzburg, Oct 2011|Würzburg, on 14 October 2011]]. The meeting presented | + | * The second meeting took place in [[LingSIG in Würzburg, Oct 2011|Würzburg, on 14 October 2011]]. The meeting presented the [https://www.zotero.org/groups/tei-lingsig/items LingSIG bibliography, maintained as a Zotero library], for everyone to use and add to. Its recurring theme was "keep the SIG aware about the stuff you're doing".
− | + | ||
− | * The | + | == History == |
+ | * Here's [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind1007&L=TEI-L&T=0&F=&S=&P=1668 how it began] | ||
+ | * The TEI Guidelines have their apocrypha as well; here is one on [http://www.tei-c.org/Activities/Workgroups/SO/sow05.xml corpus annotation]. Note that it is strictly non-normative; it is included here to give credit to the original Working Group and to provide a platform to either elaborate on or diverge from. | ||
+ | |||
+ | The first official meeting of the SIG took place on 13 November 2010 at the [http://ling.unizd.hr/~tei2010/index.en.html TEI-MM in Zadar]. This meeting was preceded by a reconnaissance lunch (we liked both the intel and the food) and a Poster Slam presentation (boyakasha...). | ||
+ | |||
+ | == The most relevant chapters of the Guidelines == | ||
+ | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/TS.html 8. Transcriptions of Speech] | ||
+ | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/DI.html 9. Dictionaries]: we need a plan for getting the NLP community to consider this a default vocabulary for representing NLP lexica (e.g. full-form lexica) | ||
+ | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CC.html 15. Language Corpora] | ||
+ | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/AI.html 17. Simple Analytic Mechanisms] | ||
+ | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/FS.html 18. Feature Structures] | ||
+ | * [http://www.tei-c.org/release/doc/tei-p5-doc/en/html/NH.html 20. Non-hierarchical Structures] | ||
+ | |||
+ | == Related SIGs == | ||
+ | * [[SIG:Ontologies|Ontologies]] | ||
+ | * [[SIG:Tools|Tools]] | ||
+ | * [[SIG:Overlap|Overlap]] | ||
+ | |||
+ | == Bibliography == | ||
+ | The SIG would like to put together and maintain a bibliography of works relevant to linguistics and markup (TEI in particular, but not exclusively -- the world is full of sources of inspiration). The [[SIG:Linguistics - bibliography |bibliography]] is maintained as a Zotero resource. | ||
+ | |||
+ | == Projects == | ||
+ | TEI projects with a linguistic focus | ||
+ | * FreeDict http://freedict.org/en/ | ||
+ | |||
+ | == Tools - reports of non-TEI linguistic tools working / not working with TEI == | ||
+ | * [http://gate.ac.uk/ GATE] mishandles some XML input; see [http://thread.gmane.org/gmane.comp.ai.gate.general/5257/focus=5301 XML parsing issue: consecutive empty elements mishandled] | ||
+ | * See also the TEI-influenced or TEI-based tools: [[Xaira]], [[TXM]], [[Poliqarp]], [[Philologic]] and [[Anotatornia]] |
Revision as of 15:11, 17 October 2013
Aims
This Special Interest Group is meant for those interested in linguistics, in the TEI, and in putting the two together.
Contact details, information
- Mailing list subscription page
- Official TEI SIG page
Activities
The SIG activities (will) include official meetings at TEI-MMs, conference reports and e-mail exchange on the mailing list. As part of its activity, the SIG will attempt to track and record papers that deal with using various markup standards for the purpose of encoding linguistic analyses and language resources.
SF Tickets
This is an informal list of tickets in the scope of interest of our SIG. Edit as appropriate.
- Bugs #3174515 suggested values of form@type
- FR #3266021 dictionary entries with a single sense
- Bugs #3080658 definition of 'entry'
- Bugs #3285020 irregularities in <gram> syntactic sugar variants
- FR #3288293 create @refLang, add it to att.pointing
Meetings
- The first official meeting took place in Zadar, on 13 November 2010; see the agenda and the minutes. The meeting was preceded by the LLiZ (Linguistic Lunch in Zadar) and a poster presentation.
- The second meeting took place in Würzburg, on 14 October 2011. The meeting presented the LingSIG bibliography, maintained as a Zotero library, for everyone to use and add to. Its recurring theme was "keep the SIG aware about the stuff you're doing".
History
- Here's how it began
- The TEI Guidelines have their apocrypha as well; here is one on corpus annotation. Note that it is strictly non-normative; it is included here to give credit to the original Working Group and to provide a platform to either elaborate on or diverge from.
The first official meeting of the SIG took place on 13 November 2010 at the TEI-MM in Zadar. This meeting was preceded by a reconnaissance lunch (we liked both the intel and the food) and a Poster Slam presentation (boyakasha...).
The most relevant chapters of the Guidelines
- 8. Transcriptions of Speech
- 9. Dictionaries: we need a plan for getting the NLP community to consider this a default vocabulary for representing NLP lexica (e.g. full-form lexica)
- 15. Language Corpora
- 17. Simple Analytic Mechanisms
- 18. Feature Structures
- 20. Non-hierarchical Structures
Related SIGs
- Ontologies
- Tools
- Overlap
Bibliography
The SIG would like to put together and maintain a bibliography of works relevant to linguistics and markup (TEI in particular, but not exclusively -- the world is full of sources of inspiration). The bibliography is maintained as a Zotero resource.
Projects
TEI projects with a linguistic focus
- FreeDict http://freedict.org/en/
Tools - reports of non-TEI linguistic tools working / not working with TEI
- GATE mishandles some XML input; see XML parsing issue: consecutive empty elements mishandled
- See also the TEI-influenced or TEI-based tools: Xaira, TXM, Poliqarp, Philologic and Anotatornia