Difference between revisions of "User:Piotr Banski"
Piotr Banski (talk | contribs) m (+ more info) |
Piotr Banski (talk | contribs) (update) |
||
Line 3: | Line 3: | ||
University of [[wikipedia:Warsaw|Warsaw]], [[wikipedia:Poland|Poland]] | University of [[wikipedia:Warsaw|Warsaw]], [[wikipedia:Poland|Poland]] | ||
− | I was involved in the creation of the [http://korpus.pl/index.php?lang=en&page=welcome IPI PAN Corpus of Modern Polish], mostly experimenting with the possibilities that the TEI and CES offered for its architecture. | + | I was involved in the creation of the [http://korpus.pl/index.php?lang=en&page=welcome IPI PAN Corpus of Modern Polish], mostly experimenting with the possibilities that the TEI and the [http://www.cs.vassar.edu/CES/ CES] offered for its architecture. I was also hired as XML architect for the [http://nkjp.pl/index.php?page=0&lang=1 Polish National Corpus]. |
− | + | My current projects include the open-source [https://apps.sourceforge.net/mediawiki/freedict/index.php?title=Main_Page FreeDict] (TEI-encoded translating dictionary repository) and [https://apps.sourceforge.net/mediawiki/octc/index.php?title=Main_Page OCTC] (TEI-encoded multi-instance stand-off multilingual and partly parallel text corpus <nowiki><note></nowiki>I could go on with these attributes, you know -- Syd says I'm articulate ;-)<nowiki></note></nowiki>). I feel honoured to have been invited to help with sharing resources between FreeDict and the [http://wiki.apertium.org/wiki/Task_ideas_for_Google_Code-in Apertium project] (doing rule-based machine translation). I'm also involved in a Polish Ministry of Science-funded project creating a new Swahili-Polish-Swahili dictionary (N104 050437, directed by Beata Wójtowicz). | |
− | + | Some of my experiences with applying the TEI to large-scale language resources are reflected in my recent Balisage-2010 paper, on [http://www.balisage.net/Proceedings/vol5/html/Banski01/BalisageVol5-Banski01.html "Why TEI stand-off annotation doesn't quite work, and why you might want to use it nevertheless"]. | |
+ | |||
+ | With Andreas Witt and Eleonora Litta, I am co-convener of the brand-new TEI Special Interest Group, "[[SIG:TEI_for_Linguists|TEI for Linguists]]". I am also running for the TEI Council for the 2011-2013 term (see [http://www.tei-c.org/Membership/Meetings/2010/mm52.xml#PB my candidate statement]). | ||
[[Category:Users|Piotr Banski]] | [[Category:Users|Piotr Banski]] |
Revision as of 11:38, 30 October 2010
Piotr Bański,
Institute of English Studies,
University of Warsaw, Poland
I was involved in the creation of the IPI PAN Corpus of Modern Polish, mostly experimenting with the possibilities that the TEI and the CES offered for its architecture. I was also hired as XML architect for the Polish National Corpus.
My current projects include the open-source FreeDict (TEI-encoded translating dictionary repository) and OCTC (TEI-encoded multi-instance stand-off multilingual and partly parallel text corpus <note>I could go on with these attributes, you know -- Syd says I'm articulate ;-)</note>). I feel honoured to have been invited to help with sharing resources between FreeDict and the Apertium project (doing rule-based machine translation). I'm also involved in a Polish Ministry of Science-funded project creating a new Swahili-Polish-Swahili dictionary (N104 050437, directed by Beata Wójtowicz).
Some of my experiences with applying the TEI to large-scale language resources are reflected in my recent Balisage-2010 paper, on "Why TEI stand-off annotation doesn't quite work, and why you might want to use it nevertheless".
With Andreas Witt and Eleonora Litta, I am co-convener of the brand-new TEI Special Interest Group, "TEI for Linguists". I am also running for the TEI Council for the 2011-2013 term (see my candidate statement).