Difference between revisions of "SIG:CMC/Technical Meeting on CMC at DARIAH VCC 2014"
(→TENTATIVE SCHEDULE / PROGRAM) |
m |
||
Line 11: | Line 11: | ||
Main page of the CMC-SIG in this wiki: <u>[[SIG:Computer-Mediated Communication]]</u> | Main page of the CMC-SIG in this wiki: <u>[[SIG:Computer-Mediated Communication]]</u> | ||
− | = DESCRIPTION = | + | == DESCRIPTION == |
See [http://dariah.eu/activities/general-vcc-meetings/4th-general-vcc-meeting/programme/community-sessions.html PDF version] on the DARIAH website | See [http://dariah.eu/activities/general-vcc-meetings/4th-general-vcc-meeting/programme/community-sessions.html PDF version] on the DARIAH website | ||
Line 31: | Line 31: | ||
*a first exchange about experience in automatically structuring and processing CMC data as well as a concept for a common platform (to be set up in 2015) for the documentation and exchange of NLP tools and annotation experiments with other projects and research groups interested in building CMC corpora in different languages; | *a first exchange about experience in automatically structuring and processing CMC data as well as a concept for a common platform (to be set up in 2015) for the documentation and exchange of NLP tools and annotation experiments with other projects and research groups interested in building CMC corpora in different languages; | ||
* plans for international scientific events (extended workshops, conferences) based on these topics in 2015/16. | * plans for international scientific events (extended workshops, conferences) based on these topics in 2015/16. | ||
+ | <br/> | ||
− | + | ==PROGRAM (preliminary)== | |
− | |||
− | =PROGRAM (preliminary)= | ||
*Wed, September 17, 2014: '''Short intro/presentation''' (3-5 mins) on the work of our community | *Wed, September 17, 2014: '''Short intro/presentation''' (3-5 mins) on the work of our community | ||
*Thu, September 18, 2014: '''Community meeting''' (3 hours). | *Thu, September 18, 2014: '''Community meeting''' (3 hours). | ||
− | ==Pt. I: Presentations (Thu 15:00-17:00)== | + | ===Pt. I: Presentations (Thu 15:00-17:00)=== |
* Harald Lüngen & Eliza Margareta (IDS Mannheim): '''Applying the TEI CMC SIG proposal to Wikipedia corpora''' | * Harald Lüngen & Eliza Margareta (IDS Mannheim): '''Applying the TEI CMC SIG proposal to Wikipedia corpora''' | ||
Line 47: | Line 46: | ||
* Michael Beißwenger (TU Dortmund): '''Shared task on linguistic annotation of German CMC: intermediate report from the preparation of ''EmpiriST2015''''' | * Michael Beißwenger (TU Dortmund): '''Shared task on linguistic annotation of German CMC: intermediate report from the preparation of ''EmpiriST2015''''' | ||
− | ==Pt. II: Round table: further work on standards and joint scientific activities (Thu 17:00-18:00)= | + | ===Pt. II: Round table: further work on standards and joint scientific activities (Thu 17:00-18:00)=== |
* Discussion of the current draft schema from the perspective of different projects and initiatives; ideas for annotations on the microlevel (= inner structure of postings) and about the interface between POS annotations and the microstructure in the TEI schema | * Discussion of the current draft schema from the perspective of different projects and initiatives; ideas for annotations on the microlevel (= inner structure of postings) and about the interface between POS annotations and the microstructure in the TEI schema |
Revision as of 20:40, 12 July 2014
This page describes a workshop and tentative program for a community session/technical meeting on issues related to the modeling of CMC corpora, organized by members of the CMC-SIG at the 4th DARIAH-EU VCC meeting 2014 in Rome.
Date: Thursday, September 18, 15:00-18:00
Location: Rome, Villa Mirafiori
Main page of the CMC-SIG in this wiki: SIG:Computer-Mediated Communication
DESCRIPTION
See PDF version on the DARIAH website
Corpora of computer-mediated communication (CMC) are a desideratum for many scholars in the humanities who are interested in empirical research on language use and on emerging communicative genres on the Internet and in social media applications. Important steps for building such corpora and for representing them in an interoperable way are:
Researchers at a European level are already aware that many of the challenges in building CMC corpora in the humanities are the same for every language; CMC corpus projects for different languages can therefore benefit from sharing knowledge and experience with each other and from facing the challenges as a joint task. Since 2013, a group of corpus projects from France, Germany, Italy and the Netherlands has been exchanging expertise and experience in building CMC corpora (= the network "Building and annotating CMC corpora", https://wiki.itmc.tu-dortmund.de/cmc/) and jointly working on a proposal for an extension to the TEI standard which is adapted to the particularities of a broad range of CMC genres (= the TEI special interest group on CMC, http://www.tei-c.org/Activities/SIG/).
The DARIAH technical meeting will gather a restricted number of researchers from different European countries who are involved in projects aiming at building, structuring, annotating and analyzing CMC corpora - including:
The expected outcomes of the meeting are, amongst others:
PROGRAM (preliminary)
Pt. I: Presentations (Thu 15:00-17:00)
Pt. II: Round table: further work on standards and joint scientific activities (Thu 17:00-18:00)