SIG:CMC/CoMeRe schema draft for representing CMC in TEI (2014)
Status of this draft
This page describes a draft of a basic schema for representing genres of computer-mediated communication (CMC) in TEI. The draft has been created by members of the TEI-SIG "Computer-Mediated Communication".
The SIG encourages everybody to discuss this draft and to give feedback and comments using the "discussion" function at the top of this page. Comments and discussions will be carefully taken into consideration in the further development of the schema.
The history of the draft is documented on the main wiki page of the SIG. This page should be read alongside SIG:CMC/Draft: A metadata schema for CMC.
Authors of this draft: Thierry Chanier, N.N., N.N.
Interaction types
Interaction
Participants are in the same interaction space (IS) when they can interact (but do not necessarily do so, cf. lurkers). They interact through input devices (microphone, keyboard, mouse, gloves, etc.), which let them use the modality tools, and output devices, which mainly produce visual or oral signals. (These devices, however, will not be described in this article.) Hence, when participants can neither hear nor see the other participants' actions, they are not in the same IS. Of course, participants may not be present during the whole time frame of the IS: they can enter late or leave early.
In an IS, actions occur between participants. Let us call the trace of an action within an environment and one particular modality an "act". Acts are generated by participants, and sometimes by the system. Some of them may be considered directly communicative (verbal ones in synchronous text or oral modalities). Others may not be directly communicative but may trigger a communicative reaction or interaction (e.g. when participants write collaboratively in an online word processor and comment on their work). Participants see and hear what others are doing. These actions may be the very reason why participants are there and interact (to produce something collectively). Hence the distinction between directly communicative acts and other acts is irrelevant.
An important distinction may be made between an IS where participants use only one modality (tool) and an IS where several are used. In the next section, we start by presenting some examples of monomodal environments where actions occur en bloc (Beisswenger et al., 2012). A more complicated case appears when an IS uses several modalities. An example of this case is also given in this article.
Within other multimodal environments, verbal acts (speech, text chat) and nonverbal acts occur simultaneously. The main purpose of transcriptions is then to describe inter-relations amongst acts and within acts: a participant's utterance may be re-planned while s/he talks, depending on other specific acts occurring at the same time (see Wigham & Chanier, 2013). Indeed, written communication can be simultaneously combined with other modalities. For example, there are situations where a participant does not plan an utterance as a one-shot process before it is sent as an en bloc message to a server, which in turn displays it to the other participants as a non-modifiable piece of language (e.g. as a text chat turn). An utterance can also be planned, then modified in the throes of the interaction while taking into account what is happening in other modalities of communication (e.g. in an audio chat turn).
| CMC environment | Mode & modality | TEI element | Main macrostructure issues with TEI | Corpora presently under processing into TEI |
|---|---|---|---|---|
| SMS | Text | <post> | No notion of addressee (in <head>?) | Y |
| Textchat | Text | <post> | @alias, @type, @subtype | Y |
| Email | Text | <post> | Addresses, readers, copy, attached file, etc. (in <head>) | Y |
| Discussion forum | Text | <post> | Threads (opening, answering): @ref | Y |
| Wikipedia discussion forum | Text | <post> | Reply difficult to identify (indent): @ref | Y |
| Blog | Text + image | <post> | Message & comment: @ref | Y |
| Audio conferencing system (e.g. Skype) | Text | <post> | | N |
| Complex CMC environment | | | | |
| LMS (Learning Management System): WebCT, Moodle | Text | <post> | One teiCorpus file | Y |
| Audio-graphic conference system (e.g. Lyceum, Centra) | Text + audio + nonverbal | <post> <u> <prod> | Every element at the same level, i.e. mixing of these elements within a <div> | Y |
| Video-graphic or 3D environment (e.g. Second Life) | Text + audio + nonverbal | <post> <u> <prod> | Idem + set of video files attached to the TEI file | Y |
Macrostructure
Following Beisswenger et al. (2012), we refer to macro-structure when considering the general information attached to an interaction (addressees, copy-to, title, label, readers, attached files, etc.) as well as issues dealing with the way sets of interactions are arranged per modality or interaction space. The micro-structure of the text (next section) refers to the types of elements found in the actual contents of the interaction (<post>, <u> or <prod>), for example interaction words, emoticons, hash code, etc.
Before assembling general proposals concerning <post> and <prod> and their attributes, let us consider some examples of CMC interactions.
Multimodal example
From cmr-copeas-tei-v1. Context: Lyceum audio-graphic conference environment, 3 learners (English L2) working in a word processor: one writing, the others helping.
- (1.2): collaborative word processor
- (1.3): audio, clarification
- (1.4): textchat, correction (with error)
- (1.5): textchat, request clarification
(1)
(1.2) <prod xml:id="cmr-copeas-T8_s101_ecriture_multimodale-a_14481" xml:lang="unk" start="#cmr-copeas-tl_t-w1979" end="#cmr-copeas-tl_t-w1987" who="#AT3" type="text_doc"> (modify),paragraph (ad,For example:to have comparaison between web sites, to know more criterias for a good site)</prod>
(1.3) <u xml:id="cmr-copeas-T8_s101_ecriture_multimodale-a_14482" xml:lang="eng" start="#cmr-copeas-tl_t-w1988" end="#cmr-copeas-tl_t-w1993" who="#AT1">euh + to euh + to use the good euh + good euh vocabulary + I think euh + we euh + we wrote euh for me I have euh +++ I've euh much progress in euh in use the good vocabulary to euh + to evaluate euh a website</u>
(1.4) <post xml:id="cmr-copeas-T8_s101_ecriture_multimodale-a_14483" xml:lang="unk" synch="#cmr-copeas-tl_t-w1990" who="#AT6" type="chat-message">
  <p>according to the differents criteria</p>
</post>
(1.5) <post xml:id="cmr-copeas-T8_s101_ecriture_multimodale-a_14484" xml:lang="unk" synch="#cmr-copeas-tl_t-w1991" who="#AT6" type="chat-message">
  <p>?</p>
</post>
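The idea that <prod>, <u> and <post> sit at the same level within a <div> can be illustrated with a short Python sketch that lists the acts of such a division in document order. This is only an illustration under stated assumptions: the sample is shortened from example (1), and the wrapping <div>, the short xml:id values (a1 to a4), the shortened timeline pointers and the function name list_acts are hypothetical, not part of the draft. A real TEI file would also declare the TEI namespace, which is left out here.

```python
import xml.etree.ElementTree as ET

# Shortened fragment modelled on example (1). The <div> wrapper and the
# short xml:id / timeline values are assumptions made for this sketch.
SAMPLE = """
<div>
  <prod xml:id="a1" who="#AT3" type="text_doc" start="#w1979" end="#w1987">(modify),paragraph</prod>
  <u xml:id="a2" who="#AT1" start="#w1988" end="#w1993">euh + to use the good vocabulary</u>
  <post xml:id="a3" who="#AT6" type="chat-message" synch="#w1990"><p>according to the differents criteria</p></post>
  <post xml:id="a4" who="#AT6" type="chat-message" synch="#w1991"><p>?</p></post>
</div>
"""

# ElementTree expands the predefined xml: prefix to this namespace URI.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def list_acts(xml_text):
    """Return (element name, xml:id, participant) for each act, in document order."""
    root = ET.fromstring(xml_text)
    return [(act.tag, act.get(XML_ID), act.get("who")) for act in root]


acts = list_acts(SAMPLE)
for tag, act_id, who in acts:
    print(tag, act_id, who)
```

Because all three element types are direct children of the same <div>, a single pass over the children is enough to recover the interleaving of word-processor, audio and textchat acts.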
Textchat
From cmr-getalp_org-rhone-alpes-tei-v1. The textchat turns correspond here, respectively, to:
- the first 3: messages
- 4: change alias
- 5: change rights
(2)
<post xml:id="cmr-get-c065-a21693" when-iso="2004-03-18T14:09" who="#cmr-get-c065-p39174" alias="cortex_taff" type="chat-message">
  <p>Apres je vé faire ma physique c aussi les equation bilan</p>
</post>
<post xml:id="cmr-get-c065-a21694" when-iso="2004-03-18T14:09" who="#cmr-get-c065-p39174" alias="cortex_taff" type="chat-message">
  <p>Aujourd'hui c la journée equation</p>
</post>
<post xml:id="cmr-get-c065-a21697" when-iso="2004-03-18T14:11" who="#cmr-get-c065-p36208" alias="roulie" type="chat-message">
  <p>lol</p>
</post>
<post xml:id="cmr-get-c065-a21699" when-iso="2004-03-18T14:13" who="#cmr-get-c065-p120845" alias="Tsu" type="chat-event" subtype="changementpseudo">
  <p>Changement de pseudo: Tsu -> Tsu[H</p>
  <add><code>alias_change(Tsu,Tsu[H)</code></add>
</post>
<post xml:id="cmr-get-c065-a21705" when-iso="2004-03-18T14:18" who="#unknow" type="chat-event" subtype="changementmode">
  <p>#Rhone-alpes:changement de mode(s) '+o Mega-Link' par Hera!services@olympe.epiknet.org</p>
</post>
SMS
From cmr-smslareunion-tei-v1. Two SMS messages sent by the same person to the same addressee. Note that the addressee is not explicitly encoded here: the phone numbers of addressees have not been collected, hence are not known, and there is no attribute in TEI to encode addressees (the same problem arises with the speech / oral chapter in TEI).
(3)
<post xml:id="cmr-slr-c001-a00011" when-iso="2008-04-14T10:17:11" who="#cmr-slr-c001-p010" type="sms">
  <p>é@??$?Le + triste c ke tu na aucune phraz agréabl et ke tu va encor me dir ke c moi ki Merde par mon attitu2! Moi je deman2 pa mieu ke klke mot agréabl échangé</p>
</post>
[…]
<post xml:id="cmr-slr-c001-a00304" when-iso="2008-04-15T20:23:59" who="#cmr-slr-c001-p010" type="sms">
  <p>...2 te comporter comme ca avec moi. Je ve bien admettr mes erreur kan j'agi vraimen mal comm hier mé fo pa exagérer. Si t pa d'accor c ton droi. Si tentain.le rest c à dirreposer dé question sur 1 sujet déjà expliké c pa 1 raison valabl pr ke tu te monte contr moi.pr moi ossi ca suffi.</p>
</post>
Discussion forum
From cmr-simuligne-tei-v1. The author of the message is a native speaker of French who is replying to a post made by a learner of French. Each person mentioned has been identified in the message structure (author, list of readers, shortened here) and in its contents (addressee, signature of the author, attached file). This information may support other types of research on discourse and group interactions. For example, who takes on the role of leader or facilitator in a group? Can communication subgroups be traced within a group through an analysis of clusters or cliques?
(4)
<post xml:id="cmr-Simu-Aquitania-Principal_27.04-13.05.01-59" when="2001-05-02T07:58:00" who="#cmr-Simu-Al5" type="forum-message">
  <head>
    <title>les sons du Suffolk</title>
    <listPerson>
      <person corresp="#cmr-Simu-An3">
        <event type="Read" when="2001-05-02T07:58:00">
          <label>Read</label>
        </event>
      </person>
      [other readers]
    </listPerson>
  </head>
  <p>Puisqu'on parlait de ce qui est par la fenêtre, j'ai mis mon micro tout près de la fenêtre ce soir... Le coucou s'était déjà couché, malheureusement, mais les autres chantent très fort! Les 'Bullocks' ont tous le même père: un grand taureau Charolais, qui est le père de tous les boeufs de la région! bonne nuit à tous<name ref="#cmr-Simu-Al5" type="person"><forename>Marja</forename></name> </p>
  <trailer>
    <ref type="attached_file" target="#Simu_Aqui_forum_attach_988833475">suffolk.qcp</ref>
  </trailer>
</post>
Wikipedia discussion
to be added
Blog
From cmr-infral-tei-v1. One message and its comment.
(5)
<post xml:id="cmr-blog-a2" synch="#T2" who="#P2" type="blog-message">
  <head>
    <title>Présentation de ma personne</title>
    <label>étapeE1 ; </label>
  </head>
  <p>Bon soir à tous!<lb/>
  Maintenant, je vais commencer avec les présentations....... <lb/>
  Je pense que vous avez vu que je m'appelle <name ref="#P2">Kerstin</name> . J'ai 22 ans. Mon nom est un nom suédois qui est très fréquent en Allemagne. Comme vous savez peut-être, on a commencé nos études en master cette semaine. <lb/>
  Ma famille - mes parents et mes deux soeurs -habite à Osnabrueck. C'est une ville qui est pas loin de Brême. Après avoir passé mon bac à Osnabrueck, j'ai commencé mes études de francais et de sport à Brême. La raison pour laquelle j'ai choisi ces deux matières est que j'aime faire du sport (jouer au tennis, nager) et que j'adore la culture francaise. J'adore la langue francaise et le pays me plaît beaucoup (le paysage francais....). <lb/>
  Les deux étés passés, j'ai fait un stage de plus que deux mois en Suisse francophone et en France (près de Lyon) pour améliorer mes connaissances de la langue francaise et la pratique du francais à l'oral. <lb/>
  En ce qui concerne mes études de francais, ce qui me plaît surtout, c'est, d'explorer la culture francaise d'une manière différente (les textes littéraires, les séquences vidéos......). <lb/>
  J'attends vos présentations et je vous souhaite encore un bon soir........ <lb/>
  A bientôt, <name ref="#P2">Kerstin</name></p>
</post>
<post xml:id="cmr-blog-a3" synch="#T3" who="#P3" type="blog-comment" ref="#cmr-blog-a2">
  <head>
    <title>Hallo Kirstin! J'ai lu que tu as fait des stages e...</title>
  </head>
  <p>Hallo<name ref="#P2">Kirstin</name> ! J'ai lu que tu as fait des stages en Suisse francophone ! Où exactement car j'habite près de la frontière suisse (à 1h de Lausanne !)! Je pense qu'on aura l'occasion d'en reparler ! Bis Bald </p>
</post>
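The link between a comment and its message, carried by @ref pointing at the xml:id of the message, can be followed programmatically. The Python sketch below is a minimal illustration under stated assumptions: the sample is shortened from example (5), the wrapping <div> and the function name replies_by_parent are hypothetical, and the TEI namespace is omitted for brevity.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Shortened fragment modelled on example (5); the <div> wrapper is an
# assumption made for this sketch.
SAMPLE = """
<div>
  <post xml:id="cmr-blog-a2" who="#P2" type="blog-message"><p>Bon soir à tous!</p></post>
  <post xml:id="cmr-blog-a3" who="#P3" type="blog-comment" ref="#cmr-blog-a2"><p>Hallo Kirstin !</p></post>
</div>
"""

# ElementTree expands the predefined xml: prefix to this namespace URI.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def replies_by_parent(xml_text):
    """Map the xml:id of each referenced post to the xml:ids of the posts that point to it."""
    root = ET.fromstring(xml_text)
    replies = defaultdict(list)
    for post in root.iter("post"):
        ref = post.get("ref")
        if ref and ref.startswith("#"):
            # Strip the leading '#' of the local pointer to get the target xml:id.
            replies[ref[1:]].append(post.get(XML_ID))
    return dict(replies)


threads = replies_by_parent(SAMPLE)
print(threads)
```

The same lookup would apply to forum threads, where the table above also relies on @ref to relate an answer to the post that opened the thread.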
Email
From cmr-simuligne-tei-v1. One email sent to one person and read by that person.
(6)
<post xml:id="cmr-Simu-Aq-At-Outbox-0080" when="2001-05-12T01:15:00" who="#cmr-Simu-At" type="email-message">
  <head>
    <title>ta photo</title>
    <listPerson>
      <person corresp="#cmr-Simu-Al6">
        <event type="SendTo">
          <label>SendTo</label>
        </event>
      </person>
      <person corresp="#cmr-Simu-Al6">
        <event type="Read" when="2001-05-12T01:15:00">
          <label>Read</label>
        </event>
      </person>
    </listPerson>
  </head>
  <p>Coucou<name ref="#cmr-Simu-Al6" type="person"><forename>Mia</forename></name>, Tu peux aller te voir dans Publications : maintenant, tu y existe en totalité ! A bientôt,<name ref="#cmr-Simu-At" type="person"><forename>Anna</forename></name> </p>
</post>
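The macrostructure information stored in <head> (who the message was sent to, who read it and when) can be extracted with a few lines of Python. The sketch below is a hedged illustration: the sample is shortened from example (6), the short xml:id value m1 and the function name header_events are hypothetical, and the TEI namespace is omitted for brevity.

```python
import xml.etree.ElementTree as ET

# Shortened fragment modelled on example (6); the xml:id "m1" and the
# truncated body are assumptions made for this sketch.
SAMPLE = """
<post xml:id="m1" when="2001-05-12T01:15:00" who="#cmr-Simu-At" type="email-message">
  <head>
    <title>ta photo</title>
    <listPerson>
      <person corresp="#cmr-Simu-Al6"><event type="SendTo"><label>SendTo</label></event></person>
      <person corresp="#cmr-Simu-Al6"><event type="Read" when="2001-05-12T01:15:00"><label>Read</label></event></person>
    </listPerson>
  </head>
  <p>Coucou</p>
</post>
"""


def header_events(xml_text):
    """Return (person pointer, event type, timestamp or None) for each event in the post header."""
    post = ET.fromstring(xml_text)
    events = []
    for person in post.findall("./head/listPerson/person"):
        for event in person.findall("event"):
            events.append((person.get("corresp"), event.get("type"), event.get("when")))
    return events


events = header_events(SAMPLE)
for who, kind, when in events:
    print(who, kind, when)
```

A SendTo event carries no @when in the example, so the timestamp is returned as None rather than guessed from the @when of the post itself.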
Microstructure
text text