Difference between revisions of "Ad-hoc committee on encoding of bibliographic citations"
(→Charge: offers of assistance from others) |
(→Background: Kevin's initial notes) |
||
Line 6: | Line 6: | ||
= Background = | = Background = | ||
+ | |||
+ | The TEI Guidelines offer three elements for bibliographic descriptions: | ||
+ | |||
+ | * bibl -- no enforced content structure | ||
+ | * biblStruct -- structured citation follows a TEI content model | ||
+ | * biblFull -- structured citation follows TEI fileDesc model | ||
+ | |||
+ | These elements may be used for a number of different purposes in a TEI document. They may occur within a header to describe the source of a TEI document or within the body of a TEI document. As with other elements used in the body, any of these three elements could be used either: | ||
+ | |||
+ | * to represent citations in a source document (such as a pre-existing print document) | ||
+ | * to encode citations as part of a publishing process (such as a manuscript of a book to be published) | ||
+ | |||
+ | This distinction (which [http://listserv.brown.edu/archives/cgi-bin/wa?A2=ind1002&L=TEI-L&P=R525 Wendell Piez called "retrospective markup" versus "encoding done for the sake of fitting data to a particular application"]) is a fundamental one in the TEI that is significant in discussions of use of the three elements for bibliographic distinctions. | ||
+ | |||
+ | == Encoding citations as represented in a source document (such as a pre-existing print document) == | ||
+ | |||
+ | When encoding a citation as it appears in a source document, most users prefer to leave the transcribed text in the order it is read on the page when tagging; therefore, any elements used for bibliographic descriptions need to be flexible in their internal structure. bibl will clearly work for this purpose, and biblFull will not (unless the source document contains only ISBD-compatible citations without errors!). biblStruct has a complicated content model whose allowed order of elements seems to map well onto common citation formats (at least those used in European languages). However, when including page numbers, it's not clear when to include a biblScope as a child or sibling of imprint. | ||
+ | |||
+ | Examples in the Guidelines sometimes include delimiting punctuation in the elements, which is sensible when encoding citations as represented in a source document. Users often wish to represent the original document as faithfully as possible, including inconsistent punctuation, so they don't want to insert any punctuation automatically. Leaving in punctuation is also simpler when converting from print or electronic source files. | ||
+ | |||
+ | == Encoding citations as part of a publishing process (such as a manuscript of a book to be published) == | ||
+ | |||
+ | When encoding a citation as part of a publishing process, the usual desire is to enforce uniformity of citation style through use of markup. If a particular citation format (MLA, APA, GOST 7.1, etc.) is to be used, it is likely that the possible combination of citation components in that format does not match perfectly with the content model of any of three elements for bibliographic descriptions. While biblFull likely requires too much detail for most purposes, you might use bibl or biblStruct, but either would allow for more flexibility of encoding structure than is required for the citation format being used. Constraining the content model of any of the three elements would break TEI Conformance (?), so a user might instead maintain the TEI content model for the element but use a Schematron schema or other outside tool to check that the citation is encoded as desired. | ||
+ | |||
+ | It is advised not to include delimiting punctuation in citations encoded as part of a publishing process but rather to insert these through a stylesheet, thereby guaranteeing not only consistent use of punctuation but also making it easier to adapt to a different citation style in the future. | ||
= Proposal = | = Proposal = |
Revision as of 00:35, 20 February 2010
Contents
Charge
At its meeting on 2010-02-07, the TEI Council discussed bug 2714682 and reached consensus that biblScope should not be allowed as a child of imprint since the prose definition of the imprint element does not include such content as might be included in one or more biblScope elements. Laurent Romary, Martin Holmes, and Kevin Hawkins were charged with writing a proposal to make it clear how to use biblScope for various types of bibliographic citations. The proposal should corrected, annotated (with XML comments) examples for the Guidelines of encoding various types of citations in biblStruct (and maybe also bibl and biblFull). It was suggested that the ad-hoc committee also look at how citations are handled in other encoding schemes.
Kent Hooper volunteered on TEI-L to help, pointing us to his bibliography at barlach-biblio.org and offering his TEI encoding behind it. Charles Muller offered his pre-P5 bibliography at http://www.acmuller.net/yogacara/bibliography/bibnotes.html for examples.
Background
The TEI Guidelines offer three elements for bibliographic descriptions:
- bibl -- no enforced content structure
- biblStruct -- structured citation follows a TEI content model
- biblFull -- structured citation follows TEI fileDesc model
These elements may be used for a number of different purposes in a TEI document. They may occur within a header to describe the source of a TEI document or within the body of a TEI document. As with other elements used in the body, any of these three elements could be used either:
- to represent citations in a source document (such as a pre-existing print document)
- to encode citations as part of a publishing process (such as a manuscript of a book to be published)
This distinction (which Wendell Piez called "retrospective markup" versus "encoding done for the sake of fitting data to a particular application") is a fundamental one in the TEI that is significant in discussions of use of the three elements for bibliographic distinctions.
Encoding citations as represented in a source document (such as a pre-existing print document)
When encoding a citation as it appears in a source document, most users prefer to leave the transcribed text in the order it is read on the page when tagging; therefore, any elements used for bibliographic descriptions need to be flexible in their internal structure. bibl will clearly work for this purpose, and biblFull will not (unless the source document contains only ISBD-compatible citations without errors!). biblStruct has a complicated content model whose allowed order of elements seems to map well onto common citation formats (at least those used in European languages). However, when including page numbers, it's not clear when to include a biblScope as a child or sibling of imprint.
Examples in the Guidelines sometimes include delimiting punctuation in the elements, which is sensible when encoding citations as represented in a source document. Users often wish to represent the original document as faithfully as possible, including inconsistent punctuation, so they don't want to insert any punctuation automatically. Leaving in punctuation is also simpler when converting from print or electronic source files.
Encoding citations as part of a publishing process (such as a manuscript of a book to be published)
When encoding a citation as part of a publishing process, the usual desire is to enforce uniformity of citation style through use of markup. If a particular citation format (MLA, APA, GOST 7.1, etc.) is to be used, it is likely that the possible combination of citation components in that format does not match perfectly with the content model of any of three elements for bibliographic descriptions. While biblFull likely requires too much detail for most purposes, you might use bibl or biblStruct, but either would allow for more flexibility of encoding structure than is required for the citation format being used. Constraining the content model of any of the three elements would break TEI Conformance (?), so a user might instead maintain the TEI content model for the element but use a Schematron schema or other outside tool to check that the citation is encoded as desired.
It is advised not to include delimiting punctuation in citations encoded as part of a publishing process but rather to insert these through a stylesheet, thereby guaranteeing not only consistent use of punctuation but also making it easier to adapt to a different citation style in the future.