Difference between revisions of "BPG Feedback on Complete Draft"

From TEIWiki
Jump to navigation Jump to search
(The TEI Header: automatic generation of sourceDesc from MARC and tag abuse in author vs. respStmt)
(LEVEL 5: Scholarly Encoding Projects)
Line 101: Line 101:
 
* Need examples.  Syd was going to make scholarly the level 4 examples.  Any updates? [[User:Mdalmau|Mdalmau]]
 
* Need examples.  Syd was going to make scholarly the level 4 examples.  Any updates? [[User:Mdalmau|Mdalmau]]
 
Syd, what do you think about my comments re your WWP example? It can be considered level 5. [[User:Natasha|Natasha]]
 
Syd, what do you think about my comments re your WWP example? It can be considered level 5. [[User:Natasha|Natasha]]
 +
 +
* I've added two verse examples, neither of which inspires much confidence.  I wouldn't be hurt if someone were to replace these with other better (e.g. WWP) examples.  [[User:Gworthey|Gworthey]]
  
 
==General Guidelines for Attribute Usage==
 
==General Guidelines for Attribute Usage==

Revision as of 19:29, 14 April 2009

Back to GBP

The main sections of the BPG are outlined below to facilitate orderly feedback :-). Please provide feedback by Saturday, April 11th so Kevin and I have enough time to compile any of the more significant issues for discussion during our upcoming meeting on April 14th.

Add each new comment as a bullet point and be sure to attribute comments so we know how to follow-up if there's some debate. For example:

Introduction

  • Task Force w/c though historically correct may lose context in this new revision. Consider something more generic like "Working Group" or past tense: "the Task Force attempted to make these ..." Mdalmau

General Recommendations

  • Conisder moving facsimile page image info here as the final bullet point. Mdalmau

Structure of a TEI Document

  • Do we need to reference <facsimile>? Mdalmau

The TEI Header

  • I changed 'the Open Access Initiative (OAI)" to "the Open Archives Initiative (OAI)". Unless there is another initiative that I do not know about.Natasha
  • encodingDesc: Even though Melanie and I wrote most of the header, I don't remember any more why some information about sourceDesc is in encodingDesc/p and other info is in encodingDesc/editorialDecl/p . Seems like this info should be in one place or the other. Suggestions? Note that the P5 schema allows both p and non-p children of encodingDesc, but the prose implies that this isn't allowed. I've reported to SourceForge ( https://sourceforge.net/tracker/?func=detail&atid=644062&aid=2761884&group_id=106328 ), but maybe we should do it one way or the other. All the more reason to fix this.
  • How to allow for automatic generation of sourceDesc from a MARC record yet not commit tag abuse on author and respStmt elements.

Linking between encoded text and images of source documents

  • Consider moving this text up under the General Rec. section. Mdalmau
  • Do we need to say this: "The examples below use the former method." And do we want all examples to use @facs? I think we should mix it up since we are recommending both options. Mdalmau
    • Kevin suggested that we need more info for how to use xml:id with METS. I am not quite sure how to do this at-a-glance and any attempt at detail will lead us into a significant discussion about METS. Here are some things we can say, but may not all of them (and hopefully Chris P. can provide some guidance):
      • Use xml:id to locate content but not explicit representation of content
      • fileSec used to explicitly list all the images (master, jpg, pdf)
      • structMap orders the pages by page break and references each image defined in fileSec Mdalmau

LEVEL 1: Fully Automated Conversion and Encoding

  • Should an automated workflow be described? (Kshawkin)

LEVEL 2: Minimal Encoding

  • Should an automated workflow be described? How does everyone feel about the one Kevin described by email on 2009-04-08? (Kshawkin)

Second paragraph of rationale: why "earlier than 1900"? Restate to explain what it is about certain older source documents that makes it difficult to use automated methods on them.

I think the last paragraph of "Rationale" in essence repeats what has been stated in the first para. Suggestions - combine them and edit to avoid repetition. Natasha

Consider revision? "Level 2 texts do not require any special knowledge or manual intervention below the section level." Natasha

In "Linking between encoded text and images of source documents", we say that "The examples below use the former method", i.e. "Use the @facs attribute on each <pb> element to point to the corresponding page image using a URI." Indeed that's what we use in the Level 1 example: <pb n="114" facs="00000002.tif"/>

However, here is the level 2 example: <pb xml:id="p21198-zz0002mpqr" n="1"/> Natasha

maybe to keep one - Alger Hiss document - example?Natasha

LEVEL 3: Simple Analysis

General Level 3 Recommendations

  • need to check for consistency: sometimes it's pb, sometimes it's <pb> Natasha
  • sorry, but what does "Forme Works" mean? Natasha
  • Need to explain more about how to encode end-of-division notes. They need an n= attribute, but they also need a ptr in the text.
    • Do we handle with: <ref target="#note01">1</ref> that points to end of division [1] <note xml:id="note01">blah</note>?
    • Do we need to support bi-directional linking: <ref target="#note01" xml:id="ref01">1</ref> and <note xml:id="note01"><ref target="#ref01">[1]</ref>blah</note>? Mdalmau
  • Use of the "n" attribute for <l> should be optional. Remove from example? Or add optional "n" usage to the table? Mdalmau (reported by Rich)
  • Check: "Add the "target" attribute (@target) to reference the <pb> identifier to generate links from the index into the text proper." And in general the consistency with how we list attributes, i.e. "target" attribute or target= attribute; or @target, etc. Natasha
    • I revised this to make it somewhat clearer, but I think it should be rewritten and an example (even a fake one) included. It would be good to give guidance on what to do with things like page ranges in an index and errors in the index. (Kshawkin)

Level 3 Examples

Basic Structure: Verse

I am afraid that our examples for level 3 and level 4 verse are not that different. Level 3 verse has n= attribute for <l> element, while level 4 verse does not. Thoughts? Natasha

LEVEL 4: Basic Content Analysis

  • For name tagging, recommend the use of @ref when the target is web accessible and @key when not. Mdalmau
  • Matt has provided an @key example; some possible debate about where the explanation appears in the Header Mdalmau
  • Moved Name Tagging recommendations from bullet point under General Recommendations to its own section. The explanation and examples are too dense for a bullet point. We may need to find a better place for this. Mdalmau

Level 4 Letters

  • This text conflates personal letters (epistles) with situations where one text is embedded within another. These topics should be dealt with separately since there are encoding projects that encode personal correspondence only (where letters are not embedded within other texts), and there are cases of non-epistles included within frame texts. (Kshawkin)
    • I see your point, Kevin. I will try to find a more clear-cut example, but, honestly, I would stay with the one we have :-) Natasha

Level 4 Drama

  • It says, "Stage directions are encoded as <stage> and enclose block level content describing scenery, etc." This is unclear: should block-level content describing the scenery etc. be "enclosed" in <stage> tags?

Level 4 Verse

  • <l rend="">: Matt added a note in the GBP, which I've moved here: "note: I really don't think we want to keep something like this in here; what do others recommend as far as methods for encoding typographically explicit verse breaks, etc.?"
    • Also, what sorts of values should be used on rend=? (Kshawkin)

LEVEL 5: Scholarly Encoding Projects

  • Need examples. Syd was going to make scholarly the level 4 examples. Any updates? Mdalmau

Syd, what do you think about my comments re your WWP example? It can be considered level 5. Natasha

  • I've added two verse examples, neither of which inspires much confidence. I wouldn't be hurt if someone were to replace these with other better (e.g. WWP) examples. Gworthey

General Guidelines for Attribute Usage

  • It says, "For instance, if all text is encoded as <hi> is defined as being rendered in italics, there is no reason to encode text as <hi rend="slant(italics)">" . We need to say where this would be defined, and by whom. But we should also discuss the question of normalization in rendering. If a project wants to encode all monograph titles as italics regardless of how they were displayed in the source document, then there's no need to use rend=. But if the project wants to transcribe the source document faithfully, including all inconsistencies, then rend= should always be used. (Kshawkin)

Acknowledgements

Appendix A: History of this Document

  • Why aren't we using the new rendition feature in P5? Mdalmau