BPG Feedback on Complete Draft

From TEIWiki
Jump to navigation Jump to search

Back to GBP

The main sections of the BPG are outlined below to facilitate orderly feedback :-). Please provide feedback by Saturday, April 11th so Kevin and I have enough time to compile any of the more significant issues for discussion during our upcoming meeting on April 14th.

Add each new comment as a bullet point and be sure to attribute comments so we know how to follow-up if there's some debate. For example:

Introduction

  • Task Force w/c though historically correct may lose context in this new revision. Consider something more generic like "Working Group" or past tense: "the Task Force attempted to make these ..." Mdalmau

General Recommendations

  • Conisder moving facsimile page image info here as the final bullet point. Mdalmau done

Structure of a TEI Document

  • Do we need to reference <facsimile>? Mdalmau
    • We don't. We assume Libraries will use METS for that functionality.

The TEI Header

  • I changed 'the Open Access Initiative (OAI)" to "the Open Archives Initiative (OAI)". Unless there is another initiative that I do not know about.Natasha okay
  • encodingDesc: Even though Melanie and I wrote most of the header, I don't remember any more why some information about sourceDesc is in encodingDesc/p and other info is in encodingDesc/editorialDecl/p . Seems like this info should be in one place or the other. Suggestions? Note that the P5 schema allows both p and non-p children of encodingDesc, but the prose implies that this isn't allowed. I've reported to SourceForge ( https://sourceforge.net/tracker/?func=detail&atid=644062&aid=2761884&group_id=106328 ), but maybe we should do it one way or the other. All the more reason to fix this. Moved everything to editorialDecl.
  • How to allow for automatic generation of sourceDesc from a MARC record yet not commit tag abuse on author and respStmt elements. We now use <title type="MARC245c"> instead of respStmt when generating from MARC.

Linking between encoded text and images of source documents

  • Consider moving this text up under the General Rec. section. Mdalmau done
  • Do we need to say this: "The examples below use the former method." And do we want all examples to use @facs? I think we should mix it up since we are recommending both options. Mdalmau
    • Kevin suggested that we need more info for how to use xml:id with METS. I am not quite sure how to do this at-a-glance and any attempt at detail will lead us into a significant discussion about METS. Here are some things we can say, but may not all of them (and hopefully Chris P. can provide some guidance):
      • Use xml:id used as a "conceptual" identifier for content but not explicit representation of content (it doesn't "point to" content)
      • fileSec used to explicitly list all the images (master, jpg, pdf)
      • structMap orders the pages by page break and references each image defined in fileSec Mdalmau

LEVEL 1: Fully Automated Conversion and Encoding

  • Should an automated workflow be described? (Kshawkin) done

LEVEL 2: Minimal Encoding

  • Should an automated workflow be described? How does everyone feel about the one Kevin described by email on 2009-04-08? (Kshawkin)

Second paragraph of rationale: why "earlier than 1900"? Restate to explain what it is about certain older source documents that makes it difficult to use automated methods on them. Rephrased to "older materials", then removed anyway in description of automated workflow.

I think the last paragraph of "Rationale" in essence repeats what has been stated in the first para. Suggestions - combine them and edit to avoid repetition. Natasha

Consider revision? "Level 2 texts do not require any special knowledge or manual intervention below the section level." Natasha

In "Linking between encoded text and images of source documents", we say that "The examples below use the former method", i.e. "Use the @facs attribute on each <pb> element to point to the corresponding page image using a URI." Indeed that's what we use in the Level 1 example: <pb n="114" facs="00000002.tif"/>

However, here is the level 2 example: <pb xml:id="p21198-zz0002mpqr" n="1"/> Natasha

maybe to keep one - Alger Hiss document - example?Natasha No longer says "The examples below use the former method".

LEVEL 3: Simple Analysis

General Level 3 Recommendations

  • need to check for consistency: sometimes it's pb, sometimes it's <pb> Natasha This will be dealt with in copyediting.
  • sorry, but what does "Forme Works" mean? Natasha clarified this TEI term
  • Need to explain more about how to encode end-of-division notes. They need an n= attribute, but they also need a ptr in the text.
    • Do we handle with: <ref target="#note01">1</ref> that points to end of division [1] <note xml:id="note01">blah</note>?
    • Do we need to support bi-directional linking: <ref target="#note01" xml:id="ref01">1</ref> and <note xml:id="note01"><ref target="#ref01">[1]</ref>blah</note>? Mdalmau
      • See email from Kevin sent on 2009-04-14. Since no feedback from Syd, we went with option b from Kevin's email even though this doesn't agree with the TOC markup, based on P5, which uses ptr instead of ref when it might be expected.
  • Use of the "n" attribute for <l> should be optional. Remove from example? Or add optional "n" usage to the table? Mdalmau (reported by Rich) We decided to remove n= from l element in Level 3.
  • Check: "Add the "target" attribute (@target) to reference the <pb> identifier to generate links from the index into the text proper." And in general the consistency with how we list attributes, i.e. "target" attribute or target= attribute; or @target, etc. Natasha
    • I revised this to make it somewhat clearer, but I think it should be rewritten and an example (even a fake one) included. It would be good to give guidance on what to do with things like page ranges in an index and errors in the index. (Kshawkin) No full example yet provided, but ext revised to make it clearer. People can decide on their own what to do about page ranges and errors.

Level 3 Examples

Basic Structure: Verse

I am afraid that our examples for level 3 and level 4 verse are not that different. Level 3 verse has n= attribute for <l> element, while level 4 verse does not. Thoughts? Natasha Level 3 now has no n=, and Level 4 does.

LEVEL 4: Basic Content Analysis

  • For name tagging, recommend the use of @ref when the target is web accessible and @key when not. Mdalmau
  • Matt has provided an @key example; some possible debate about where the explanation appears in the Header Mdalmau
  • Moved Name Tagging recommendations from bullet point under General Recommendations to its own section. The explanation and examples are too dense for a bullet point. We may need to find a better place for this. Mdalmau
    • Not contested.
  • Natasha to incorporate index "tip" originally in level 3 in level 4:

Back Matter

<div type="index">: Use lists to mark up index entries, with the <ref> element used to mark up page numbers given in the index entry. Add the "target" attribute (@target) to reference the n= attribute of the <pb> of the referenced page.

Kevin just did this.


Level 4 Letters

  • This text conflates personal letters (epistles) with situations where one text is embedded within another. These topics should be dealt with separately since there are encoding projects that encode personal correspondence only (where letters are not embedded within other texts), and there are cases of non-epistles included within frame texts. (Kshawkin)
    • I see your point, Kevin. I will try to find a more clear-cut example, but, honestly, I would stay with the one we have :-) Natasha Kevin clarified the text of the GBP.

Level 4 Drama

  • It says, "Stage directions are encoded as <stage> and enclose block level content describing scenery, etc." This is unclear: should block-level content describing the scenery etc. be "enclosed" in <stage> tags?
    • I just changed this to read: "Stage directions are encoded as <stage> and enclose content describing scenery, stage directions, etc." However, this may miss the point that Kevin indicated above, that basically elements such as <p> are allowed inside of <stage> (Drsweeg) done

Level 4 Verse

  • <l rend="">: Matt added a note in the GBP, which I've moved here: "note: I really don't think we want to keep something like this in here; what do others recommend as far as methods for encoding typographically explicit verse breaks, etc.?"
    • Also, what sorts of values should be used on rend=? (Kshawkin) We decided that use of rend= to mark indents is okay. Discussion of milestones broken out into a new section.

LEVEL 5: Scholarly Encoding Projects

  • Need examples. Syd was going to make scholarly the level 4 examples. Any updates? Mdalmau

Syd, what do you think about my comments re your WWP example? It can be considered level 5. Natasha

  • I've added two verse examples, neither of which inspires much confidence. I wouldn't be hurt if someone were to replace these with other better (e.g. WWP) examples. Gworthey

General Guidelines for Attribute Usage

  • It says, "For instance, if all text is encoded as <hi> is defined as being rendered in italics, there is no reason to encode text as <hi rend="slant(italics)">" . We need to say where this would be defined, and by whom. But we should also discuss the question of normalization in rendering. If a project wants to encode all monograph titles as italics regardless of how they were displayed in the source document, then there's no need to use rend=. But if the project wants to transcribe the source document faithfully, including all inconsistencies, then rend= should always be used. (Kshawkin) done
  • Why aren't we using the new rendition feature in P5? Mdalmau
    • Syd said he doesn't know of any successful implementations. Michelle said she's seen mention of it on TEI-L. People reviewing our draft may ask about this, so we'll need to be prepared to respond. (Kshawkin)
  • Can we move this section up as a subsection to the "General Recommendations"? Gworthey done

Acknowledgements

Appendix A: History of this Document