Difference between revisions of "BPG Feedback on Complete Draft"

Revision as of 18:29, 14 April 2009

The main sections of the BPG are outlined below to facilitate orderly feedback :-). Please provide feedback by Saturday, April 11th so Kevin and I have enough time to compile any of the more significant issues for discussion during our upcoming meeting on April 14th.

Add each new comment as a bullet point and be sure to attribute comments so we know how to follow-up if there's some debate. For example:

feedback blah blah, Mdalmau

1 Introduction
2 General Recommendations
3 Structure of a TEI Document
4 The TEI Header
5 Linking between encoded text and images of source documents
6 LEVEL 1: Fully Automated Conversion and Encoding
7 LEVEL 2: Minimal Encoding
8 LEVEL 3: Simple Analysis
- 8.1 General Level 3 Recommendations
- 8.2 Level 3 Examples
  - 8.2.1 Basic Structure: Verse
9 LEVEL 4: Basic Content Analysis
10 LEVEL 5: Scholarly Encoding Projects
11 General Guidelines for Attribute Usage
12 Acknowledgements
13 Appendix A: History of this Document

Introduction

Task Force w/c though historically correct may lose context in this new revision. Consider something more generic like "Working Group" or past tense: "the Task Force attempted to make these ..." Mdalmau

General Recommendations

Conisder moving facsimile page image info here as the final bullet point. Mdalmau

Structure of a TEI Document

Do we need to reference <facsimile>? Mdalmau

The TEI Header

I changed 'the Open Access Initiative (OAI)" to "the Open Archives Initiative (OAI)". Unless there is another initiative that I do not know about.Natasha

encodingDesc: Even though Melanie and I wrote most of the header, I don't remember any more why some information about sourceDesc is in encodingDesc/p and other info is in encodingDesc/editorialDecl/p . Seems like this info should be in one place or the other. Suggestions? Note that the P5 schema allows both p and non-p children of encodingDesc, but the prose implies that this isn't allowed. I've reported to SourceForge ( https://sourceforge.net/tracker/?func=detail&atid=644062&aid=2761884&group_id=106328 ), but maybe we should do it one way or the other. All the more reason to fix this.

How to allow for automatic generation of sourceDesc from a MARC record yet not commit tag abuse on author and respStmt elements.

Linking between encoded text and images of source documents

Consider moving this text up under the General Rec. section. Mdalmau
Do we need to say this: "The examples below use the former method." And do we want all examples to use @facs? I think we should mix it up since we are recommending both options. Mdalmau
- Kevin suggested that we need more info for how to use xml:id with METS. I am not quite sure how to do this at-a-glance and any attempt at detail will lead us into a significant discussion about METS. Here are some things we can say, but may not all of them (and hopefully Chris P. can provide some guidance):
  - Use xml:id to locate content but not explicit representation of content
  - fileSec used to explicitly list all the images (master, jpg, pdf)
  - structMap orders the pages by page break and references each image defined in fileSec Mdalmau

LEVEL 1: Fully Automated Conversion and Encoding

Should an automated workflow be described? (Kshawkin)

LEVEL 2: Minimal Encoding

Should an automated workflow be described? How does everyone feel about the one Kevin described by email on 2009-04-08? (Kshawkin)

Second paragraph of rationale: why "earlier than 1900"? Restate to explain what it is about certain older source documents that makes it difficult to use automated methods on them.

I think the last paragraph of "Rationale" in essence repeats what has been stated in the first para. Suggestions - combine them and edit to avoid repetition. Natasha

Consider revision? "Level 2 texts do not require any special knowledge or manual intervention below the section level." Natasha

In "Linking between encoded text and images of source documents", we say that "The examples below use the former method", i.e. "Use the @facs attribute on each <pb> element to point to the corresponding page image using a URI." Indeed that's what we use in the Level 1 example: <pb n="114" facs="00000002.tif"/>

However, here is the level 2 example: <pb xml:id="p21198-zz0002mpqr" n="1"/> Natasha

maybe to keep one - Alger Hiss document - example?Natasha

LEVEL 3: Simple Analysis

General Level 3 Recommendations

need to check for consistency: sometimes it's pb, sometimes it's <pb> Natasha

sorry, but what does "Forme Works" mean? Natasha

Need to explain more about how to encode end-of-division notes. They need an n= attribute, but they also need a ptr in the text.
- Do we handle with: <ref target="#note01">1</ref> that points to end of division [1] <note xml:id="note01">blah</note>?
- Do we need to support bi-directional linking: <ref target="#note01" xml:id="ref01">1</ref> and <note xml:id="note01"><ref target="#ref01">[1]</ref>blah</note>? Mdalmau
Use of the "n" attribute for <l> should be optional. Remove from example? Or add optional "n" usage to the table? Mdalmau (reported by Rich)

Check: "Add the "target" attribute (@target) to reference the <pb> identifier to generate links from the index into the text proper." And in general the consistency with how we list attributes, i.e. "target" attribute or target= attribute; or @target, etc. Natasha

- I revised this to make it somewhat clearer, but I think it should be rewritten and an example (even a fake one) included. It would be good to give guidance on what to do with things like page ranges in an index and errors in the index. (Kshawkin)

Level 3 Examples

Basic Structure: Verse

I am afraid that our examples for level 3 and level 4 verse are not that different. Level 3 verse has n= attribute for <l> element, while level 4 verse does not. Thoughts? Natasha

LEVEL 4: Basic Content Analysis

~~For name tagging, recommend the use of @ref when the target is web accessible and @key when not.~~ Mdalmau
~~Matt has provided an @key example; some possible debate about where the explanation appears in the Header~~ Mdalmau
Moved Name Tagging recommendations from bullet point under General Recommendations to its own section. The explanation and examples are too dense for a bullet point. We may need to find a better place for this. Mdalmau

Level 4 Letters

This text conflates personal letters (epistles) with situations where one text is embedded within another. These topics should be dealt with separately since there are encoding projects that encode personal correspondence only (where letters are not embedded within other texts), and there are cases of non-epistles included within frame texts. (Kshawkin)
- I see your point, Kevin. I will try to find a more clear-cut example, but, honestly, I would stay with the one we have :-) Natasha

Level 4 Drama

It says, "Stage directions are encoded as <stage> and enclose block level content describing scenery, etc." This is unclear: should block-level content describing the scenery etc. be "enclosed" in <stage> tags?

Level 4 Verse

<l rend="">: Matt added a note in the GBP, which I've moved here: "note: I really don't think we want to keep something like this in here; what do others recommend as far as methods for encoding typographically explicit verse breaks, etc.?"

- Also, what sorts of values should be used on rend=? (Kshawkin)

LEVEL 5: Scholarly Encoding Projects

Need examples. Syd was going to make scholarly the level 4 examples. Any updates? Mdalmau

Syd, what do you think about my comments re your WWP example? It can be considered level 5. Natasha

I've added two verse examples, neither of which inspires much confidence. I wouldn't be hurt if someone were to replace these with other better (e.g. WWP) examples. Gworthey

General Guidelines for Attribute Usage

It says, "For instance, if all text is encoded as <hi> is defined as being rendered in italics, there is no reason to encode text as <hi rend="slant(italics)">" . We need to say where this would be defined, and by whom. But we should also discuss the question of normalization in rendering. If a project wants to encode all monograph titles as italics regardless of how they were displayed in the source document, then there's no need to use rend=. But if the project wants to transcribe the source document faithfully, including all inconsistencies, then rend= should always be used. (Kshawkin)

Acknowledgements

Appendix A: History of this Document

Why aren't we using the new rendition feature in P5? Mdalmau

@@ Line 101: / Line 101: @@
 * Need examples.  Syd was going to make scholarly the level 4 examples.  Any updates? [[User:Mdalmau|Mdalmau]]
 Syd, what do you think about my comments re your WWP example? It can be considered level 5. [[User:Natasha|Natasha]]
+* I've added two verse examples, neither of which inspires much confidence.  I wouldn't be hurt if someone were to replace these with other better (e.g. WWP) examples.  [[User:Gworthey|Gworthey]]
 ==General Guidelines for Attribute Usage==

Difference between revisions of "BPG Feedback on Complete Draft"

Revision as of 18:29, 14 April 2009

Contents

Introduction

General Recommendations

Structure of a TEI Document

The TEI Header

Linking between encoded text and images of source documents

LEVEL 1: Fully Automated Conversion and Encoding

LEVEL 2: Minimal Encoding

LEVEL 3: Simple Analysis

General Level 3 Recommendations

Level 3 Examples

Basic Structure: Verse

LEVEL 4: Basic Content Analysis

Level 4 Letters

Level 4 Drama

Level 4 Verse

LEVEL 5: Scholarly Encoding Projects

General Guidelines for Attribute Usage

Acknowledgements

Appendix A: History of this Document

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools