A VERY short introduction to document analysis.
STEP 1. Know everything. Understand everything ...
STEP 2. Give up on that. Acknowledge your limitations.
There is an Infinite and Wonderful Variety in books.
Equally infinite choices in what to markup. Markup is always interpretation.
- You can markup what is there.
- You can markup what you choose to find there
- You can markup what is only implicitly there (tagging silences)
One can try to distinguish between categories of features (but they are interrelated)
- Linguistic features
- Presentational features
- Structural/rhetorical features
- Artefacts of print
Example: marking passages you like (no visual cue):
Mostly you will be responding to obvious physical cues. Asking "what is this thing?" "What is it here for?" "How does it relate to the other things here?" Visual cues aren't everything, they can be misleading, but they are a start. Especially if you are writing instructions for someone else to recognize features.
Leverage your knowledge. It helps if you know something about...
- print conventions,
- subject matter
- the work itself
But expect to be ignorant at least sometimes. Use what you know and allow for incomplete tagging
These can still be marked up, despite being (partly) unfamiliar:
- Polish Polish
- BCP BCP
- Cernyweg Cernyweg
- Concordance 1 Concordance 1
- Concordance 2 Concordance 2
- Cornish Cornish
On the other hand, sometimes one is left genuinely at a loss. In my case, here:
An easy sample
SO: pick a couple of samples and look at them: