Tagging Levels

Explanation of Tagging Level Codes

In order to facilitate structured searching the text was marked up (or tagged) in XML. The particular focus of the project is on names, but we have also marked up occupations, places, and dates. Owing to the time-consuming nature of the process, however, the markup is not comprehensive. At different phases of the project the level of thoroughness of the markup varied.

The table below sets out the instructions given to the data developers for checking and tagging the XML files following automated markup. By consulting the tagging level of a particular document, it is possible to determine how intensive the markup for that document was.

Code

Instructions

(All)

  1. Carry out XML validation
  2. Insert standardised date format for tagged dates

A

  1. Check automatically tagged information for errors
  2. Look for information missed by the automated tagging and insert tags:
    • Complete person names (ie, both given name and surname are present)
    • If the automated tagging has been unable to identify the gender of a tagged name, correct this manually (tag as "unknown" if it is indeterminable)
    • All place names
    • All occupational/status information
    • Tag all complete dates (day/month/year are present)
  3. Aa link tagged names to occupation and place information (small number of documents)

B

Changes:

  1. Information missed by the automated tagging:
    • Places and occupation/status information: tag only when clearly related to tagged person names
    • Tag document creation dates only

C

Changes:

  1. Do not check automatically tagged information for errors
  2. Information missed by the automated tagging:
    • Do not look for missed places and occupation/status information
    • Tag person names missed by the automated tagging if noticed during the validation process, but don't specifically look for them
    • Tag minimal key document creation dates only

D

Changes:

  1. Information missed by the automated tagging:
    • Do not attempt to determine gender of names that the automated tagging could not identify (tag all as 'unknown')

E

Changes:

  • Carry out XML Validation and date standardisation only

Summary By Archive

This table gives an overview of tagging levels for each archive included in the site. Tagging codes at the level of individual documents are detailed on the relevant document description pages.

Archive
Code

Document Type Code

Tagging
Code

BA

EP

A

 

AP | IA | MV | RA | RC

C

 

AC | AO | MO | RW

C-D

 

LP

D

BR

IA | MB | MG | PM | RA

B

CC

MC

D

CD

EP

A-B

 

AP | BC | BN

B-C

 

BE | LP | LW | MO | OO | PA | PM | RC | RD | RP | RR

C

 

AO | MV | RV

C-D

CL

IC

D

CO

IC

E

CW

IC

B

DB

BC

C

 

PM

C-D

 

AC | IW | MO | MV | PP | PR | RI | WB

D

HO

CR

D

OB

PS

B-D

SL

PS

B

SM

PS

A-B

 

GO

E

TH

LB | LT | MG

B

 

MC

B-C

 

RH

D

WJ

PS

E