A case study: the Lampeter corpus


See http://www.tu-chemnitz.de/phil/english/real/lampeter/lamphome.htm (or look in the Oxford Text Archive)

  • Fairly typical requirements for language corpora
    • light presentational tagging
    • structural markup for access
    • demographic information about text production
    • small number of tags to ease data capture and validation
  • Implementation
    • tagsets: prose base, and tags from four additional sets
    • some extensions, many exclusions

28 Next | First| Previous TEI and XML: a marriage made in heaven?